top of page

2024   2023   2022   2021   2020   2019   2018   2017   2016 and before

Selected highlights

Generative AI for designing and validating easily synthesizable and structurally novel antibiotics. [PDF]

Kyle Swanson, Gary Liu, Denise Catacutan, Autumn Arnold, James Zou*, Jon Stokes*.  

Nature Machine Intelligence (2024).

Principled and interpretable alignability testing and integration of single-cell data. [PDF]

Rong Ma, Eric Sun, David Donoho, James Zou.  

Proceedings of the National Academy of Sciences (2024).

TISSUE: uncertainty-calibrated prediction of single-cell spatial transcriptomics improves downstream analyses. [PDF]

Eric Sun, Rong Ma, Paloma Negredo, Anne Brunet, James Zou.  

Nature Methods (2024).

A visual–language foundation model for pathology image analysis using medical Twitter. [PDF

Zhi Huang, Federico Bianchi, Mert Yuksekgonul, Tom Montine, James Zou

Nature Medicine (2023). Cover article.

Blinded, randomized trial of sonographer versus AI cardiac function assessment. [PDF

Bryan He, Alan Kwan, Jae Cho, Neal Yuan, C. Pollick, T. Shiota, J. Ebinger, N. Bello, J. Wei, K. Josan, G. Duffy, M. Jujjavarapu, R. Siegel, Susan Cheng*, James Zou*, David Ouyang*. 

Nature (2023).

From patterns to patients: Advances in clinical machine learning for cancer diagnosis, prognosis, and treatment. [PDF

Kyle Swanson, Eric Wu, Angela Zhang, Ash Alizadeh, James Zou

Cell (2023).

Graph deep learning for the characterization of tumour microenvironments from spatial protein profiles in tissue specimens. [PDF

Zhenqin Wu, Alex Trevino, Eric Wu, Kyle Swanson, Honesty Kim, Blaize D’Angio, Ryan Preska, Greg Charville, Piero Dalerba, Ann Egloff, R. Uppaluri, U. Duvvuri, Aaron Mayer, James Zou

Nature Biomedical Engineering (2022).

Systematic pan-cancer analysis of mutation-treatment interactions using large real-world clinicogenomics data. [PDF

Ruishan Liu, Shemra Rizzo, Sarah Waliany, Marius Garmhausen, Navdeep Pal, Zhi Huang, Nayan Chaudhary, Lisa Wang, Chris Harbron, Joel Neal, Ryan Copping, James Zou

Nature Medicine (2022).

Evaluating eligibility criteria of oncology trials using real-world data and AI. [PDF] [news​] [news​] [news]

Ruishan Liu, Shemra Rizzo, Sam Whipple, Navdeep Pal, Arturo Pineda, Michael Lu, Brandon Arnieri, Ying Lu, William Copra, Ryan Copping, James Zou

Nature (2021). Finalist for Global Pharma Award 2021; Top 10 Clinical Research Achievement

How medical AI devices are evaluated: limitations and recommendations from an analysis of FDA approvals. [PDF] [website] [news]

Eric Wu, Kevin Wu, Roxana Daneshjou, David Ouyang, Daniel Ho, James Zou

Nature Medicine  (2021).

Video-based AI for beat-to-beat assessment of cardiac function. [PDF

David Ouyang, Bryan He, Amirata Ghorbani, N. Yuan, J. Ebinger, C. Langlotz, P. Heidenrich, R. Harrington, D. Liang, E. Ashley, James Zou

Nature (2020).

Integrating spatial gene expression and breast tumour morphology with deep learning[PDF

Bryan He, Ludvig Bergenstrahle, Linnea Stenbeck, Abu Abid, Alma Andersson, Ake Borg, Jonas Maaskola, Joakim Lundeberg, James Zou.

Nature Biomedical Engineering (2020).

FrugalML: how to use ML prediction APIs more accurately and cheaply. [PDF

Lingjiao Chen, Matei Zaharia, James Zou

NeurIPS (2020). Selected for oral presentation (top 1% of submissions).

How much does your data exploration overfit? Controlling bias via information usage. [arXiv

Daniel Russo, James Zou

IEEE Transactions on Information Theory (2019). 

 

Large dataset enables prediction of repair after CRISPR-Cas9 editing in primary T cells. [arXiv

Ryan Leenay, Amirali Aghazadeh, Joseph Hiatt, David Tse, T. Roth, R. Apathy, E. Shifrut, J. Hulquist, N. Krogan, Z. Wu, G. Carolina, H. Canaj, M. Leonetti, Alex Marson, Andrew May, James Zou

Nature Biotechnology (2019).

 

AdaFDR: a Fast, Powerful and Covariate-Adaptive Approach for Multiple Hypothesis Testing.[arXiv

Martin Zhang, Fei Xia, James Zou

Nature Communications (2019). RECOMB Best Paper Award.

Data Shapley: Equitable Data Valuation for Machine Learning. [arXiv

Amirata Ghorbani, James Zou

ICML (2019). 

Design AI so that it's fair. [PDF

James Zou and Londa Schiebinger. 

Nature (2018).

Word embeddings quantify 100 years of gender and ethnic stereotypes. [PDF

Nikhil Garg, Londa Schiebinger, Dan Jurafsky, James Zou

Proceedings of the National Academy of Sciences (2018).

2024

ChatGPT is transforming peer review — how can we use it responsibly? [PDF]

James Zou

Nature (2024).

Are More LLM Calls All You Need? Towards the Scaling Properties of Compound AI Systems. [PDF]

Lingjiao Chen, Jared Quincy Davis, Boris Hanin, Peter Bailis, Ion Stoica, Matei Zaharia, James Zou

NeurIPS (2024).

Stochastic Amortization: A Unified Approach to Accelerate Feature and Data Attribution. [PDF]

Ian Covert, Chanwoo Kim, Su-In Lee*, James Zou*, Tatsunori Hashimoto*. 

NeurIPS (2024).

AvaTaR: Optimizing LLM Agents for Tool-Assisted Knowledge Retrieval. [PDF]

Shirley Wu, Shiyu ZHao, Qian Huang, Kexin Huang, Michihiro Yasunaga, Kaidi Cao, Vassilis Ioannidis, Karthik Subbian, Jure Leskovec, James Zou

NeurIPS (2024).

GraphMETRO: Mitigating Complex Graph Distribution Shifts via Mixture of Aligned Experts. [PDF]

Shirley Wu, Kaidi Cao, Bruno Ribeiro, James Zou*, Jure Leskovec*.

NeurIPS (2024).

Accelerating Transformers with Spectrum-Preserving Token Merging. [PDF]

Hoai-Chau Tran, Duy M. H. Nguyen, Duy M. Nguyen, Trung-Tin Nguyen, Ngan Le, Pengtao Xie, Daniel Sonntag, James Zou, Binh T. Nguyen, Mathias Niepert.

NeurIPS (2024).

Enhancing Large Vision Language Models with Self-Training on Image Comprehension. [PDF]

Yihe Deng, Pan Lu, Fan Yin, Ziniu Hu, Sheng Shen, James Zou, Kai-Wei Chang, Wei Wan.

NeurIPS (2024).

TFG: Unified Training-Free Guidance for Diffusion Models. [PDF]

Haotian Ye, Haowei Lin, Jiaqi Han, Minkai Xu, Sheng Liu, Yitao Liang, Jianzhu Ma, James Zou, Stefano Ermon.

NeurIPS (2024).

ClashEval: Quantifying the tug-of-war between an LLM’s internal prior and external evidence. [PDF]

Kevin Wu, Eric Wu, James Zou.

NeurIPS Datasets and Benchmarks (2024).

UniTox: Leveraging LLMs to Curate a Unified Dataset of Drug-Induced Toxicity from FDA Labels. [PDF]

Jake Silberg, Kyle Swanson, Elana Simon, Angela Zhang, Zaniar Ghazizadeh, Scott Ogden, Hisham Hamadeh, James Zou.

NeurIPS Datasets and Benchmarks (2024).

CARES: A Comprehensive Benchmark of Trustworthiness in Medical Vision Language Models. [PDF]

Peng Xia, et al, James Zou, Huaxiu Yao.

NeurIPS Datasets and Benchmarks (2024).

STaRK: Benchmarking LLM Retrieval on Textual and Relational Knowledge Bases. [PDF]

Shirley Wu, Shiyu Zhao, Michihiro Yasunaga, Kexin Huang, Kaidi Cao, Qian Huang, Vassilis N. Ioannidis, Karthik Subbian, James Zou*, Jure Leskovec*.

NeurIPS Datasets and Benchmarks (2024).

Language models for biological research: a primer. [PDF]

Elana Simons, Kyle Swanson, James Zou.

Nature Methods (2024).

Discovery and generalization of tissue structures from spatial omics data. [PDF]

Zhenqin Wu, et al, James Zou*, Aaron Mayer*, Alex Trevino*.

Cell Reports Methods (2024).

ADMET-AI: a machine learning ADMET platform for evaluation of large-scale chemical libraries. [PDF]

Kyle Swanson, Parker Walther, Jeremy Leitz, Souhrid Mukherjee, Joe Wu, Rabin Shinaraine, James Zou.

Bioinformatics (2024).

A generalist vision-language foundation model for diverse biomedical tasks. [PDF]

Kai Zhang et al. 

Nature Medicine (2024).

Can Large Language Models Provide Useful Feedback on Research Papers? A Large-Scale Empirical Analysis. [PDF]

Weixin Liang, Yuhui Zhang, Hancheng Cao, et al. James Zou.

NEJM AI  (2024).

SPRITE: improving spatial gene expression imputation with gene and cell networks. [PDF]

Eric Sun, Rong Ma, James Zou.

Bioinformatics (ISMB 2024).

Regulating AI Adaptation: An Analysis of AI Medical Device Updates. [PDF]

Kevin Wu, Eric Wu, Kit Rodolfa, Dan Ho, James Zou.

Conference on Health, Inference and Learning  (CHIL 2024).

Model ChangeLists: Characterizing Updates to ML Models. [PDF]

Sabri Eyuboglu, Karan Goel, Arjun Desai, Lingjiao Chen, Mathew Monfort, Chris Re, James Zou.

ACM FAccT (2024).

A pathologist–AI collaboration framework for enhancing diagnostic accuracies  and efficiencies. [PDF]

Zhi Huang, et al, Tom Montine, James Zou.

Nature Biomedical Engineering (2024).

Systematic analysis of 32,111 AI model cards characterizes documentation practice in AI. [PDF]

Weixin Liang, Nazneen Rajani, Xinyu Yang, Ezi Ozoani, Eric Wu, Yiqun Chen, Daniel Smith, James Zou.

Nature Machine Intelligence (2024).

How is ChatGPT's behavior changing over time? [PDF]

Lingjiao Chen, Matei Zaharia, James Zou.

Harvard Data Science Review (2024).

Assessing the Impact of ChatGPT in AI Conference Peer Reviews. [PDF]

Weixin Liang, Zach Izzo, Yaohui Zhang, Haley Lepp, Hancheng Cao, Xuandong Zhao, Lingjiao Chen, Haotian Ye, Sheng Liu, Zhi Huang, Dan McFarland, James Zou.

ICML (2024).

How Well Can LLMs Negotiate? NegotiationArena Platform and Analysis. [PDF]

Federico Bianchi, Patrick John Chia, Mert Yuksekgonul, Jacopo Tagliabue, Dan Jurafsky, James Zou.

ICML (2024).

In-context Vectors: Making In Context Learning More Effective and Controllable Through Latent Space Steering. [PDF]

Sheng Liu, Haotian He, Lei Xing, James Zou.

ICML (2024).

Scaling Laws for the Value of Individual Data Points in Machine Learning.

Ian Covert, Wenlong Ji, Tatsunori Hashimoto, James Zou.

ICML (2024).

ArtWhisperer: A Dataset for Characterizing Human-AI Interactions in Artistic Creations. [PDF]

Kailas Vodrahalli, James Zou.

ICML (2024).

SleepFM: Multi-modal Representation Learning for Sleep across ECG, EEG and Respiratory Signals.

Rahul Thapa, Bryan He, Magnus Kjaer, Hyatt Moore, Gauri Ganjoo, Emmanuel Mignot, James Zou

ICML (2024).

Learning and Forgetting Unsafe Examples in Large Language Models. [PDF]

Jiachen Zhao, Zhun Deng, David Madras, James Zou, Mengye Ren

ICML (2024).

Simple linear attention language models balance the recall-throughput tradeoff. [PDF]

Sabri Eyuboglu, Simran Arora, Michael Zhang, Aman Timalsina, Silas Alberti, James Zou, Atri Rudra, Christ Re.

ICML (2024).

Selecting Large Language Model to Fine-tune via Rectified Scaling Law. [PDF]

Haowei Lin, Baizhou Huang, Haotian Ye, Qinyu Chen, Zihao Wang, Sujian Li, Jianzhu Ma, Xiaojun Wan, James Zou, Yitao Liang.  

ICML (2024).

Rethinking Data Shapley for Data Selection Tasks: Misleads and Merits.

Jiachen Wang, Tianji Yang, James Zou, Yongchan Kwon, Ruoxi Jia.

ICML (2024).

Prospector Heads: Generalized Feature Attribution for Large Models & Data. [PDF]

Gautam Machiraju, et al.  

ICML (2024).

TrustLLM: Trustworthiness in Large Language Models. [PDF]

Yue Huang, et al.  

ICML (2024).

Scaling adoption of medical AI—reimbursement from value-based care and fee-for-service perspectives. [PDF]

Michael Abramoff, Tinglong Dai, James Zou.  

New England Journal of Medicine AI (2024).

Provable Membership Inference Privacy. [PDF]

Zach Izzo, Jinsung Yoon, Sercan Arik, James Zou.  

Transactions on ML Research (2024).

New Evaluation Metrics Capture Quality Degradation due to LLM Watermarking. [PDF]

Karan Singh, James Zou.  

Transactions on ML Research (2024).

Generative AI for designing and validating easily synthesizable and structurally novel antibiotics. [PDF]

Kyle Swanson, Gary Liu, Denise Catacutan, Autumn Arnold, James Zou*, Jon Stokes*.  

Nature Machine Intelligence (2024).

Bridging the literacy gap for surgical consents: an AI-human expert collaborative approach. [PDF]

Rohaid Ali, Ian Connolly, Oliver Tang, et al, James Zou, Curtis Doberstein  

npj Digital Medicine (2024).

Principled and interpretable alignability testing and integration of single-cell data. [PDF]

Rong Ma, Eric Sun, David Donoho, James Zou.  

Proceedings of the National Academy of Sciences (2024).

Systematic analysis of off-label and off-guideline cancer therapy usage in a real-world cohort of 165,912 US patients. [PDF]

Ruishan Liu, Lisa Wang, Shemra Rizzo, Marius Garmhausen, Navdeep Pal, Sarah Waliany, Sarah McGough, Yvonne Lin, Zhi Huang, Joel Neal, Ryan Copping, James Zou.  

Cell Reports Medicine (2024).

TISSUE: uncertainty-calibrated prediction of single-cell spatial transcriptomics improves downstream analyses. [PDF]

Eric Sun, Rong Ma, Paloma Negredo, Anne Brunet, James Zou.  

Nature Methods (2024).

What Should Data Science Education Do With Large Language Models? [PDF]

Xinming Tu, James Zou, Weijie Su, Linjun Zhang.  

Harvard Data Science Review (2024).

Protein structure generation via folding diffusion. [PDF]

Kevin Wu, Kevin Yang, Rianne van den Berg, Sarah Alamdari, James Zou, Alex Lu, Ava Amini.  

Nature Communications (2024).

DataInf: Efficiently Estimating Data Influence in LoRA-tuned LLMs and Diffusion Models. [PDF]

Yongchan Kwon, Eric Wu, Kevin Wu, James Zou.  

International Conference on Learning Representations (ICLR 2024).

Safety-Tuned LLaMAs: Lessons From Improving the Safety of Large Language Models that Follow Instructions. [PDF]

Fede Bianchi, Mirac Suzgun, Giuseppe Attanasio, Paul Rottger, Dan Jurafsy, Tatsu Hashimoto, James Zou

International Conference on Learning Representations (ICLR 2024).

Navigating dataset documentations in AI: a large-scale analysis of dataset cards on Hugging Face. [PDF]

Xinyu Yang, Weixin Liang, James Zou.  

International Conference on Learning Representations (ICLR 2024).

Zoology: Measuring and Improving Recall in Efficient Language Models. [PDF]

Simran Arora, Sabri Eyuboglu, Aman Timalsina, I. Johnson, M. Poli, James Zou, Atri Rudra, Chris Re  

International Conference on Learning Representations (ICLR 2024).

Using ChatGPT to facilitate truly informed medical consent. [PDF

Fatima Mirza, Oliver Tang, Ian Connolly, et al., James Zou, Rohaid Ali. 

New England Journal of Medicine AI (2024).

The power of contrast for feature learning: a theoretical analysis. [PDF

Wenlong Ji, Zhun Deng, Ryumei Nakada, James Zou, Linjun Zhang. 

Journal of Machine Learning Research (2024).

VetLLM: large language model for predicting diagnosis from veterinary notes. [PDF

Yixing Jiang, Jeremy Irvin, Andrew Ng, James Zou

Proceedings of the Pacific Symposium on Biocomputing (PSB 2024).

PEPSI: polarity measurements from spatial proteomics imaging suggest immune cell engagement. [PDF

Eric Wu, Michael Wu, Aaron Mayer, Alex Trevino, James Zou

Proceedings of the Pacific Symposium on Biocomputing (PSB 2024).

2023

Characterizing the Clinical Adoption of Medical AI Devices through U.S. Insurance Claims. [PDF]

Kevin Wu, Eric Wu, et al, James Zou

New England Journal of Medicine AI (2023).

TWIGMA: A dataset of AI-Generated Images with Metadata From Twitter. [PDF]

Yiqun Chen, James Zou

NeurIPS Datasets and Benchmarks Track (2023).

Beyond Confidence: Reliable Models Should Also Consider Atypicality. [PDF]

Mert Yuksekgonul, Linjun Zhang, James Zou*, Carlos Guestrin*. 

NeurIPS (2023).

OpenDataVal: a Unified Benchmark for Data Valuation. [PDF]

Kevin Jiang, Weixin Liang, James Zou, Yongchan Kwon. 

NeurIPS Datasets and Benchmarks Track (2023).

Factorized Contrastive Learning: Going Beyond Multi-view Redundancy. [PDF]

Paul Liang, Zihao Deng, Martin Ma, James Zou, Louis-Philippe Morency, Ruslan Salakhutdinov. 

NeurIPS (2023).

DataPerf: Benchmarks for Data-Centric AI Development. [PDF]

DataPerf team. 

NeurIPS Datasets and Benchmarks Track (2023).

Improving genetic risk prediction across diverse population by disentangling ancestry representations. [PDF]

Prashnna Gyawali, Yann Le Guen, Xiaoxia Liu, Michael Belloy, Hua Tang, James Zou*, Zhihuai He*. 

Communications Biology (2023).

A clinically applicable AI system for diagnosis of congenital heart diseases based on computed tomography images. [PDF]

Xiaowei Xu, Qianjun Jia, Haiyun Yuan, Hailong Qiu, Yuhao Dong, Wen Xie, Zeyang Yao, Jiawei Zhang, Zhiqaing Nie, Xiaomeng Li, Yiyu Shi, James Zou*, Meiping Huang*, Jian Zhuang*. 

Medical Image Analysis (2023).

A visual–language foundation model for pathology image analysis using medical Twitter. [PDF

Zhi Huang, Federico Bianchi, Mert Yuksekgonul, Tom Montine, James Zou

Nature Medicine (2023). Cover article.

Implications of predicting race variables from medical images. [PDF]

James Zou, Judy Gichoya, Daniel Ho, Ziad Obermeyer.  

Science (2023).

A deep learning-based electrocardiogram risk score for long term cardiovascular death and disease. [PDF

Weston Hughes, et al, David Ouyang*, Euan Ashley*, James Zou*, Marco Perez*. 

npj Digital Medicine (2023).

Skin Tone Analysis for Representation in Educational Materials (STAR-ED) using machine learning. [PDF

Girmaw Tadesse, Celia Cintas, Kush Varshney, et al., James Zou*, Roxana Daneshjou*. 

npj Digital Medicine (2023).

GPT detectors are biased against non-native English writers. [PDF]

Weixin Liang, Mert Yuksekgonul, Yining Mao, Eric Wu, James Zou

Patterns (2023).

Beyond Positive Scaling: How Negation Impacts Scaling Trends of Language Models. [PDF]

Yuhui Zhang, Michi Yasunaga, Zhengping Zhou, Z. HaoChen, James Zou, Percy Liang Serena Yeung. 

ACL Findings (ACL 2023).

Machine learning modeling of RNA structures: methods, challenges and future perspectives. [PDF]

Kevin Wu, James Zou*, Howard Chang*. 

Briefings in Bioinformatics (2023).

Deep learning-based electrocardiographic screening for chronic kidney disease. [PDF]

Lauri Holmstrom, Matthew Christensen, Neal Yuan, Weston Hughes, John Theurer, M. Jujjavarapu, P. Fatehi, A. Kwan, R. Sandhu, J. Ebinger, S. Cheng, James Zou, Sumeet Chugh, David Ouyang. 

Communications Medicine (2023).

7-UP: generating in silico CODEX from a small set of immunofluorescence markers. [PDF]

Eric Wu, Alex Trevino, et al, Aaron Mayer, James Zou

PNAS Nexus (2023).

Who counts as an inventor? Seniority and gender in 430,000 biomedical inventor–researcher teams. [PDF]

Anoop Manjunath, Nathan Kahrobai, Jaya Manjunath, Angelina Seffens, Arya Gowda, Rohaan Umbarkar, Esha Umbarkar, James Zou*, Ishan Kumar*. 

Nature Biotechnology (2023).

Brain proteomic analysis implicates actin filament processes and injury response in resilience to Alzheimer’s disease. [PDF]

Zhi Huang, Gennifer Merrihew, Eric Larson, Jea Park, Deanna Plubell, Eddie Fox, Kathy Montine, Caitlin Latimer, C. Keene, James Zou*, Mike MacCoss*, Tom Montine*. 

Nature Communications (2023).

Leveraging Physiology and Artificial Intelligence to Deliver Advancements in Health Care. [PDF]

Angela Zhang, Zhenqin Wu, Eric Wu, Matthew Wu, Michael Snyder, James Zou, Joe Wu. 

Physiological Review (2023).

Data-OOB: Out-of-bag Estimate as a Simple and Efficient Data Value. [PDF]

Yongchan Kwon, James Zou

International Conference on Machine Learning (ICML 2023).

Discover and Cure: Concept-aware Mitigation of Spurious Correlation. [PDF]

Shirley Wu, Mert Yuksekgonul, Linjun Zhang, James Zou

International Conference on Machine Learning (ICML 2023).

On the nonlinear correlation of ML performance between data subpopulations. [PDF​]

Weixin Liang, Yining Mao, Yongchan Kwon, Xinyu Yang, James Zou

International Conference on Machine Learning (ICML 2023).

Data-Driven Subgroup Discovery for Linear Regression. [PDF​]

Zach Izzo, Ruishan Liu, James Zou

International Conference on Machine Learning (ICML 2023).

Collecting data when missingness is unknown: a method for improving model performance given under-reporting in patient populations. 

Kevin Wu, Dominik Dahlem, Christopher Hane, Eran Halperin, James Zou

Conference on Health, Inference and Learning (CHIL 2023).

Understanding and Predicting the Effect of Environmental Factors on People with Type 2 Diabetes. 

Kailas Vodrahalli, Gregory Lyng, Brian Hill, Kimmo Karkkainen, Jeffrey Hertzberg, James Zou*, Eran Halperin*. 

Conference on Health, Inference and Learning (CHIL 2023).

Easily accessible text-to-image generation amplifies demographic stereotypes at large scale. [PDF

Federico Bianchi, Pratyusha Kalluri, Esin Durmus, Faisal Ladhak, Myra Cheng, Debora Nozza, Tatsunori Hashimoto, Dan Jurafsky*, James Zou*, Aylin Caliskan*. 

ACM Conference on Fairness, Accountability and Transparency (2023).

Blinded, randomized trial of sonographer versus AI cardiac function assessment. [PDF

Bryan He, Alan Kwan, Jae Cho, Neal Yuan, C. Pollick, T. Shiota, J. Ebinger, N. Bello, J. Wei, K. Josan, G. Duffy, M. Jujjavarapu, R. Siegel, Susan Cheng*, James Zou*, David Ouyang*. 

Nature (2023).

From patterns to patients: Advances in clinical machine learning for cancer diagnosis, prognosis, and treatment. [PDF

Kyle Swanson, Eric Wu, Angela Zhang, Ash Alizadeh, James Zou

Cell (2023).

AI-enabled assessment of cardiac function and video quality in emergency department point-of-care echocardiograms. [PDF

Bryan He, Dev Dash, Youyou Duanmu, Ting Xu Tan, David Ouyang, James Zou

Journal of Emergency Medicine (2023).

Development and clinical evaluation of an AI support tool for improving telemedicine photo quality. [PDF

Kailas Vodrahalli, Justin Ko, Albert Chiou, Rob Novoa, Abu Abid, Michelle Phung, Kiana Yekrang, Paige Petrone, James Zou*, Roxana Daneshjou*. 

JAMA Dermatology (2023).

Subcellular omics: a new frontier pushing the limits of resolution, complexity and throughput. [PDF

Jim Eberwine, Junhyong Kim, R. Anafi, S. Brem, M. Bucan, S. Fisher, M. Grady, A. Herr, D. Issadore, H. Jeong, H. Kim, D. Lee, S. Rubakhin, J. Sul, J. Sweedler, J. Wolf, K. Zaret, James Zou

Nature Methods (2023).

A spectral method for assessing and combining multiple data visualizations. [PDF]

Rong Ma, Eric Sun, James Zou.  

Nature Communications (2023). 2023 JSM Outstanding Paper Award

Video-Based Deep Learning for Automated Assessment of Left Ventricular Ejection Fraction in Pediatric Patients. [PDF]

Charitha Reddy, Leo Lopez, David Ouyang*, James Zou*, Bryan He*  

Journal of the American Society of Echocardiography (2023).

Post-hoc Concept Bottleneck Models. [PDF]

Mert Yuksekgonul, Maggie Wang, James Zou.  

International Conference on Learning Representations (ICLR 2023). Spotlight

When and why Vision-Language Models behave like Bags-of-Words, and what to do about it? [PDF]

Mert Yuksekgonul, Federico Bianchi, Pratyusha Kalluri, Dan Jurafsky, James Zou.  

International Conference on Learning Representations (ICLR 2023). Oral/top 5% of accepted papers

FIFA: Making Fairness More Generalizable in Classifiers Trained on Imbalanced Data. [PDF]

Zhun Deng, Jiayao Zhang, Linjun Zhang, Ting Ye, Yates Coley, Weijie J Su, James Zou.  

International Conference on Learning Representations (ICLR 2023).

FaiREE: fair classification with finite-sample and distribution-free guarantee. [PDF]

Puheng Li, James Zou, Linjun Zhang.  

International Conference on Learning Representations (ICLR 2023).

DrML: Diagnosing and Rectifying Vision Models using Language. [PDF]

Yuhui Zhang, Jeff HaoChen, Shih-Cheng Huang, Kuan-Chieh Wang, James Zou, Serena Yeung.  

International Conference on Learning Representations (ICLR 2023).

Freeze then Train: Towards Provable Representation Learning under Spurious Correlations and Feature Noise. [PDF

Haotian Ye, James Zou*, Linjun Zhang*.  

International Conference on AI and Statistics (AISTATS 2023). 

Understanding Multimodal Contrastive Learning and Incorporating Unpaired Data. [PDF]

Ryumei Nakada, Halil Gulluk, Zhun Deng, Wenlong Ji, James Zou*, Linjun Zhang*.  

International Conference on AI and Statistics (AISTATS 2023). 

Analyses of canine cancer mutations and treatment outcomes using real-world clinico-genomics data of 2119 dogs. [PDF]

Kevin Wu, Lucas Rodrigues, Gerald Post, Garrett Harvey, Michelle White, Aubrey Miller, Lindsay Lambert, Ben Lewis, Christina Lopes, James Zou

npj Precision Oncology (2023).

Dynamic visualization of high dimensional data. [PDF]

Eric Sun, Rong Ma, James Zou

Nature Computational Science (2023).

2022

Competition over data: how does data purchase affect users? [PDF]

Yongchan Kwon, Tony Ginart, James Zou

Transactions of Machine Learning Research (2022).

Predicting Immune Escape with Pretrained Protein Language Model Embeddings. [PDF]

Kyle Swanson, Howard Chang, James Zou

Machine Learning in Computational Biology (PMLR) (2022).

Ensembling improves stability and power of feature selection for deep learning models. [PDF]

Prashnna Gyawali, Xiaoxia Liu, James Zou*, Zihuai He*.  

Machine Learning in Computational Biology (PMLR) (2022).

Graph deep learning for the characterization of tumour microenvironments from spatial protein profiles in tissue specimens. [PDF

Zhenqin Wu, Alex Trevino, Eric Wu, Kyle Swanson, Honesty Kim, Blaize D’Angio, Ryan Preska, Greg Charville, Piero Dalerba, Ann Egloff, R. Uppaluri, U. Duvvuri, Aaron Mayer, James Zou

Nature Biomedical Engineering (2022).

Mind the Gap: Understanding the Modality Gap in Multi-modal Contrastive Representation Learning. [PDF]

Weixin Liang, Yuhui Zhang, Yongchan Kwon, Serena Yeung, James Zou

NeurIPS (2022).

WeightedSHAP: analyzing and improving Shapley-based feature attributions. [PDF]

Yongchan Kwon and James Zou

NeurIPS (2022).

Uncalibrated Models Can Improve Human-AI Collaboration. [PDF]

Kailas Vodrahalli, Tobi Gerstenberg, James Zou

NeurIPS (2022).

Estimating and Explaining Model Performance When Both Covariates and Labels Shift. [PDF]

Lingjiao Chen, Matei Zaharia, James Zou

NeurIPS (2022).

mixReg: A Simple Way to Improve Generalization in Regression for Deep Neural Networks. [PDF]

Huaxiu Yao, Yiping Wang, Linjun Zhang, James Zou, Chelsea Finn. 

NeurIPS (2022).

SKINCON: A skin disease dataset densely annotated by domain experts for fine-grained debugging and analysis. [PDF]

Roxana Daneshjou, Mert Yuksekgonul, Zhuo Ran Cai, Rob Novoa, James Zou

NeurIPS Datasets and Benchmarks Track (2022).

HAPI: A Large-scale Longitudinal Dataset of Commercial ML API Predictions. [PDF]

Lingjiao Chen, Zhihua Jin, Sabri Eyuboglu, Christopher Re, Matei Zaharia, James Zou

NeurIPS Datasets and Benchmarks Track (2022).

SEAL: Interactive Tool for Systematic Error Analysis and Labeling. [PDF]

Nazneen Rajani, Weixin Liang, Lingjiao Chen, Meg Mitchell, James Zou

EMNLP Demo Track (2022).

Systematic analysis of 50 years of Stanford University technology transfer and commercialization. [PDF]

Weixin Liang, Scott Elrod, Daniel McFarland, James Zou

Patterns (2022).

Artificial Intelligence, machine learning and the changing landscape of molecular biology. [PDF]

James Zou, Hongzhe Li, Sylvia Plevritis 

Journal of Molecular Biology (2022).

Polygenic enrichment distinguishes disease associations of individual cells in single-cell RNA-seq data. [PDF]

Martin Zhang, et al. 

Nature Genetics (2022).

Advances, challenges and opportunities in creating data for trustworthy AI. [PDF]

Weixin Liang, Girmaw Tadesse, Daniel Ho, Fei-Fei Li, Matei Zaharia, Ce Zhang, James Zou

Nature Machine Intelligence (2022).

Disparities in dermatology AI performance on a diverse, curated clinical image set. [PDF

Roxana Daneshjou, Kailas Vodrahalli, et al., James Zou*, Andrew Chiou*. 

Science Advances (2022).     *co-corresponding authors

Systematic pan-cancer analysis of mutation-treatment interactions using large real-world clinicogenomics data. [PDF] [Stanford News]

Ruishan Liu, Shemra Rizzo, Sarah Waliany, Marius Garmhausen, Navdeep Pal, Zhi Huang, Nayan Chaudhary, Lisa Wang, Chris Harbron, Joel Neal, Ryan Copping, James Zou

Nature Medicine (2022).

Shifting machine learning for healthcare from development to deployment and from models to data. [PDF]

Angela Zhang, Lei Xing, James Zou, Joe Wu.

Nature Biomedical Engineering (2022).

Meaningfully debugging model mistakes using conceptual counterfactual explanations. [PDF]

Abu Abid, Mert Yuksekgonul, James Zou.

International Conference on Machine Learning (ICML 2022).

FrugalMCT: Efficient Online ML API Selection for Multi-Label Classification Tasks. [PDF]

Lingjiao Chen, Matei Zaharia, James Zou.

International Conference on Machine Learning (ICML 2022).

When and How Mixup Improves Calibration. [PDF]

Linjun Zhang, Zhun Deng, Kenji Kawaguchi, James Zou.

International Conference on Machine Learning (ICML 2022).

Improving Out-of-Distribution Robustness via Selective Augmentation. [PDF]

Huaxiu Yao, Yu Wang, Sai Li, Linjun Zhang, Weixin Liang, James Zou, Chelsea Finn. 

International Conference on Machine Learning (ICML 2022).

Do Humans Trust Advice More if it Comes from AI? An Analysis of Human-AI Interactions. [PDF]

Kailas Vodrahalli, Roxana Daneshjou, Tobi Gerstenberg, James Zou.

AI, Ethics and Society Conference (AIES 2022).

Clustering Plotted Data by Image Segmentation. [PDF]

Tarek Naous, Srinjay Sarkar, Abubakar Abid, James Zou.

Conference on Computer Vision and Pattern Recognition (CVPR Demo 2022).

A Unified f-divergence Framework Generalizing VAE and GAN. [PDF]

Jaime Roquero, James Zou.

International Symposium on Information Theory (ISIT 2022).

The Genetic Etiology of Periodic Leg Movement in Sleep. [PDF]

Jacob Edelson, et al., James Zou, Emmanuel Mignot. 

Sleep (2022).

Dynamical Systems Model of RNA Velocity Improves Inference of Single-cell Trajectory, Pseudo-time and Gene Regulation. [PDF]

Ruishan Liu, Angela Pisco, Emelie Braun, Sten Linnarsson, James Zou.  

Journal of Molecular Biology (2022).

Machine Learning Prediction of Clinical Trial Operational Efficiency. [PDF]

Kevin Wu, Eric Wu, M. DAndrea, N. Chitale, M. Lim, M. Dabrowski, K. Kantor, H. Rangi, R. Liu, M. Garmhausen, N. Pal, C. Harbron, S. Rizzo, R. Copping, James Zou.  

Journal of the American Association of Pharmaceutical Scientists (2022).

Assessment of COVID-19 data reporting in 100+ websites and apps in India. [PDF]

Varun Vasudevan, Abeynaya Gnanasekaran, B. Bansal, C. Lahariya, G. ParameswaranJames Zou.  

PLoS Global Health (2022).

Classification and clustering of RNA crosslink-ligation data reveal complex structures and homodimers. [PDF]

Minjie Zhang, Irena Hwang, Kongpan Li, J. Bai, J. Chen, Tsachy Weisman, James Zou, Zhipeng Lu 

Genome Research (2022).

AI-enabled in silico immunohistochemical characterization for Alzheimer’s disease. [PDF]

Bryan He, Syed Bukhari, Edward Fox, Abu Abid, Jeanne Shen, Claudia Kawas, Maria Corrada, Tom Montine, James Zou.  

Cell Reports Methods (2022).

DynaMorph: self-supervised learning of morphodynamic states of live cells. [PDF]

Michael Wu, B. Chhun, G. Popova, S. Guo, C. Kim, L. Yeh, T. Nowakowski*, James Zou*, S. Mehta*.   

Molecular Biology of the Cell (2022).          *co-corresponding authors

How did the model change? Efficiently assessing machine learning API shifts. [PDF]

Lingjiao Chen, Matei Zaharia, James Zou.  

International Conference on Learning Representations (ICLR 2022).

MetaShift: a dataset of datasets for evaluating contextual distribution shifts and training conflicts. [PDF]

Weixin Liang, James Zou.  

International Conference on Learning Representations (ICLR 2022). 

Domino: discovering systematic errors with cross-modal embeddings. [PDF]

S. Eyuboglu, M. Varma, K. Saab, J. Delbrouck, C. Lee-Messer, J. Dunnmon, James Zou, S. C. Re.  

International Conference on Learning Representations (ICLR 2022). Oral 

Beta Shapley: a Unified and Noise-reduced Data Valuation Framework for Machine Learning. [PDF]

Yongchan Kwon, James Zou.  

International Conference on AI and Statistics (AISTATS 2022). Oral (top 3% of submissions)

MLDemon: Deployment Monitoring for Machine Learning Systems. [PDF]

Tony Ginart, Martin Zhang, James Zou.  

International Conference on AI and Statistics (AISTATS 2022). 

How to Learn when Data Gradually Reacts to Your Model. [PDF]

Zach Izzo, James Zou, Lexing Ying.  

International Conference on AI and Statistics (AISTATS 2022).

Diversifying history: A large-scale analysis of changes in researcher demographics and scholarly agendas. [PDF]

Stephen Risi, Mathias Nielsen, Emma Kerr, Emer Brady, Lanu Kim, Dan McFarland, Dan Jurafsky, James Zou, Londa Schiebinger.  

PLOS One (2022). 

CloudPred: Predicting Patient Phenotypes From Single-cell RNA-seq. [PDF]

Bryan HeMatthew ThomsonMeena SubramaniamRichard Perez, Jimmie YeJames Zou

Pacific Symposium on Biocomputating (2022).

Predicting Visuo-Motor Diseases From Eye Tracking Data. [PDF]

Kailas Vodrahalli, Maciej Filipkowski, Tiffany ChenJames Zou*, Joyce Liao* 

Pacific Symposium on Biocomputating (2022).    *Corresponding author

Prostate cancer therapy personalization via multi-modal deep learning on randomized phase III clinical trials. [PDF]

Esteva et al.  

npj Digital Medicine (2022). 

 

Artificial Intelligence for Retrosynthesis Prediction. [PDF]

Jiang et al.  

Engineering (2022). 

2022
2023
2020

2021

Beyond Importance Scores: Interpreting Tabular ML by Visualizing Feature Semantics. [PDF]

Amirata Ghorbani, Dina Berenbaum, Maor Ivgi, Yuval Dafna, James Zou

MDPI Information (2021).

Super-resolved spatial transcriptomics by deep data fusion. [PDF

L. Bergenstrahle, B. He, J. Bergenstrahle, X. Abalo, R. Mirzazadeh, K. Thrane, A. Ji, A. Andersson, L. Larsson, N. Stakenborg, G. Boeckxstaens, P. Khavari, J. Zou, J. Lundeberg, J. Maaskola 

Nature Biotechnology (2021).

Quantification of Gender Bias and Sentiment Toward Political Leaders Over 20 Years of Kenyan News Using Natural Language Processing. [PDF]

Emma Pair, Nikitha Vicas, Ann Weber, V. Meausoone, James Zou, Amos Njuguna, Gary Darmstadt 

Frontiers in Psychology (2021).

Adversarial Training Helps Transfer Learning via Better Representations. [PDF]

Zhun DengLinjun ZhangKailas VodrahalliKenji KawaguchiJames Zou

NeurIPS (2021).

Deep learning evaluation of biomarkers from echocardiogram videos. [PDF] [podcast]

Weston Hughes, N. Yuan, B. Hu, J. Ouyang, J. Ebinger, P. Botting, J. Lee, J. Theurer, J. Tooley, K. Nieman, M. Lungren, D. Liang, I. Schnittger, J. Chen, E. Ashley, S. Cheng, David Ouyang, James Zou

Lancet EbioMedicine (2021).

Lack of Transparency and Potential Bias in Artificial Intelligence Data Sets and Algorithms. [PDF] [news]

Roxana Daneshjou, Mary Smith, Mary Sun, Veronica Rotemberg, James Zou

JAMA Dermatology (2021).

Patient Experience Surveys Reveal Gender‑Biased Descriptions of Their Care Providers. [PDF]

Dylan Haynes, Anu Pampari, C. Topham, K. Schwarzenberger, M. Heath, James Zou*, Teri Greiling*. 

Journal of Medical Systems (2021).    *Corresponding author

Disparity in the quality of COVID-19 data reporting across India. [PDF] [news]

Varun Vasudevan, Abeynaya Gnanasekaran, Varsha Sankar, Siddarth Vasudevan, James Zou

BMC Public Health (2021).

Comprehensive analysis of 2.4 million patent-to-research citations maps the biomedical innovation and translation landscape. [PDF] [news]

Anoop Manjunath, Hongyu Li, Shuchen Song, Zhixing Zhang, Shu Liu, Nathan Kahrobai, Arya Gowda, Angelina Seffens, James Zou*, Ishan Kumar. 

Nature Biotechnology (2021).    *Corresponding author

Large language models associate Muslims with violence. [PDF] [news]

Abubakar Abid, Maheen Farooqi, James Zou

Nature Machine Intelligence (2021).   

How to learn when data reacts to your model: performative gradient descent. [PDF]

Zach Izzo, Lexing Ying, James Zou

ICML (2021).

Improving generalization in meta-learning via task augmentation. [PDF]

Huaxiu Yao, Longkai Huang, Linjun Zhang, Ying Wei, Li Tian, James Zou. Junzhou Huang, Z. Li. 

ICML (2021).

Neural group testing to accelerate deep learning. [PDF]

Weixin Liang, James Zou

International Symposium on Information Theory (ISIT 2021).

Mixed dimension embedding with application to memory-efficient recommendation systems. [PDF]

Tony Ginart, Maxim Naumov, Dheevatsa Mudigere, Jiyan Yang, James Zou

International Symposium on Information Theory (ISIT 2021).

Who's responsible? Jointly quantifying the contribution of the learning algorithm and data. [PDF]

Gal Yona, Amirata Ghorbani, James Zou

AI, Ethics and Society Conference (2021).

Ensuring that biomedical AI benefits diverse populations. [PDF]

James Zou and Londa Schiebinger 

Lancet EBioMedicine (2021).

Evaluating eligibility criteria of oncology trials using real-world data and AI. [PDF] [news​] [news​] [news]

Ruishan Liu, Shemra Rizzo, Sam Whipple, Navdeep Pal, Arturo Pineda, Michael Lu, Brandon Arnieri, Ying Lu, William Copra, Ryan Copping, James Zou

Nature (2021). Finalist for Global Pharma Award 2021

How medical AI devices are evaluated: limitations and recommendations from an analysis of FDA approvals. [PDF] [website] [news]

Eric Wu, Kevin Wu, Roxana Daneshjou, David Ouyang, Daniel Ho, James Zou

Nature Medicine  (2021).

Mouse aging cell atlas analysis reveals global and cell type-specific aging signatures [PDF] [news]

Martin Zhang, Angela Pisco, Spyros Darmanis, James Zou

eLife (2021).

BABEL enables cross-modality translation between multi-omic profiles at single-cell resolution. [PDF

Kevin Wu, Katie Yost, Howard Chang*, James Zou*

Proceedings of the National Academy of Sciences  (2021).

How to evaluate deep learning for cancer diagnostics: factors and recommendations. [PDF

Roxana Daneshjou, Bryan He, David Ouyang, James Zou

BBA Reviews on Cancer (2021).

Competing AI: how does competition feedback affect machine learning. [PDF] [news]

Tony Ginart, Eva Zhang, Yongchan Kwon, James Zou

International Conference on AI and Statistics (AISTATS 2021).

Efficient computation and analysis of distributional Shapley values. [PDF

Yongchan Kwon, Manny Rivas, James Zou

International Conference on AI and Statistics (AISTATS 2021).

Approximate data deletion from machine learning models. [PDF] [news

Zach Izzo, Mary Smart, Kamalika Chaudhuri, James Zou

International Conference on AI and Statistics (AISTATS 2021).

Improving adversarial robustness via unlabeled out-of-domain data. [PDF

Linjun Zhang, Zhun Deng, Amirata Ghorbani, James Zou

International Conference on AI and Statistics (AISTATS 2021). Oral (top 3% of submissions).

How does mixup help with robustness and generalization? [PDF

Linjun Zhang, Zhun Deng, Kenji Kawaguchi, Amirata Ghorbani, James Zou

International Conference on Learning Representations (ICLR 2021). Spotlight 

TrueImage: A Machine Learning Algorithm to Improve the Quality of Telehealth Photos. [PDF] [news]

Kailas Vodrahalli, Roxana Daneshjou, Roberto Novoa, Albert Chiou, Justin Ko, James Zou

Pacific Symposium on Biocomputing (PSB 2021).

Data valuation for medical imaging using Shapley value and application to a large-scale chest X-ray dataset. [PDF

Siyi Tang, Amirata Ghorbani, R. Yamashita, S. Rehman, Jared Dunnmon, James Zou, Daniel Rubin 

Scientific Reports (2021).

2020

Variation in COVID-19 Data Reporting Across India: 6 Months into the Pandemic. [PDF

Varun Vasudevan, Abeynaya Gnanasekaran, Varsha Sankar, Siddarth Vasudevan, James Zou

Journal of the Indian Institute of Science (2020).  

FrugalML: how to use ML prediction APIs more accurately and cheaply. [PDF

Lingjiao Chen, Matei Zaharia, James Zou

NeurIPS (2020). Selected for oral presentation (top 1% of submissions).

Neuron Shapley: discoverying the responsible neurons. [PDF

Amirata Ghorbani, James Zou

NeurIPS (2020). 

MOPO: model based offline policy optimization. [PDF

T. Yu, G. Thomas, L. Yu, S. Ermon, J. Zou, S. Levine, C. Finn, T. Ma. 

NeurIPS (2020). 

ALICE: active learning with contrastive natural language explanations. [PDF

Weixin Liang, James Zou*, Zhou Yu*.

Empirical Methods in Natural Language Processing (EMNLP 2020).

Deep learning for biomedical videos: perspective and recommendations. [PDF

David Ouyang, Zhenqin Wu, Bryan He, James Zou

Artificial Inteliigence in Medicine (2020).

 

Deep profiling of protease substrate specificity enabled by dual random and scanned human proteome substrate phage libraries. [PDF

Jie Zhou, Shantao Li, Kevin Leung, Brian O'Donovan, James Zou, Joe Derisi, Jim Wells. 

PNAS (2020).

A single-cell transcriptomic atlas characterizes aging tissues in the mouse. [PDF

The Tabula Muris Consortium. 

Nature (2020).

Integrating spatial gene expression and breast tumour morphology with deep learning[PDF

Bryan He, Ludvig Bergenstrahle, Linnea Stenbeck, Abu Abid, Alma Andersson, Ake Borg, Jonas Maaskola, Joakim Lundeberg, James Zou.

Nature Biomedical Engineering (2020).

RNA-GPS predicts SARS-CoV-2 RNA residency to host mitochondria and nucleolus[PDF

Kevin Wu, Furqan Fazal, Kevin Parker, James Zou*, Howard Chang*.

Cell Systems (2020).

Deep learning models to detect hidden clinical correlates[PDF

David Ouyang, James Zou.

The Lancet Digital Health (2020).

Association of rapid eye movement sleep with mortality in middled-aged and older adults[PDF

E. Leary, K. Watson, S. Ancoli-Israel, S. Redline, K. Yaffe, L. Ravelo, P. Peppard, J. Zou, S. Goodman, E. Mignot, K. Stone.

JAMA Neurology (2020).

Clinical genetics lacks standard definitions and protocols for the collection and use of diversity measures[PDF

ClinGen Ancestry and Diversity Working Group.

American Journal of Human Genetics (2020).

A distributional framework for data valuation[PDF

Amirata Ghorbani, Michael Kim, James Zou.

International Conference on Machine Learning (ICML 2020).

Predicting target genes of non-coding regulatory variants with ICE[PDF

Michael Wu, Nilah Ioannidis, James Zou.

Bioinformatics (2020).

Beyond user self-reported Likert scale ratings: a comparison model for automatic dialog evaluation[PDF

Weixin Liang, James Zou, Zhou Yu.

Annual Conference of the Association of Computational Linguistics (ACL 2020).

PB-Net: automatic peak integration by sequential deep learning for multiple reaction monitoring[PDF

Zhenqin Wu, Daniel Serie, Gege Xu, James Zou.

Journal of Proteomics (2020).

RNA-GPS predicts RNA subcellular localization and highlights the role of splicing[PDF

Kevin Wu, Kevin Parker, Furqan Fazal, Howard Chang, James Zou.

RNA (2020).

Video-based AI for beat-to-beat assessment of cardiac function. [PDF

David Ouyang, Bryan He, Amirata Ghorbani, N. Yuan, J. Ebinger, C. Langlotz, P. Heidenrich, R. Harrington, D. Liang, E. Ashley, James Zou

Nature (2020).

A benchmark of algorithms for the analysis of pooled CRISPR screens. [PDF
Sunil Bodapati, Tim Daley, Xueqiu Lin, James Zou*, Lei Qi*. 
Genome Biology (2020).

An online platform for interactive feedback in biomedical machine learning. [PDF

Abubakar Abid, Ali Abdalla, Ali Abid, Dawood Khan, Abdulrahman Alfozan, James Zou

Nature Machine Intelligence (2020).

 

Learning transport cost from subset correspondence. [PDF

Ruishan Liu, Akshay Balsubramani, James Zou

International Conference on Learning Representations (ICLR 2020). 

Deep learning interpretation of echocardiograms. [PDF
Amirata Ghorbani, David Ouyang, Abubakar Abid, Bryan He, Jonathan Chen, Robert Harrington, David Liang, Euan Ashley, James Zou
Nature Digital Medicine (2020). 

LitGen: genetic literature recommendation guided by human explanations.  
Allen Nie, et al., James Zou
Pacific Symposium on Biocomputing (PSB 2020). 

 

2019

 

Sex and gender analysis improves science and engineering. [PDF]

Cara Tannenbaum*, Robert Ellis*, Friederike Eyssel*, James Zou*, Londa Schiebinger. 

Nature (2019).   *co-first authors

Making AI forget you: data deletion in machine learning. [arXiv
Tony Ginart, Melody Guan, Greg Valiant, James Zou
NeurIPS (2019). Selected for spotlight talk (top 3% of submissions).

Toward automatic concept based explanations. [PDF]
Amirata Ghorbani, James Wexler, James Zou, Been Kim. 
NeurIPS (2019). 

How much does your data exploration overfit? Controlling bias via information usage. [arXiv
Daniel Russo, James Zou
IEEE Transactions on Information Theory (2019). 

Large dataset enables prediction of repair after CRISPR-Cas9 editing in primary T cells. [arXiv
Ryan Leenay, Amirali Aghazadeh, Joseph Hiatt, David Tse, Theo Roth, Ryan Apathy, Eric Shifrut, Judd Hulquist, N. Krogan, Z. Wu, G. Carolina, H. Canaj, M. Leonetti, Alex Marson, Andrew May, James Zou
Nature Biotechnology (2019).

AdaFDR: a Fast, Powerful and Covariate-Adaptive Approach for Multiple Hypothesis Testing. [arXiv
Martin Zhang, Fei Xia, James Zou
Nature Communications (2019). Preliminary version won the RECOMB Best Paper Award.

VetTag: improving automated veterinary diagnosis coding via large-scale language modeling. [arXiv
Yuhui Zhang, Allen Nie, Ashley Zehnder, Rodney Page, James Zou
Nature Digital Medicine (2019). 

Data Shapley: Equitable Data Valuation for Machine Learning. [arXiv
Amirata Ghorbani, James Zou
ICML (2019). 

Concrete Autoencoders for Differentiable Feature Selection and Reconstruction. [arXiv
Abubakar Abid, Muhammad Balin, James Zou
ICML (2019). 

Adaptive Monte Carlo Multiple Testing via Multi-Armed Bandits. [arXiv
Martin Zhang, James Zou, David Tse. 
ICML (2019). 

Discovering Conditionally Salient Features with Statistical Guarantees
Jaime Roquero, James Zou
ICML (2019). 

Contrastive multivariate singular spectrum analysis. [arXiv
Abdi-Hakin Dirie, Abubakar Abid, James Zou
IEEE Allerton (2019). Preliminary version selected for spotlight at NIPS'18 Spatio-temporal Workshop.

A Knowledge Graph-based Approach for Exploring the U.S. Opioid Epidemic
Maulik Kamdar, Tymor Hamansy, Shea Zhao, Ayin Vala, Tome Eftimov, James Zou, Suzanne Tamang. 
ICLR AI for Social Good (2019). Best Poster Award.

Modeling spatial correlation of transcripts with application to pancreas development. [PDF
Ruishan Liu, Marco Mignardi, Robert Jones, Martin Enge, Seung Kim, Steve Quake, James Zou
Scientific Reports (2019).

Analyzing Polarization in Social Media: Method and Application to Tweets on 21 Mass Shootings
Dora Demszky, Nikhil Garg, Rob Voigt, James Zou, Jesse Shapiro, Matthew Gentzkow, Dan Jurafsky
NAACL (2019). Washington Post coverage

Interpretation of neural network is fragile. [arXiv
Amirata Ghorbani, Abubakar Abid, James Zou
AAAI (2019). Selected for oral presentation.

Improving knockoff stability: simultaneous multiple knockoffs and entropy maximization. [arXiv
Jaime Gimenez, James Zou
AISTATS (2019).

Knockoffs for the mass: new feature importance statistics with false discovery guarantees. [arXiv
Jaime Gimenez, Amirata Ghorbani, James Zou
AISTATS (2019).

Feedback GAN for DNA optimizes protein functions. [PDF
Anvita Gupta, James Zou
Nature Machine Intelligence (2019).

Multiaccuracy: black-box post-processing for fairness in classification. [PDF
Michael Kim, Amirata Ghorbani, James Zou
ACM/AAAI Conference of AI Ethics and Society (2019). 

Contingent Payment Mechanisms for Resource Utilization.
Hongyao Ma, Reshef Meir, David Parkes, James Zou
18th International Conference on Autonomous Agents and Multiagent Systems (AAMAS) (2019).            Finalist for Best Paper Award.

A large CRISPR-induced bystander mutation causes immune dysregulation. [PDF]
Dimitre Simeonov et al. 
Communications Biology (2019). 

2018

DeepTag: inferring diagnoses from veterinary clinical notes. [PDF] [press
Allen Nie, Ashley Zehnder, Rodney Page, Y. Zhang, A. Pineda, M. Rivas, C. Bustamante, James Zou
Nature Digital Medicine (2018).

A primer on deep learning in genomics. [PDF]  
James Zou, Mikael Huss, Abubakar Abid, Pejman Mohammadi, Ali Torkamani, Amalio Telenti. 
Nature Genetics (2018).

Design AI so that it's fair. [PDF
James Zou and Londa Schiebinger. 
Nature (2018).

Autowarp: learning a warping distance from unlabeled time series using sequence autoencoders. [PDF
Abubakar Abid, James Zou
NIPS (2018).

Stochastic EM for shuffled linear regression. [arXiv
Abubakar Abid, James Zou
IEEE Allerton (2018).

Minimizing close-k aggregate loss improves classification. [arXiv
Bryan He, James Zou
Under submission (2018).

Exploring patterns enriched in a dataset with contrastive principal component analysis. [PDF
Abubakar Abid, Vivek Bagaria, Martin Zhang, James Zou
Nature Communications (2018). ICML CompBio Workshop Top Paper Award Winner.

Word embeddings quantify 100 years of gender and ethnic stereotypes. [PDF
Nikhil Garg, Londa Schiebinger, Dan Jurafsky, James Zou
Proceedings of the National Academy of Sciences (2018).

CoVeR: learning covariate-specific vector representations with tensor decompositions. [arXiv
Kevin Tian, Teng Zhang, James Zou
International Conference of Machine Learning (ICML 2018).

Predicting target genes of non-coding regulatory variants with ICE.
Michael Wu, Nilah Ioannidis and James Zou
Under submission (2018).

The clinical imperative for inclusivity: race, ethnicity and ancestry in genomics
Alice Popejoy et al. 
Human Mutation (2018). 

Why adaptively collected data has negative bias and how to correct for it. [arXiv
Xinkun Nie, Xiaoying Tian-Harris, Jonathan Taylor, James Zou
AISTATS 2018. ICML Workshop on Picky Learners Best Paper Award.

Embedding for missingness: deep learning with incomplete data.
Amirata Ghorbani, James Zou
IEEE Allerton (2018).

The effects of memory replay in reinforcement learning. [arXiv
Ruishan Liu and James ZouBest Poster Award at BayLearn.
IEEE Allerton (2018). 

The proteome of malaria plastid organelle, a key anti-parasitic target. [arXiv
Michael J Boucher, Sreejoyee Ghosh, Lichao Zhang, Avantika Lal, Se Won Jang, An Ju, Shuying Zhang, Xinzi Wang, Stuart A Ralph, James Zou, Joshua E Elias, Ellen Yeh. 
PLoS Biology (2018). 

2017

Mutation-convolution-max layers enhance deep learning of DNA motifs
Abubakar Abid, Amirata Ghorbani, James Zou
NIPS Machine Learning for CompBio Workshop (NIPS MLCB 2017). Spotlight paper 

NeuralFDR: learning decision threshold from hypothesis features.
Martin Zhang, Fei Xia, James Zou, David Tse. 
Neural Information Processing Systems (NIPS 2017).

Linear regression with shuffled labels. [arXiv
Abubakar Abid, Ada Poon, James Zou
Submitted 2017. 

Estimating the unseen from multiple populations. [arXiv
Aditi Rangunathan, Greg Valiant, James Zou
International Conference on Machine Learning (ICML 2017). 

Learning latent space models with angular constraints. [arXiv
Pengtao Xie, Yuntian Deng, Yi Zhou, Abhimanu Kumar, Yaoliang Yu, James Zou, Eric Xing. 
International Conference on Machine Learning (ICML 2017). 

Quantifying the accuracy of approximate diffusions and Markov chains. [arXiv]
Jonathan Huggins, James Zou.
AISTATS (2017).
 

Beyond bilingual: multi-sense word embedding using multi-lingual context. [arXiv
Shyam Upadhyay, Kai-Wei Chang, Matt Taddy, Adam Kalai, James Zou.                                                   Representation Learning for NLP (Rep4NLP 2017). Best Paper Award.


Correcting for cell-type heterogeneity in DNA methylation: avoiding statistical flaws.
Elior Rahmani, Noah Zaitlen, Yael Baran, Celeste Eng, Donglei Hu, Joshua Galanter, Sam Oh, Esteban Burchard, Eleazar Eskin, James Zou, Eran Halperin. 
Nature Methods (2017).

2016 and earlier 


Computational biology

 

Quantifying the unobserved protein-coding variants in human populations provides a roadmap for large-scale sequencing projects. [PDF]
James Zou, Greg Valiant, Paul Valiant, Konrad Karczewski, Siu On Chan, Kaitlin Samocha, Monkol Lek, Exome Aggregation Consortium, Shamil Sunyaev, Mark Daly, Daniel MacArthur
Nature Communications (2016).

Analysis of protein-coding genetic variation in 60,706 humans. [arXiv]
Exome Aggregation Consortium.
Nature (2016).

Sparse PCA corrects for cell type heterogeneity in epigenome-wide association studies.
Elior Rahmani, Noah Zaitlen, Yael Baran, Celeste Eng, Donglei Hu, Joshua Galanter, Sam Oh, Esteban Burchard, Eleazar Eskin, James Zou, Eran Halperin.
Nature Methods (2016). 

A genetic and socio-economic study of mate choice in Latinos reveals novel assortment patterns. [PDF] [Press
James Zou, Danny Park, Esteban Burchard, Dara Torgerson, Maria Pino-Yanes, Yun Song, Sriram Sankararaman, Eran Halperin, Noah Zaitlen 
Proceedings of the National Academy of Sciences 112(44):13621-6 (2015). 

Inferring parental genomic ancestries using pooled semi-Markov processes. [PDF]
James Zou, Eran Halperin, Esteban Burchard, Sriram Sankararaman.
Bioinformatics 31(12):i190-6 (2015).

Correcting for sample heterogeneity in epigenome-wide association studies. [PDF]
James Zou.
Methods Mol Biol. (2015).

Undesired usage and the robust self-assembly of heterogeneous structures. [PDF]
Arvind Murugan, James Zou, Michael Brenner.
Nature Communications 11;6:6203 (2015).

Extended fertility and longevity: the genetic and epigenetic link. [PDF]
Kerem Wainer-Katsir, James Zou, Michal Linial.
Fertil Steril. 103(5):1117-24 (2015).

Epigenome-wide association studies without the need for cell-type composition. [PDF]
James Zou, Christoph Lippert, David Heckerman, Martin Aryee, Jennifer Listgarten.
Nature Methods 11(3):309-11 (2014).
Highlight talk at ISMB 2015. 
Highlight talk at RECOMB 2015. 
Platform presentation at the 2013 Wellcome Trust Epigenomics of Common Diseases conference. 
Platform presentation at the 2013 Machine Learning in Computational Biology meeting.
 

Genome-wide chromatin state transitions elicited by developmental and environmental cues. [PDF]
Jiang Zhu, Mazhar Adli, James Zou, et al.
Cell 152(3):642-54 (2013).

Locus-specific chromatin inactivation at endogenous enhancers with programmable TALE-LSD1 fusions. [PDF]
Eric Mendenhall, Kaylyn Williamson, Deepak Reyon, James Zou, et al.
Nature Biotechnology 31(12):1133-6 (2013).

Getting the biggest birch for the bang: restoring and expanding upland birchwoods in the Scottish Highlands by managing red deer. [PDF]
Andrew Tanentzap, James Zou, David Coomes.
Ecology and Evolution 3(7):1890-901 (2013).

Genome-wide analysis reveals conserved and divergent features of Notch1/RBPJ binding in human and murine T-lymphoblastic leukemia cells. [PDF]
Hongfang Wang, James Zou [co-first author, corresponding author], et al.
Proceedings of the National Academy of Sciences 108(36):14908-13 (2011).

Epstein-Barr virus exploits intrinsic B-lymphocyte transcription programs to achieve immortal cell growth. [PDF]
Bo Zhao, James Zou [co-first author], et al.
Proceedings of the National Academy of Sciences 108(36):14902-7 (2011).

Canonical NF-kappaB activation is essential for Epstein-Barr virus latent membrane protein 1 TES2/CTAR2 gene regulation. [PDF]
Ben Gewurz, Jessica Mar, Megha Padi, Bo Zhao, Nicholas Shinners, Kaoru Takasaki, Edward Bedoya, James Zou, et al.
Journal of Virology 85(13):6764-73 (2011).

Epstein-Barr virus nuclear antigens 3C and 3A maintain lymphoblastoid cell growth by repressing p16INK4A and p14ARF expression. [PDF]
Seiji Maruo, Bo Zhao, Eric Johannsen, Elliott Kieff, James Zou, Kenzo Takada.
Proceedings of the National Academy of Sciences 108(5):1919-24 (2011).

Religion and HIV in Tanzania: influence of religious beliefs on HIV stigma, disclosure, and treatment attitudes.[PDF]
James Zou, Yvonne Yamanaka, et al.
BMC Public Health 9:75 (2009).


Machine learning and AI

Man is to Computer Programmer as Woman is to Homemaker? Debiasing Word Embeddings. [arXiv] [TechReview] [Vice] [NPR]
Tolga Bolukbasi, Kai-Wei Chang, James Zou, Venkatesh Saligrama, Adam Kalai.
Neural Information Processing Systems (NIPS 2016).

Rich component analysis. [arXiv]
Rong Ge, James Zou.
International Conference on Machine Learning (ICML 2016).

Controlling bias in adaptive data analysis using information theory. [arXiv]
Daniel Russo, James Zou.
AISTATS (2016) (full oral; top 7% of submissions).
Information Theory and Applications (ITA) invited talk.

Clustering with a reject option: interactive clustering as Bayesian prior elicitation
Akash Srivastava, James Zou, Ryan Adams, Charles Sutton.
ArXiv (2016).

Intersecting faces: non-negative matrix factorization with new guarantees. [PDF]
Rong Ge, James Zou.
International Conference on Machine Learning (ICML 2015).

Crowdsourcing feature discovery via adaptively chosen comparisons. [PDF]
James Zou, Kamalika Chaudhuri, Adam Kalai.
HCOMP (2015); CrowdML workshop (ICML 2015); Feature extraction workshop (NIPS 2015)
Invited to Journal of Machine Learning Research (JMLR) special issue (2015).

Incentive compatible experimental design. [PDF]
Panos Toulis, David Parkes, Elery Pfeffer, James Zou.
ACM Conference on Economics and Computation (EC 2015).

Approval voting behavior in Doodle. [PDF]
James Zou, Reshef Meir, David Parkes.
ACM Conference CSCW (2015). Honorable mention for best paper (top 5% of submissions).

Coordination through contingent payment mechanisms
Hongyao Ma, Reshef Meir, David C. Parkes, James Zou.
Conference on Auctions, Market Mechanisms and Their Applications (2015). INFORMS (2015)

Contrastive learning using spectral methods. [PDF]
James Zou, Daniel Hsu, David Parkes, Ryan Adams.
Neural Information Processing Systems (NIPS 2013).

Priors for diversity in generative latent variable models. [PDF]
James Zou, Ryan Adams.
Neural Information Processing Systems (NIPS 2012).

A slime mold solver for linear programming problems. [PDF]
Anders Johannson, James Zou.
Lecture Notes in Computer Science 7318 (2012).

Get another worker? Active crowdlearning with sequential arrivals.  
James Zou, David Parkes.
Proceedings of Workshop on Machine Learning in Human Computation and Crowdsourcing (ICML 2012).

Threats and trade-Offs in resource critical crowdsourcing tasks over networks. [arXiv]
Swaprava Nath, Pankaj Dayama, Dinesh Garg, Y. Narahari, James Zou.
Proceedings 8th Workshop on Internet and Network Economics (WINE 2012).

Tolerable manipulability in dynamic assignment without money. [PDF]
James Zou, Sujit Gujar, David Parkes.
Proceedings 24th AAAI Conference on Artificial Intelligence (AAAI 2010).

Dynamic House Allocation
Sujit Gujar, James Zou, David Parkes.
Proceedings 5th Multidisciplinary Workshop on Advances in Preference Handling (MPREF 2010).

2021
2019
2018
2017
2016
bottom of page