Willie

Research: I work at the intersection of machine learning, decision making, generative AI, and AI-for-science. One focus of my research is on AI-driven decision making in costly, data-limited settings, using model-based optimization, experimental design, and uncertainty quantification. I also work on generative models, LLMs, and multimodal models, with applications in the physical sciences, biology, and engineering.

Additionally, I've worked on distributed algorithms for scalable training, and developed open-source libraries for multilevel optimization, uncertainty quantification, AutoML, and Bayesian optimization.

Background: Previously, I was a postdoc in CS at Stanford University, working with Stefano Ermon. Prior to that, I received my PhD in Machine Learning at Carnegie Mellon University, where I was advised by Eric Xing and also worked with Jeff Schneider and Barnabás Póczos.

News

  • Oct 28, 2025 New paper on scalable methods for LLM data valuation with influence functions, in NeurIPS 2025.
  • Oct 11, 2025 New paper on active learning in self-driving labs for materials discovery in Digital Discovery.
  • July 7, 2025 New paper on sample-efficient alignment in LLMs with active exploration, in COLM 2025.
  • May 5, 2025 New paper on inverse design of ultra-high-performance concrete, in Phil. Trans. R. Soc. A.
  • Feb 11, 2025 New paper on LiveBench, a challenging, contamination-free LLM eval, in ICLR 2025 (spotlight).
  • Feb 11, 2025 New paper on decision making under uncertainty with LLMs (DeLLMa) in ICLR 2025 (spotlight).
  • Jan 6, 2025 Released METAGENE-1 a metagenomic foundation model designed for pandemic monitoring.
  • Jul 18, 2024 New paper on materials discovery with Bayesian algorithm execution in npj Comp. Materials.
  • Jul 10, 2024 New paper on isomorphic representations in multimodal foundation models in COLM 2024.
  • Mar 25, 2024 New paper on uncertainty quantification for deep learning PDE surrogates in AAAI 2024.
  • Feb 23, 2024 New paper on experimental design for determining safe tokamak rampdowns in Nuclear Fusion.
  • Oct 20, 2023 New paper (+ code) on algorithms and systems for scalable meta learning in NeurIPS 2023.
  • Oct 20, 2023 New paper on offline model-based optimization through co-teaching in NeurIPS 2023.
  • July 28, 2023 Co-organized the Differentiable Almost Everything Workshop at ICML 2023.
  • Jan 20, 2023 New paper on automatic differentiation for multilevel optimization in ICLR 2023 (notable-top-5%).
  • Jan 20, 2023 New paper on policy identification for active reinforcement learning in ICLR 2023 (notable-top-5%).
  • Jan 20, 2023 New paper on a framework to combine weak supervision and generative modeling in ICLR 2023.
  • Jan 1, 2023 New paper on offline imitation learning with suboptimal demonstrations in AAAI 2023.
  • Dec 7, 2022 New paper on uncertainty quantification with pre-trained language models in EMNLP 2022.
  • Dec 2, 2022 Invited talk at the Workshop on Gaussian Processes and Decision-making Systems at NeurIPS 2022.
  • Oct 10, 2022 New paper (+ code) on trajectory information planning for exploration in RL in NeurIPS 2022.
  • Oct 10, 2022 New paper on decision-theoretic entropies for generalizing Bayesian optimization in NeurIPS 2022.
  • July 22, 2022 Co-organized the Real World Experiment Design and Active Learning Workshop at ICML 2022.
  • May 15, 2022 New paper (+ website) on likelihood-free Bayesian optimization in ICML 2022 (long talk).
  • May 15, 2022 New paper on a modular conformal calibration framework for UQ in ICML 2022.
  • Jan 28, 2022 New paper (+ blog post) on experimental design and reinforcement learning in ICLR 2022.
  • Jan 1, 2022 New paper (+ website) on large-scale object counting in satellite images, in AAAI 2022 (oral).
  • Oct 15, 2021 New paper (+ code) on quantile methods for calibrated uncertainty quantification in NeurIPS 2021.
  • Oct 15, 2021 Two papers on explainable machine learning and personalized benchmarking in NeurIPS 2021.
  • July 14, 2021 Our paper on Pollux was awarded the Jay Lepreau Best Paper Award at OSDI'21.
  • June 10, 2021 New paper (+ website) on Bayesian Algorithm Execution (BAX) and InfoBAX, in ICML 2021.
  • June 1, 2021 I co-organized the Machine Learning for Data (Creation, Privacy, Bias) Workshop at ICML 2021.
  • Apr 1, 2021 New paper (+ AdaptDL) on Pollux, a deep learning cluster scheduler/tuner, in OSDI 2021.
  • Mar 16, 2021 New paper (+ code) on uncertainty quantification with martingales for GPs in ALT 2021.
  • Mar 9, 2021 New paper on active classification for catalyst discovery in the Journal of Chemical Physics.
  • Jan 12, 2021 New paper (+ code) on a framework for interactive weak supervision in ICLR 2021.
  • Dec 22, 2020 Released Uncertainty Toolbox, for predictive UQ, calibration, metrics, and visualization.
  • Dec 2, 2020 New paper (+ code) on BANANAS, a method for neural architecture search, in AAAI 2021.

Projects

Publications

A full list of my publications can be found here.