My research focuses on data for efficient multimodal machine learning, developing data-centric approaches that trace model performance back to training data composition and introduce data-driven capabilities to build more scalable and capable vision-language systems.
My work sits at the intersection of three core pillars:
Compositionality (how well models generalize to novel combinations of concepts) [COMPACT, ConceptMix]
Controllability (how precisely we can control generation) [Motive, Corgi]
I am happy to collaborate and answer questions about my research, feel free to send me an email. I especially encourage students from underrepresented groups to reach out.
09/2024: Attended ECCV 2024 and gave a spotlight talk on ConceptMix (Slides) at Knowledge in Generative Models Workshop; and our Vision-Language Dataset Distillation received best paper award at Dataset Distillation Workshop.
02/2024: Led a discussion on Scaling Law in the PLI Vision-Language Reading Group (Slides).
02/2024: I am TAing for COS 429 Computer Vision in Spring 2024.
01/2024: Passed my general exam (quals)! (Video | Slides | Reading List). Huge thanks to my committee members Olga Russakovsky, Szymon Rusinkiewicz and Adji Bousso Dieng for their support and feedback!
2023
09/2023: I am TAing for COS 597O Advanced Topics in Computer Science: Deep Generative Models, with focus on Methods, Applications & Societal Considerations in Fall 2023.
06/2023: Attended CVPR 2023 and presented Pix2Map (Poster).
05/2023: Started my internship at Meta Reality Lab in Redmond, WA on multimodal generative models.
TA: COS 597O Advanced Topics in Computer Science: Deep Generative Models: Methods, Applications & Societal Considerations.
Fall 2023, Princeton, with instructor: Adji Bousso Dieng.
Professional Activities & Outreach
Reviewer: NeurIPS 25'/24'/23', ICLR 25'/24', ICML 25'/24', CVPR 25'/24'/23'/22', ICCV 25'/23', ECCV 24'/22', TMLR, ICRA 24', ACCV 24', ICLR 23' Workshop ME-FoMo, NeurIPS Interpolate Workshop 22', BMVC 20', IJCAI 20'.
Workshop Co-organizer: ICCV 2025 Workshop on Curated Data for Efficient Learning.
Committee Member: Diversity, Equity and Inclusion Committee at Robotics Institute, CMU.