My research focuses on improving foundation models and agentic systems to make them more trustworthy and adaptive, enabling safe and effective human-AI collaboration. Currently, I am interested in:
(1) auditing, evaluating, and post-training for reliable agentic AI, especially in high-stakes, specialized domains (e.g., scientific and medical applications) and long-horizon tasks (where verification or oversight is difficult), and in scenarios requiring multimodal interaction or multi-agent coordination.
This includes improving both epistemic reliability (e.g., confidence calibration, recognizing knowledge gap, handling ambiguity) and behavioral reliability (e.g., mitigating risks of privacy leakage, tool misuse, and alignment failures).
(2) building user-adaptive and collaborative AI systems that can align with diverse user goals, preferences, and values in real-world interaction.
This includes personalized and pluralistic alignment, as well as improving models' ability to proactively seek information, infer user goals and intents from interaction, and adapt their responses and actions responsibly.
During Summer 2025, I was a Research Engineer Intern at the Center for AI Safety. Previously, I was a research assistant at Dartmouth College where I had the chance to work with Dr. Ruibo Liu and Prof. Soroush Vosoughi on value alignment for LLMs.
A benchmark and set of analyses for evaluating whether vision-language models respect contextual integrity in location disclosure for image geolocation, revealing that violations of contextual norms may result in privacy harms, characterized by over-disclosure of sensitive locations, poor privacy-utility tradeoffs, and misalignment with human privacy expectations.
Competed for Shared Task 0: Generalization and Typologically Diverse Morphological Inflection and achieved the highest performance among all submission in both small and large training conditions.
Misc
I come from Nanjing, a beautiful and historical city that served as the capital of six ancient Chinese dynasties over the past two thousand years.
I like listening to Rock N' Roll, ranging from Progressive Rock to BritPop and Pop Rock.
I've also been known to (awkwardly) hoop, smash, and stroke. (Style borrowed here from Prof. Schmidt)