I am a Computer Science Ph.D. student at UCLA, co-advised by Kai-Wei Chang and Nanyun Peng. I work on Multimodality (Vision + Language) and Embodied Learning.
I led visual agentic reasoning efforts for Segment Anything 3 at Meta Superintelligence Labs. Prior to my Ph.D., I spent time at Stanford SVL, working with Jiajun Wu and Fei-Fei Li.



