Previously, I completed my master's degree at the University of Pennsylvania, where I worked with Prof. Chris Callison-Burch and Prof. Eric Wong. Before that, I earned a B.S. in Computer Science from Brandeis University, graduating with honors as a member of Phi Beta Kappa (top 10%).
I study how to improve the computational reasoning capabilities of large language models through post-training and reinforcement learning techniques. Previously, I have also worked on the safety and trustworthiness of LLMs.
We introduce Deontic Agentic Reasoning (DAR), an agentic setup in which models query long, cross-referenced statutes on demand rather than reading them in full. Evaluated under multiple harnesses on hard subsets of DeonticBench, agentic harnesses push the frontier on deontic reasoning, but gains are uneven β weaker models often degrade on numerical tasks while consuming far more tokens.
We introduce DeonticBench, a benchmark of 6,232 tasks for evaluating rule-based reasoning in language models across domains including US federal taxes, airline baggage policies, US immigration administration, and US state housing law. The benchmark supports both language-based and computational reasoning approaches. Systems can optionally make use of provided symbolic translations. Results show current models struggle, achieving only 44.4% on numeric subsets and 46.6% on housing cases, and that fine-tuning and reinforcement learning approaches remain unreliable for solving complex rule-reasoning problems.
We propose Modality Aware Neuron Unlearning (MANU), a novel unlearning framework for MLLMs designed to selectively clip neurons based on their relative importance to the targeted forget data, curated for different modalities. Specifically, MANU consists of two stages: important neuron selection and selective pruning. The first stage identifies and collects the most influential neurons across modalities relative to the targeted forget knowledge, while the second stage is dedicated to pruning those selected neurons.
We propose Stable Sequential Unlearning (SSU), a novel framework designed to unlearn copyrighted content from LLMs over multiple time steps. Our approach works by identifying and removing specific weight updates in the model's parameters that correspond to copyrighted content. We improve unlearning efficacy by introducing random labeling loss and ensuring the model retains its general-purpose knowledge by adjusting targeted parameters.
We introduce Multimodal Large Language Model Unlearning Benchmark (MLLMU-Bench), a novel benchmark aimed at advancing the understanding of multimodal machine unlearning. MLLMU-Bench consists of 500 fictitious profiles and 153 profiles for public celebrities, each profile feature over 14 customized question-answer pairs, evaluated from both multimodal (image+text) and unimodal (text) perspectives.
We introduce Selective Knowledge negation Unlearning (SKU), a novel unlearning framework for LLMs, designed to eliminate harmful knowledge while preserving utility on normal prompts.
Teaching Assistant, Data Structures and the Fundamentals of Computing, Brandeis University (Fall 2021)
Academic Service
Conference Reviewer: NeurIPS, EMNLP, NAACL, ACL
Journal Reviewer: IEEE Transactions on Information Forensics and Security, npj Digital Medicine (Nature Portfolio)
Miscellaneous
I've always been surrounded by wonderful friends, collaborators, and advisors, and I try to maintain an optimistic outlook. If you're having a tough time and would like someone to talk to, feel free to reach out!
I like basketball, lifting, and making new friends.