Co-led the development of RL training pipeline for SWE agents.
Trained SoTA 32B and 14B SWE agents on complex, realistic bugs generated by BugPilot, a new bug generation framework.
Co-led the development of RL training pipeline for SWE agents.
Trained SoTA 32B and 14B SWE agents on complex, realistic bugs generated by BugPilot, a new bug generation framework.
Co-led the development of RL training pipeline for SWE agents.
Trained SoTA 32B and 14B SWE agents on complex, realistic bugs generated by BugPilot, a new bug generation framework.
Analyzed the game Codenames to understand how players use language to communicate efficiently across cultures and developed a method to allow players to communicate more efficiently across cultures.