๐ง ali.vosoughi@rochester.edu
๐ CS Department, Wegmans Hall 3211
๐ Apple
Machine Learning Intern
Agentic Multimodal AI
Agentic Multimodal AI
present
๐ต Smule AI
Research Scientist Intern
Spatial Audio Generation
Spatial Audio Generation
JunโSep 2025
๐ข Microsoft Research
Research Intern
Audiovisual LLM
Audiovisual LLM
MayโAug 2024
๐ Bosch AI Research
Research Intern
Audio LLM
Audio LLM
AprโJul 2023
๐ก๏ธ DARPA PTG
Graduate Researcher
Autonomous AR Copilot
Autonomous AR Copilot
2022โpresent
First counterfactual audio methods
ICASSP’24 + US Patent US20250124292A1 (published Jan 2025)
ICASSP’24 + US Patent US20250124292A1 (published Jan 2025)
Autonomous multimodal copilot
Real-time AR demonstrations (DARPA)
Real-time AR demonstrations (DARPA)
VERIFY benchmark
Reasoning verification framework
Reasoning verification framework
Recent News & Updates
03/2025
๐ Published VERIFY benchmark
10/2024
๐ค Presented at SANE 2024, DeepMind Boston
10/2024
๐ ACM Multimedia 2024 paper accepted
08/2024
๐ผ Research presentation at Microsoft, Seattle
03/2024
๐ NAACL 2024 paper accepted
02/2024
๐ IEEE Transactions on Multimedia paper
08/2023
๐ฏ Two ICCV 2023 papers accepted
04/2023
๐ข Started internship at Bosch Center for AI
04/2022
๐ Nominated for Donald M. and Janet C. Barnard Fellowship
Publications


VERIFY: A Benchmark of Visual Explanation and Reasoning for Investigating Multimodal Reasoning Fidelity
Under Review’26
[Paper][Website][๐ค Hugging Face]


EAGLE: Egocentric AGgregated Language-video Engine
ACM International Conference on Multimedia (ACM MM) 2024
[Paper]





AVSA-Sep: Separating Invisible Sounds Toward Universal Audiovisual Scene-Aware Sound Separation
IEEE/CVF International Conference on Computer Vision (ICCV) 2023: ICCV AV4D Workshop
[Paper]



Leveraging Pre-Images to Discover Nonlinear Relationships in Multivariate Environments
European Signal Processing Conference (EUSIPCO) 2021
[Paper]

Personal Gallery

