📧 ali.vosoughi@rochester.edu
📍 CS Department, Wegmans Hall 3211
🍎 Apple
Machine Learning Intern
Agentic Multimodal AI
Agentic Multimodal AI
present
🎵 Smule AI
Research Scientist Intern
Spatial Audio Generation
Spatial Audio Generation
Jun–Sep 2025
🏢 Microsoft Research
Research Intern
Audiovisual LLM
Audiovisual LLM
May–Aug 2024
🚗 Bosch AI Research
Research Intern
Audio LLM
Audio LLM
Apr–Jul 2023
🛡️ DARPA PTG
Graduate Researcher
Autonomous AR Copilot
Autonomous AR Copilot
2022–present
First counterfactual audio methods
ICASSP’24 + US Patent US20250124292A1 (published Jan 2025)
ICASSP’24 + US Patent US20250124292A1 (published Jan 2025)
Autonomous multimodal copilot
Real-time AR demonstrations (DARPA)
Real-time AR demonstrations (DARPA)
VERIFY benchmark
Reasoning verification framework
Reasoning verification framework
Recent News & Updates
03/2025
🚀 Published VERIFY benchmark
10/2024
🎤 Presented at SANE 2024, DeepMind Boston
10/2024
📄 ACM Multimedia 2024 paper accepted
08/2024
💼 Research presentation at Microsoft, Seattle
03/2024
📄 NAACL 2024 paper accepted
02/2024
📄 IEEE Transactions on Multimedia paper
08/2023
🎯 Two ICCV 2023 papers accepted
04/2023
🏢 Started internship at Bosch Center for AI
04/2022
🏆 Nominated for Donald M. and Janet C. Barnard Fellowship
Publications


VERIFY: A Benchmark of Visual Explanation and Reasoning for Investigating Multimodal Reasoning Fidelity
Under Review’26
[Paper][Website][🤗 Hugging Face]


EAGLE: Egocentric AGgregated Language-video Engine
ACM International Conference on Multimedia (ACM MM) 2024
[Paper]





AVSA-Sep: Separating Invisible Sounds Toward Universal Audiovisual Scene-Aware Sound Separation
IEEE/CVF International Conference on Computer Vision (ICCV) 2023: ICCV AV4D Workshop
[Paper]



Leveraging Pre-Images to Discover Nonlinear Relationships in Multivariate Environments
European Signal Processing Conference (EUSIPCO) 2021
[Paper]

Personal Gallery

