Research Scientist Intern — Audio GenAI
Smule AI • Jun-Sep 2025
Developing spatial audio generation capabilities with prompt-based control for real-time applications using modern generative modeling approaches. Building training infrastructure for multimodal audio synthesis models from scratch using distributed computing frameworks.
Research Intern — Audio/Video/LLM
Microsoft Research • May–Aug 2024
Developed coordination system for multiple MLLMs and LLMs to extract unified features through cross-modal attention between audio and video. Built multi-LLM pipeline for large-scale audiovisual learning using distributed computing frameworks. Research on unified audiovisual encoder accepted at EUSIPCO 2025.
Audio–Language Research Intern
Bosch AI Research (BCAI) • Apr–Jul 2023
Co-authored counterfactual audio–language method (ICASSP 2024); patent pending. Developed novel approaches for learning audio concepts from counterfactual natural language prompts.
Patents & Intellectual Property
Method and System to Train Audio Retrieval and Zero Shot Classification Systems with Counterfactual Prompts
US Patent Application 18/379,518 • Pending
Assignee: Bosch AI Research USA