Ali Vosoughi is a Ph.D. student in the Electrical and Computer Engineering Department at the University of Rochester. His research interests include multimodal information processing with deep learning, multisensory perception, and video and scene understanding across audio, image, video, and natural language (NLP), with the goal of helping to build the next generation of AI assistants able to solve complex and imaginative tasks.

He works with Prof. Chenliang Xu and Prof. Axel Wismueller on deep multimodal learning of audio, vision, image, speech, and language models for egocentric and third-person video understanding and medical imaging applications. Ali is a scholar in the NSF’s Augmented and Virtual Reality project.

Write ✉️ to Ali Vosoughi: mvosough




Audio-Visual Deep Learning

Visual Question Answering

Cross-Modal Language and Vision Models for Radiology

Point Clouds and Deep Learning

Neural Networks and Causality

Anomaly Detection with Autoencoders

Information Theory for Hardware Security

CMOS-Based Optimization Accelerators

Camera Sensor Design

CMOS and Electronics


