Ali Vosoughi

阿力

Ali Vosoughi is a Ph.D. student in the Electrical and Computer Engineering Department at the University of Rochester. His interests are multimodal information processing using deep learning, multisensory perception, video and scene understanding, audio, image, video, and natural language (NLP) that can help to build the next generation of AI assistants able to solve complex and imaginative tasks.

He works with Prof. Chenliang Xu and Prof. Axel Wismueller on deep multimodal learning of audio, vision, image, speech, and language models for egocentric/3rd person video understanding and medical imaging applications. Ali is a scholar in the NSF’s Augmented and Virtual Reality project.

Write ✉️ to Ali Vosoughi: mvosough 🙂 ece.rochester.edu


GitHub
LinkedIn
Scholar


News


Projects

Audio-Visual Deep Learning


Visual Question Answering


Cross-Modal Language and Vision Models for Radiology


Point Clouds and Deep Learning


Neural Networks and Causality


Anomaly Detection with Autoencoders


Information Theory for Hardware Security


CMOS-Based Optimization Accelerators


Camera Sensor Design


CMOS and Electronics


فارسی

Personal Gallery

Ali Vosoughi
Ali Vosoughi

چرا فارسی را نمی شود در این وبسایت از راست نوشت؟