About Me
I am an assistant professor of computer science at Brown University, and a staff research scientist at Google. My research interest is on computer vision and machine learning, especially on learning multimodal temporal representations from unlabeled videos. I received my Ph.D. from the University of Southern California in 2016, advised by Prof. Ram Nevatia, and my bachelor degree in Computer Science at Tsinghua University in 2011.
My long-term research goal is to make videos an indispensable learning source for visual perception, language understanding, robotics and cognitive science, and ultimately to develop autonomous machines that interact with and assist us in unconstrained environments.
I am looking for highly motivated students to join my lab at Brown, please find info for prospective students.
Teaching
- CS2950-K Special Topics in Computational Linguistics, Fall 2020.
Services
- Senior Program Committee, AAAI 2021.
- Area Chair, CVPR 2020 and 2021.
- Area Chair, WACV 2017 and 2018.
Recent Projects
![]() |
Learning Temporal Dynamics from Cycles in Narrated Video
Dave Epstein, Jiajun Wu, Cordelia Schmid, and Chen Sun arXiv 2101.02337 arXiv |
![]() |
VideoBERT: A Joint Model for Video and Language Representation Learning
Chen Sun, Austin Myers, Carl Vondrick, Kevin Murphy, and Cordelia Schmid ICCV 2019 arXiv / Research Blog / VentureBeat |
![]() |
VectorNet: Encoding HD Maps and Agent Dynamics from Vectorized Representation
Jiyang Gao*, Chen Sun*, Hang Zhao, Yi Shen, Dragomir Anguelov, Congcong Li, and Cordelia Schmid CVPR 2020 arXiv / Waymo Blog / VentureBeat |
![]() |
Speech2Action: Cross-modal Supervision for Action Recognition
Arsha Nagrani, Chen Sun, David Ross, Rahul Sukthankar, Cordelia Schmid, and Andrew Zisserman CVPR 2020 arXiv / Project / Data |
![]() |
What Makes for Good Views for Contrastive Learning?
Yonglong Tian, Chen Sun, Ben Poole, Dilip Krishnan, Cordelia Schmid, and Phillip Isola NeurIPS 2020 arXiv / Research Blog / Project / Code |