I am an assistant professor of computer science at Brown University, and a staff research scientist at Google. My research interest is on computer vision and machine learning, especially on learning multimodal temporal representations from unlabeled videos. I received my Ph.D. from the University of Southern California in 2016, advised by Prof. Ram Nevatia, and my bachelor degree in Computer Science at Tsinghua University in 2011.
My long-term research goal is to make videos an indispensable learning source for visual perception, language understanding, robotics and cognitive science, and ultimately to develop autonomous machines that interact with and assist us in unconstrained environments.
I am looking for highly motivated students to join my lab at Brown, please find info for prospective students.
- CS2950-K Special Topics in Computational Linguistics, Fall 2020.
- Senior Program Committee, AAAI 2021.
- Area Chair, CVPR 2020 and 2021.
- Area Chair, WACV 2017 and 2018.
Learning Temporal Dynamics from Cycles in Narrated Video
Dave Epstein, Jiajun Wu, Cordelia Schmid, and Chen Sun
VideoBERT: A Joint Model for Video and Language Representation Learning
Chen Sun, Austin Myers, Carl Vondrick, Kevin Murphy, and Cordelia Schmid
arXiv / Research Blog / VentureBeat
VectorNet: Encoding HD Maps and Agent Dynamics from Vectorized Representation
Jiyang Gao*, Chen Sun*, Hang Zhao, Yi Shen, Dragomir Anguelov, Congcong Li, and Cordelia Schmid
arXiv / Waymo Blog / VentureBeat
Speech2Action: Cross-modal Supervision for Action Recognition
Arsha Nagrani, Chen Sun, David Ross, Rahul Sukthankar, Cordelia Schmid, and Andrew Zisserman
arXiv / Project / Data
What Makes for Good Views for Contrastive Learning?
Yonglong Tian, Chen Sun, Ben Poole, Dilip Krishnan, Cordelia Schmid, and Phillip Isola
arXiv / Research Blog / Project / Code