About Me

I am an assistant professor of computer science at Brown University, where I direct the PALM🌴 research lab on computer vision, machine learning, and AI. I am also a part-time staff research scientist at Google DeepMind. Previously, I received my PhD from the University of Southern California in 2016, advised by Prof. Ram Nevatia. I completed my bachelor's degree in Computer Science at Tsinghua University in 2011. (Short bio)

Chen Sun is an assistant professor of computer science at Brown University and a staff research scientist (part-time) at Google DeepMind, studying computer vision and machine learning. His current research focuses on learning temporal dynamics from unlabeled videos and applying these video-centric world models in robotics. Chen has received an NSF CAREER Award, a Richard B. Salomon Faculty Research Award, and a Samsung Global Research Outreach Award. His work on behavior prediction in videos was a CVPR 2019 best paper finalist. Previously, Chen received his PhD from the University of Southern California in 2016 and bachelor's degree from Tsinghua University in 2011.

At PALM🌴, we focus on learning generalizable temporal dynamics from unlabeled videos, whether as multimodal concepts, human behaviors, or raw pixels. We are also exploring applications of these video-centric world models in robotics.

I have received an NSF CAREER Award, a Richard B. Salomon Faculty Research Award, and a Samsung Global Research Outreach Award. My work on behavior prediction was a CVPR 2019 best paper finalist. Our research has been supported by Adobe, Honda, Meta, NASA, NSF, NVIDIA, and Samsung over the years. We are a member of the NSF AI Research Institute on Interaction for AI Assistants (ARIA).

Teaching

Group

PhD students

PhD alumni

  • Calvin Luo (PhD '26)
    • Thesis: Generalizable Decision-Making via Large-Scale Visual Generative Models
    • Research Mobility Fellowship, Joukowsky Outstanding Dissertation Prize Finalist
  • Nate Gillman (PhD '26, Mathematics & Computer Science)
    • Thesis: Controlling Deep Generative Models via Physical and Mathematical Priors
    • Next position: Research Scientist at Google Research
  • Shijie Wang (PhD '26)
    • Thesis: From Text to Pixels: Multimodal Video Understanding with Language Pre-training
    • Next position: Research Scientist at NVIDIA Cosmos Lab
  • Tian Yun (PhD '26, co-advised with Ellie Pavlick)
    • Thesis: Internal Representations in Vision-Language Models
    • Next position: Research Scientist at Meta MSL

Undergraduate and MS alumni

  • Kaleb Newman (Brown '25, CRA Honorable Mention β€” PhD at Princeton CS)
  • Michael Freeman (Brown '20 β€” PhD at Cornell CS)
  • Rohan Krishnan (Brown '25 β€” engineer at Klaviyo)
  • Chia-Hong Hsu (Brown MS '25 β€” PhD at UBC)
  • Kevin Zhao (Brown MS '24 β€” researcher at ByteDance Seed)
  • Zilai Zeng (Brown MS '24 β€” PhD at Brown CS)
  • Yunhao Luo (Brown MS '24 β€” PhD at UMich CS)
  • Mandy He (Brown '24 β€” engineer at Duolingo)
  • Minh Quan Do (Brown MS '24 β€” co-founder at Tan Kim Nhat Trading)
  • David Heffren (Brown '24 β€” PhD at JHU Applied Math)
  • John Ryan Byers (Brown '24 β€” master's at Cornell Tech)
  • Ce Zhang (Brown MS '23 β€” PhD at UNC CS)
  • Changcheng Fu (Brown MS '23 β€” PhD at USC CS)
  • Kunal Handa (Brown '23 β€” member of technical staff at Anthropic)
  • Jessica Li (Brown '23 β€” engineer at Headway)
  • Usha Bhalla (Brown '22 β€” PhD at Harvard CS)
  • Emily Byun (Brown '21 β€” PhD at CMU MLD)

Mentorship

Services

  • Workshop Chair, CVPR 2025.
  • Action Editor, TMLR.
  • Area Chair, ICLR 2025 and 2026.
  • Area Chair, CVPR 2020 to 2026.
  • Area Chair, ICCV 2023 and 2025.
  • Area Chair, ECCV 2022 to 2026.
  • Area Chair, NeurIPS 2023 to 2025.
  • Area Chair, ACL 2023 and 2025.
  • Senior PC, AAAI 2021 and 2022.
  • Area Chair, WACV 2017 and 2018.

Selected Projects

Goal Force Goal Force: Teaching Video Models To Accomplish Physics-Conditioned Goals
Nate Gillman, Yinghua Zhou, Zitian Tang, Evan Luo, Arjan Chakravarthy,
Daksh Aggarwal, Michael Freeman, Charles Herrmann, and Chen Sun
CVPR 2026
arXiv / Project / Code
SAIL Self-Improving Loops for Visual Robotic Planning
Calvin Luo*, Zilai Zeng*, Mingxi Jia, Yilun Du, and Chen Sun
ICLR 2026
arXiv / Project
Spacewalk-18 Spacewalk-18: A Benchmark for Multimodal and Long-form Procedural Video Understanding in Novel Domains
Zitian Tang*, Rohan Myer Krishnan*, Zhiqiu Yu, and Chen Sun
WACV 2026 (Oral, Award Finalist)
arXiv / Project / Dataset
Force Prompting Force Prompting: Video Generation Models Can Learn and Generalize Physics-based Control Signals
Nate Gillman, Charles Herrmann*, Michael Freeman, Daksh Aggarwal, Evan Luo, Deqing Sun, and Chen Sun*
NeurIPS 2025
arXiv / Project / Code
ObjectMLLM How Can Objects Help Video-Language Understanding?
Zitian Tang, Shijie Wang, Junho Cho, Jaewook Yoo, and Chen Sun
ICCV 2025
arXiv / Project / Code
Fourier Head Fourier Head: Helping Large Language Models Learn Complex Probability Distributions
Nate Gillman*, Daksh Aggarwal*, Michael Freeman, Saurabh Singh, and Chen Sun
ICLR 2025
arXiv / Project / Code
Solving New Tasks by Adapting Internet Video Knowledge Solving New Tasks by Adapting Internet Video Knowledge
Calvin Luo*, Zilai Zeng*, Yilun Du, and Chen Sun
ICLR 2025
arXiv / Project / Code
Motion Prompting Motion Prompting: Controlling Video Generation with Motion Trajectories
Daniel Geng, Charles Herrmann et al.
CVPR 2025 (Oral)
arXiv / Project
Text-Aware Diffusion for Policy Learning Text-Aware Diffusion for Policy Learning
Calvin Luo, Mandy He*, Zilai Zeng*, and Chen Sun
NeurIPS 2024
(Also appeared at NeurIPS 2023 workshop on Diffusion Models)
arXiv / Poster / Project / Code
Self-Correcting Self-Consuming Loops Self-Correcting Self-Consuming Loops for Generative Model Training
Nate Gillman, Michael Freeman, Daksh Aggarwal, Chia-Hong Hsu, Calvin Luo, Yonglong Tian, and Chen Sun
ICML 2024
arXiv / Poster / Project / Code
Vamos Vamos: Versatile Action Models for Video Understanding
Shijie Wang, Qi Zhao, Minh Quan Do, Nakul Agarwal, Kwonjoon Lee, and Chen Sun
ECCV 2024
arXiv / Project / Code
AntGPT AntGPT: Can Large Language Models Help Long-term Action Anticipation from Videos?
Qi Zhao*, Shijie Wang*, Ce Zhang, Changcheng Fu, Minh Quan Do, Nakul Agarwal, Kwonjoon Lee, and Chen Sun
ICLR 2024
arXiv / Project / Code
Goal-Conditioned Predictive Coding for Offline Reinforcement Learning Goal-Conditioned Predictive Coding for Offline Reinforcement Learning
Zilai Zeng, Ce Zhang, Shijie Wang, and Chen Sun
NeurIPS 2023
arXiv / Project / Code
Temporal Dynamics from Cycles Learning Temporal Dynamics from Cycles in Narrated Video
Dave Epstein, Jiajun Wu, Cordelia Schmid, and Chen Sun
ICCV 2021
arXiv / Research Blog / Project
InfoMin What Makes for Good Views for Contrastive Learning?
Yonglong Tian, Chen Sun, Ben Poole, Dilip Krishnan, Cordelia Schmid, and Phillip Isola
NeurIPS 2020
arXiv / Research Blog / Project / Code
VectorNet VectorNet: Encoding HD Maps and Agent Dynamics from Vectorized Representation
Jiyang Gao*, Chen Sun*, Hang Zhao, Yi Shen, Dragomir Anguelov, Congcong Li, and Cordelia Schmid
CVPR 2020
arXiv / Waymo Blog / VentureBeat
VideoBERT VideoBERT: A Joint Model for Video and Language Representation Learning
Chen Sun, Austin Myers, Carl Vondrick, Kevin Murphy, and Cordelia Schmid
ICCV 2019
arXiv / Research Blog / VentureBeat