|
Shuhan Tan
Hi there! I am currently a PhD student at The University of Texas at Austin, advised by Philipp Krähenbühl.
My research focus on simulate realistic human behavior and environment evolument.
Specifically, my current research focuses on content generation for autonomous driving simulation systems, which aims to make autonomous driving safe and easily accessible for everyone.
My long-term goal is to develop realistic world modeling and simulation systems that can be used to train and test inteligent agents in a variety of domains.
Email  / 
CV  / 
Scholar  / 
LinkedIn
|
|
|
Education
The University of Texas at Austin
PhD in Computer Science • Aug. 2021 -
Sun Yat-Sen University
B.E. in Computer Science • Sep. 2016 - Jun. 2021
Ranking: 2/189
|
|
News
[02/2026] LCDrive got accepted to CVPR 2026! Check it out!
[10/2025] I was selected as an Outstanding Reviewer of ICCV 2025 (Top 3%)!
[06/2025] InfGen got accepted to ICCV 2025! Check it out!
[05/2025] RIPT-VLA released! Check it out!
[03/2025] SceneDiffuser++ got accepted to CVPR 2025!
|
|
Towards realistic, controllable and reactive traffic simulation
Long-term Human Motion Prediction Workshop
ICRA 2024. Yokohama, Japan.
Slides / Workshop
|
|
Latent Chain-of-Thought World Modeling for End-to-End Driving.
Shuhan Tan, Kashyap Chitta, Yuxiao Chen, Ran Tian, Yurong You, Yan Wang, Wenjie Luo, Yulong Cao, Philipp Krähenbühl, Marco Pavone, Boris Ivanovic
CVPR 2026.
Paper
A Vision-Language-Action model for autonomous driving that uses latent chain-of-thought reasoning with a learned world model for improved trajectory planning.
|
|
SceneGen: Learning to Generate Realistic Traffic Scenes.
Shuhan Tan*, Kelvin Wong*, Shenlong Wang, Sivabalan Manivasagam, Mengye Ren, Raquel Urtasun.
CVPR 2021.
Paper /
Video /
Bibtex
Generate realistic traffic scences automatically.
|
|
Class-imbalanced Domain Adaptation: An Empirical Odyssey.
Shuhan Tan, Xingchao Peng, Kate Saenko.
TASK-CV Workshop, ECCV 2020.
Paper /
Bibtex
Align feature distributions across domains while the label distributions of the two domains are also different.
|
|
LidarSIM: Realistic LiDAR Simulation by Leveraging the Real World
Sivabalan Manivasagam, Shenlong Wang, Kelvin Wong, Wenyuan Zeng, Mikita Sazanovich, Shuhan Tan, Bin Yang, Wei-Chiu Ma and Raquel Urtasun.
CVPR 2020 (Oral).
Paper /
Supplement /
Bibtex
Realistic sensor simulation for LiDAR.
|
|
Biomarker Localization by Combining CNN Classifier and Generative Adversarial Network
Rong Zhang, Shuhan Tan, Ruixuan Wang, Siyamalan Manivannan, Jingjing Chen, Haotian Lin, Wei-Shi Zheng.
MICCAI 2019.
Paper /
Bibtex
We proposed a novel deep neural network architecture to effectively localize potential biomarkers in medical images, when only the image-level labels are available during model training.
|
|
Weakly Supervised Open-set Domain Adaptation by Dual-domain Collaboration
Shuhan Tan, Jiening Jiao, Wei-Shi Zheng
CVPR 2019.
Paper /
Supplement /
Bibtex
We proposed a practical weakly supervised setting for open-set domain adaptation, where two scarcely-labeled domains collaboratively learn from each other.
Invited for presentation at WebVision 2019.
|
|
Selected Honors
Distinguished Graduate Thesis, Sun Yat-sen University
|
|
Acedemic Service
Reviewer: CVPR, ECCV, ICCV, ICLR, ICML, ICRA, IJCV, NeruIPS, RA-L, TCSVT, TMI.
|
|
Shufa, aka Maomi
British Longhair Boy😺
Born in 2022, California.
Instagram / Wiki
My little colleague, who always sleeps on my desk,
put his paw on the F5 key to prevent vscode debugging.
|
Updated Oct 2025.
Special thanks to Jon Barron for website template.
|
|