Shuhan Tan

I am currently a PhD student at The University of Texas at Austin, advised by Philipp Krähenbühl. I am also a research intern at NVIDIA Research Autonomous Vehicle Research Group led by Dr. Marco Pavone .

Prior to my current position, I worked happily with Prof. Kristen Grauman and Prof. Bolei Zhou as a research assistant. I was also fortunate to have worked as a research intern at Uber ATG, advised by Prof. Raquel Urtasun and Prof. Shenlong Wang. Previously, I had the luck to work with Prof. Kate Saenko and Dr. Xingchao Peng at Boston University and Prof. Wei-Shi Zheng at iSEE, Sun Yat-sen University. I obtained my bachelor's degree from Sun Yat-Sen University in 2021.

My research interest mostly lies in machine learning and computer vision for autonomous driving, and particularly focuses on content generation for the safety of autonomous driving. My goal is to make autonomous driving safe and easily accessible to everyone.

Email  /  CV  /  Scholar  /  LinkedIn

PontTuset
Education

The University of Texas at Austin
PhD in Computer Science • Aug. 2021 -

Sun Yat-Sen University
B.E. in Computer Science • Sep. 2016 - Jun. 2021
Ranking: 2/189

Research Experience

NVIDIA Research Autonomous Vehicle Research Group
Research Intern • Aug. 2023 -
With Boris Ivanovic , Xinshuo Weng , Marco Pavone

The University of Texas at Austin
Research Assistant • Jan. 2023 -
PhD Advisor: Philipp Krähenbühl
Research Assistant • Aug. 2021 - Dec. 2022
With Kristen Grauman

The Chinese University of Hong Kong
Research Assistant • Sep. 2020 - Mar. 2021
With Bolei Zhou

Uber ATG Toronto
Research Intern • Sep. 2019 - Aug. 2020
With Raquel Urtasun, Shenlong Wang

Boston University
Research Assistant • July 2019 - Sep. 2019
With Kate Saenko, Xingchao Peng

Sun Yat-Sen University
Undergraduate Researcher • Sep. 2017 - June 2019
With Wei-Shi Zheng

News

[09/2023] EgoDistill is accepted to NeurIPS 2023.

[08/2023] LCTGen is accepted to CoRL 2023.

[08/2023] I started my internship at NVIDIA.

[01/2023] TrafficGen is accepted to ICRA 2023.

[08/2021] I started my PhD study at UT Austin.

Publications
PontTuset Language Conditioned Traffic Generation.



Shuhan Tan, Boris Ivanovic, Xinshuo Weng, Marco Pavone, Philipp Krähenbühl

CoRL 2023.

Paper / Project page / Code / Video / Demo (Colab)

Traffic scene generation with language condition using LLM.

PontTuset EgoDistill: Egocentric Head Motion Distillation for Efficient Video Understanding.

Shuhan Tan, Tushar Nagarajan, Kristen Grauman

NeurIPS 2023.

Paper / Project page

Efficient egocentric video understanding with head motion data from IMU.

PontTuset TrafficGen: Learning to Generate Diverse and Realistic Traffic Scenarios.

Lan Feng, Quanyi Li, Zhenghao Peng, Shuhan Tan, Bolei Zhou

ICRA 2023.

Paper / Project page/ Video

Synthesis new traffic scenario and replay in simulation.

PontTuset Improving the Fairness of Deep Generative Models without Retraining.

Shuhan Tan, Yujun Shen, Bolei Zhou.

arXiv.2012.04842 preprint

Paper / Project page / Colab

Mitigate biases of GAN models without retraining.

PontTuset SceneGen: Learning to Generate Realistic Traffic Scenes.

Shuhan Tan*, Kelvin Wong*, Shenlong Wang, Sivabalan Manivasagam, Mengye Ren, Raquel Urtasun.

CVPR 2021.

Paper / Video / Bibtex

Generate realistic traffic scences automatically.

PontTuset Class-imbalanced Domain Adaptation: An Empirical Odyssey.

Shuhan Tan, Xingchao Peng, Kate Saenko.


TASK-CV Workshop, ECCV 2020.

Paper / Bibtex

Align feature distributions across domains while the label distributions of the two domains are also different.

PontTuset LidarSIM: Realistic LiDAR Simulation by Leveraging the Real World

Sivabalan Manivasagam, Shenlong Wang, Kelvin Wong, Wenyuan Zeng, Mikita Sazanovich, Shuhan Tan, Bin Yang, Wei-Chiu Ma and Raquel Urtasun.

CVPR 2020 (Oral).

Paper / Supplement / Bibtex

Realistic sensor simulation for LiDAR.

PontTuset

Biomarker Localization by Combining CNN Classifier and Generative Adversarial Network

Rong Zhang, Shuhan Tan, Ruixuan Wang, Siyamalan Manivannan, Jingjing Chen, Haotian Lin, Wei-Shi Zheng.

MICCAI 2019.

Paper / Bibtex

We proposed a novel deep neural network architecture to effectively localize potential biomarkers in medical images, when only the image-level labels are available during model training.

PontTuset

Weakly Supervised Open-set Domain Adaptation by Dual-domain Collaboration

Shuhan Tan, Jiening Jiao, Wei-Shi Zheng

CVPR 2019.

Paper / Supplement / Bibtex

We proposed a practical weakly supervised setting for open-set domain adaptation, where two scarcely-labeled domains collaboratively learn from each other.

Invited for presentation at WebVision 2019.

Selected Honors

Distinguished Graduate Thesis, Sun Yat-sen University

SenseTime Scholarship (21 out of all AI-focus undergraduate students in China)

Patent

Systems and Methods for Simulating Traffic Scenes. US Patent App. 17/528,277.

Shuhan Tan, Kelvin Ka Wing Wong, Shenlong Wang, Sivabalan Manivasagam, Mengye Ren, Raquel Urtasun

Service

Reviewer: CVPR, ECCV, ICCV, ICRA, RA-L, TCSVT, TMI.


Updated Oct 2023.

Special thanks to Jon Barron for website template.