Promptable Closed-loop Traffic Simulation
In this work, we propose a new task: promptable closed-loop traffic simulation. Besides simulating realistic traffic agent interactions in closed loop, traffic models should also generate agent motions that satisfy a complex set of user-specified prompts, which may include multimodal prompts like:
To address this problem, we propose ProSim, a multimodal promptable closed-loop traffic simulation framework. ProSim allows the user to give a complex set of numerical, categorical, or textual prompts to instruct each agent's behavior and intention. ProSim then rolls out a traffic scenario in a closed-loop manner, modeling each agent's interaction with other traffic participants. The architecture of ProSim is shown below.
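To make the idea of a prompted closed-loop rollout concrete, here is a minimal toy sketch. It is not ProSim's model or API: the `Agent` class, the `toy_policy` rule, and the 1-D lane are all assumptions made up for illustration. The only point is the loop structure, in which every agent's next action is computed from the current states of the other agents plus its own prompt (here a numerical goal position), and the resulting states feed the next step.

```python
from dataclasses import dataclass


@dataclass
class Agent:
    """Toy 1-D agent on a single lane (hypothetical, for illustration only)."""
    name: str
    pos: float    # position along the lane, in meters
    speed: float  # current speed, in m/s
    goal: float   # numerical prompt: target position for this agent


def toy_policy(agent, others):
    """Hand-written stand-in for a learned policy (an assumption, not ProSim).

    Brakes if another agent is less than 5 m ahead; otherwise drives
    toward the goal and stops once the goal is reached.
    """
    gap = min((o.pos - agent.pos for o in others if o.pos > agent.pos),
              default=float("inf"))
    if gap < 5.0 or agent.pos >= agent.goal:
        target_speed = 0.0
    else:
        target_speed = 10.0
    # First-order tracking of the target speed.
    return agent.speed + 0.5 * (target_speed - agent.speed)


def rollout(agents, steps, dt=0.5):
    """Closed-loop rollout: each step reacts to the states produced by the last."""
    history = []
    for _ in range(steps):
        # Compute all actions from the current joint state first...
        new_speeds = [toy_policy(a, [o for o in agents if o is not a])
                      for a in agents]
        # ...then advance every agent simultaneously.
        for a, v in zip(agents, new_speeds):
            a.speed = v
            a.pos += v * dt
        history.append({a.name: a.pos for a in agents})
    return history
```

In this sketch, changing one agent's `goal` changes its own motion and, through the gap check, the motion of any agent behind it, which is the kind of prompt-conditioned interaction the task description asks for.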
ProSim consists of three main modules:
ProSim prioritizes three key properties:
To provide realistic and diverse agent motion data with multimodal prompts for our task, we propose ProSim-Instruct-520k: a high-quality paired prompt-scenario dataset with 520K real-world driving scenarios from the Waymo Open Motion Dataset (WOMD).
ProSim-Instruct-520k includes multimodal prompts for more than 10M unique agents, representing over 575 hours of driving data. For each scenario, it includes Goal Points and Route Sketches for every agent. For most agents, it also contains Action Tags that describe agent movement behaviors, which are automatically labeled and then quality-checked by humans. Finally, ProSim-Instruct-520k includes 20 Text Instructions for each scenario, describing both scenario-level and agent-level behaviors and interactions, labeled by Llama3-70B given the action tags and metadata. In total, ProSim-Instruct-520k contains over 10M text prompts.
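As a rough illustration of how the four prompt types fit together, the sketch below models one scenario record with Python dataclasses and checks the headline text-prompt count. The class and field names (`AgentPrompts`, `ScenarioRecord`, and so on) and the sample values are assumptions invented for this example; they are not the dataset's actual schema or file format.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Tuple

Point = Tuple[float, float]  # (x, y) in some map frame; an assumption


@dataclass
class AgentPrompts:
    """Per-agent prompts (hypothetical field names)."""
    goal_point: Point
    route_sketch: List[Point]
    # Only "most" agents carry Action Tags, so this may be empty.
    action_tags: List[str] = field(default_factory=list)


@dataclass
class ScenarioRecord:
    """One paired prompt-scenario entry (hypothetical layout)."""
    scenario_id: str
    agents: Dict[str, AgentPrompts]
    text_instructions: List[str]  # 20 per scenario according to the text


# Sanity check on the stated totals: 520K scenarios x 20 Text Instructions.
NUM_SCENARIOS = 520_000
TEXTS_PER_SCENARIO = 20
total_text_prompts = NUM_SCENARIOS * TEXTS_PER_SCENARIO
print(total_text_prompts)  # 10400000, consistent with "over 10M text prompts"

# A made-up record; the tag and instruction strings are placeholders.
example = ScenarioRecord(
    scenario_id="demo",
    agents={"A0": AgentPrompts(goal_point=(30.0, 2.0),
                               route_sketch=[(0.0, 0.0), (15.0, 1.0), (30.0, 2.0)],
                               action_tags=["placeholder_tag"])},
    text_instructions=["<A0> placeholder instruction."],
)
```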
Here we show examples of different types of Text Instructions from ProSim-Instruct-520k. Note that all agent names are referred to as <id>, where "id" is the abbreviated agent ID in the WOMD dataset.
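When working with instructions that use the <id> convention, it can be handy to recover which agents a sentence refers to. The helper below is a small sketch under stated assumptions: the function name `agent_ids` is hypothetical, the sample sentence is made up, and the regular expression simply treats any whitespace-free token in angle brackets as an agent reference, since the exact form of the abbreviated IDs is not specified here.

```python
import re

# Any token without spaces or angle brackets, wrapped in <...> (an assumption).
AGENT_REF = re.compile(r"<([^<>\s]+)>")


def agent_ids(instruction):
    """Return the agent IDs mentioned in an instruction, in first-seen order."""
    seen = []
    for agent_id in AGENT_REF.findall(instruction):
        if agent_id not in seen:
            seen.append(agent_id)
    return seen


# Made-up sentence, not taken from the dataset.
print(agent_ids("<A0> slows down while <A3> passes <A0>."))  # ['A0', 'A3']
```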
Simple agent behaviors:
Temporal transitions of agent behaviors:
Scenario-level properties:
Let's take a closer look at one example of ProSim in the promptable closed-loop traffic simulation task.
We can observe ProSim's two core properties from some examples above:
We thank Yue Zhao, Vincent Cho, Jerry Ouyang-Zhang, and Brady Zhou for their insightful discussions.
@inproceedings{tan2024promptable,
  title={Promptable Closed-loop Traffic Simulation},
  author={Tan, Shuhan and Ivanovic, Boris and Chen, Yuxiao and Li, Boyi and Weng, Xinshuo and Cao, Yulong and Kr{\"a}henb{\"u}hl, Philipp and Pavone, Marco},
  booktitle={8th Annual Conference on Robot Learning},
  year={2024},
}