ReconDreamer++: Harmonizing Generative and Reconstructive Models
for Driving Scene Representation

  • Guosheng Zhao* 1,2
  • Xiaofeng Wang* 1,2
  • Chaojun Ni* 1,3
  • Zheng Zhu* 1
  • Wenkang Qin1
  • Guan Huang1
  • Xingang Wang2
  • 1 GigaAI
  • 2 Institute of Automation, Chinese Academy of Sciences
  • 3 Peking University

Comparison of ReconDreamer++ with SOTA methods, Street Gaussians and ReconDreamer, on original and novel trajectories. Left: ReconDreamer++ demonstrates superior rendering performance for both vehicle foregrounds and road surfaces compared to existing SOTA methods. Right: ReconDreamer++ significantly improves performance on novel trajectories while maintaining high rendering quality on the original trajectory.

The overall framework of ReconDreamer++.

Abstract

Combining reconstruction models with generative models has emerged as a promising paradigm for closed-loop simulation in autonomous driving. For example, ReconDreamer has demonstrated remarkable success in rendering large-scale maneuvers. However, a significant gap remains between the generated data and real-world sensor observations, particularly in terms of fidelity for structured elements, such as the ground surface. To address these challenges, we propose ReconDreamer++, an enhanced framework that significantly improves the overall rendering quality by mitigating the domain gap and refining the representation of the ground surface. Specifically, ReconDreamer++ introduces the Novel Trajectory Deformable Network (NTDNet), which leverages learnable spatial deformation mechanisms to bridge the domain gap between synthesized novel views and original sensor observations. Moreover, for structured elements such as the ground surface, we preserve geometric prior knowledge in 3D Gaussians, and the optimization process focuses on refining appearance attributes while preserving the underlying geometric structure. Experimental evaluations conducted on multiple datasets (Waymo, nuScenes, PandaSet, and EUVS) confirm the superior performance of ReconDreamer++. Specifically, on Waymo, ReconDreamer++ achieves performance comparable to Street Gaussians for the original trajectory while significantly outperforming ReconDreamer on novel trajectories. In particular, it achieves substantial improvements, including a 6.1% increase in NTA-IoU, a 23. 0% improvement in FID, and a remarkable 4.5% gain in the ground surface metric NTL-IoU, highlighting its effectiveness in accurately reconstructing structured elements such as the road surface.

Experimental Results

Video
Single Frame
Original Trajectory
Lane Shift @ 3m
Original Trajectory
Lane Shift @ 3m

Rendering Results on Various Datasets.


Waymo


Original Trajectory

Lane Shift @ 3m

Lane Shift @ 6m


nuScenes


Original Trajectory

Lane Shift @ 3m

Lane Shift @ 6m


PandaSet


Original Trajectory

Lane Shift @ 2m

Lane Shift @ 3m

Vertical Shift @ 1m


EUVS


Train Set

Test Set

Ablation Studies on depth loss, ground model and NTDNet.

Original Trajectory

Lane Shift @ 3m

Lane Shift @ 6m

Citation

Acknowledgements

The website template was borrowed from Michaël Gharbi and Jon Barron.