Skip to Content
CSE5519CSE5519 Advances in Computer Vision (Topic I: 2025: Embodied Computer Vision and Robotics)

CSE5519 Advances in Computer Vision (Topic I: 2025: Embodied Computer Vision and Robotics)

link to paper 

Novelty in NWM

  • Conditional Diffusion Transformer
  • Use time and action to conditioning the diffusion process
Tip

This paper provides a new way to train navigation world models. Via conditioned diffusion, the model can generate an imagined trajectory in an unknown environment and perform navigation tasks.

However, the model collapses frequently when using out-of-distribution data, resulting in poor navigation performance. I wonder how we can further condition on the novelty of the environment and integrate exploration strategies to train the model online to fix the collapse issue. What might be the challenges of doing so in the Conditioned Diffusion Transformer?

Last updated on