ProphetDWM is an autonomous driving world model that jointly predicts future video frames and driving actions. It features a diffusion-based transition module and an action learning module, trained jointly for alignment.
Brings world models closer to real-world use cases by combining video imagination with action prediction, useful for self-driving and planning systems.