World Twin
Scene-level memory builds dynamic twins that preserve spatial structure and temporal continuity.
Human-Centric Scene-Level Controllable 5D World Model
Worlds are no longer only seen. They can be twinned, interacted with, controlled, and evolved.
EvoPhys-World moves beyond visual generation and camera navigation, enabling action-conditioned physical interaction, long-horizon prediction, and human-centered policy learning.
Scene-level memory builds dynamic twins that preserve spatial structure and temporal continuity.
Actions drive object responses, contact changes, and future state generation.
Human action space connects egocentric perception, hand motion, and embodied control.
The model does not only predict how the world looks. It imagines how different actions push the same world state into different futures.
Spatial scene structure, geometry, objects, and layout.
Space plus time, motion, continuity, and future state prediction.
Action-conditioned worldlines with memory, causality, feedback, and policy value.
A unified state-action world model integrates 4D spatiotemporal memory, next-state prediction, next-action prediction, and a self-evolving interaction loop.
World Engine builds dynamic scene-level twins that can be navigated, manipulated, and physically interacted with.
World Policy learns in a human-centered action space, then maps imagined interaction into embodied control.
Video slots are ready for project demos. Each block is designed for a short, looped clip with a direct task title.
World twin roaming
Physical response
Fine-grained control
Long-horizon rollout
Action chunk decoding
Human-to-robot transfer
Multiple futures
Embodied execution
Virtual interaction generates experience, policy learning turns experience into action, and real-world feedback closes the loop.
Human-centered egocentric observations and interaction traces.
Scene-level memory creates dynamic worlds for interaction.
Actions produce future states, contact changes, and outcomes.
Generated experience feeds embodied policy learning.
Given different actions, EvoPhys-World imagines different physically grounded futures and selects actionable paths.
Led by Prof. Shanghang Zhang.