Embodied Intelligence
Train AI agents safely in high-fidelity digital environments. Enable trial-and-error learning with unlimited scenario variations for robust real-world deployment.
High-fidelity, real-time interactive world simulator by Robbyant featuring 10-minute video generation, 16 FPS interactivity, and zero-shot generalization.
LingBot-World is an open-source world simulator developed by Robbyant, the embodied AI subsidiary of Ant Group. Built on a sophisticated Diffusion Transformer architecture, lingbot-world provides researchers and developers with a powerful digital sandbox for training AI agents, testing autonomous driving systems, and creating dynamic game environments.
Generate up to 10 minutes of continuous, stable video with zero drift, maintaining object integrity and scene consistency throughout extended sequences.
Experience responsive control at 16 frames per second with sub-second latency. Control characters, camera perspectives, and environment parameters in real-time.
Transform any single image or game screenshot into a fully interactive video stream without additional training, drastically reducing deployment costs.
Advanced reasoning capabilities that track off-screen objects and maintain spatial awareness, enabling sophisticated multi-step task planning.
LingBot-World employs a multi-stage training pipeline that transforms a standard video generation model into a real-time, interactive simulator:
The hybrid data engine combines real-world footage, gameplay recordings, and Unreal Engine synthetic data to learn complex world dynamics and causal relationships.
Read the full technical paper on arXiv →Train AI agents safely in high-fidelity digital environments. Enable trial-and-error learning with unlimited scenario variations for robust real-world deployment.
Test self-driving algorithms across diverse scenarios, weather conditions, and edge cases without real-world risks or costs.
Create dynamic, interactive game worlds with unprecedented efficiency. Generate responsive environments from concept art or reference images.
We believe in collaborative innovation. LingBot-World is completely open-source, with code, models, and documentation freely available to the global research community.
LingBot-World matches industry leaders like Google Genie 3 in video quality, dynamism, long-term consistency, and interactive responsiveness.
Robbyant (Ant Lingbo Technology) is the embodied intelligence division of Ant Group, focused on creating sophisticated AI systems that perceive, understand, and interact with the physical world.
Part of the LingBot Series:
Our mission: Advance AGI innovation through open collaboration and real-world applications.