BAAI Embodied AI

BAAI Embodied AI Embodied Intelligence One-Stop Platform: an end-to-end, closed-loop system for data, models, and training.

BAAI RoboCOIN, with 3M+ downloads in the open-source community! đŸ”„đŸ”„đŸ”„ Project: http://flagopen.github.io/RoboCOIN/ & https...
06/03/2026

BAAI RoboCOIN, with 3M+ downloads in the open-source community! đŸ”„đŸ”„đŸ”„
Project: http://flagopen.github.io/RoboCOIN/ & https://huggingface.co/RoboCOIN
RoboCOIN delivers an unprecedented collection for dexterous bimanual manipulation: · Unmatched Scale: 180,000+ trajectories, 421 tasks, across 16 real-world scenarios (home, lab, warehouse). · Broad Hardware Coverage: Collected from 15 different robot platforms, including dual-arm, semi-humanoid, and humanoid robots. · Rich & Structured: Features a unique 3-tier annotation system (trajectory, segment, frame-level) so robots learn both the "what" and the "how." · Proven Performance: Integrated annotations boosted success rates in complex tasks by up to 50% for models like π₀. 🔗 Learn More: http://arxiv.org/abs/2511.17441

BAAI WBC Framework Thor: Helping Humanoids “Stand Firm” Under Intense InteractionsRemember the Unitree G1 that effortles...
05/03/2026

BAAI WBC Framework Thor: Helping Humanoids “Stand Firm” Under Intense Interactions

Remember the Unitree G1 that effortlessly pulled a 1400kg car?
The core technology behind it is Thor, the embodied intelligence framework developed by BAAI.
Thor combines biomechanics principles with reinforcement learning, enabling robots to coordinate their whole body and adapt their posture like humans in high-load, high-contact real-world environments—achieving balanced stability and efficient output.

Key Technical Highlights
FAT2 (Force-adaptive Torso-tilt Reward): Guides robots to autonomously adjust posture and achieve human-like full-body reactions
Decoupled Network Architecture: Divides high-dimensional action space into modules for stable and coordinated whole-body control

🚀 Real-World Validation
✅ Pulling a 1400kg car — Stable walking under extreme load
✅ Opening a fire door (≈60N) — Single-arm unbalanced force still maintains self-stability
✅ Dragging an 85kg trolley — Coordinated upper and lower body output continuous force
✅ Wiping a whiteboard — Maintains balance under contact even without force sensors

Significance
Thor marks a critical step for humanoid robots to move from “performers” to “workers”, opening a new path toward human-level whole-body control in industrial, service, and rescue scenarios.

04/03/2026

Many robot dance demonstrations rely on pre-programmed motion playback.

Robo Perform takes a different approach.

By extracting musical features from audio, the system dynamically generates movements.

From scripted to adaptive, music-driven motion.

03/03/2026

Embodied AI Robot Dancing to Live Music

03/03/2026

Introducing BAAI RoboBrain 2.5closing the reliability gap in embodied AI with:

✅ Depth in Sight (Precise 3D Spatial Reasoning) — from 2D understanding to actionable 3D trajectories

✅ Time in Mind (Dense Temporal Value Estimation) — real-time progress feedback for robust long-horizon tasks

Achieving SOTA across multiple spatial and temporal reasoning benchmarks, RoboBrain 2.5 brings us closer to robots that work reliably in the real world.

Github: https://github.com/FlagOpen/RoboBrain2.5

RoboBrain-Audio is a native full-duplex omnimodal interaction system with lifelong memory. It supports simultaneous list...
02/03/2026

RoboBrain-Audio is a native full-duplex omnimodal interaction system with lifelong memory. It supports simultaneous listening and speaking, question-and-answer interruption, and personalized interaction based on users’ information and social relationships. It is particularly suitable for scenarios required by embodied agents, such as identity recognition, continuous interaction, being interrupted, and rapid response.
RoboBrain-Audio achieves full-duplex spoken dialog capabilities at the 7B model scale. Trained on approximately 1 million hours of audio-text paired data—only about 1% of the data volume of existing large-scale audio foundation models—it can still match or even outperform other models of the same type. In contrast to the time-division multiplexing (TDM) architecture commonly adopted by traditional spoken dialog models, its native full-duplex model architecture can reduce the response latency to around the 80-millisecond level.

The Impact of TDM and Native Full-Duplex Technology on the Responsiveness of Embodied Interaction Systems
RoboBrain-Audio supports multi-user identity recognition via facial recognition, voiceprint recognition and other means. Meanwhile, it can memorize users’ basic information, preferences and interpersonal relationships to construct a long-term memory and social relationship graph. Adopting an asynchronous process featuring parallel storage, retrieval and response, the model is designed with a human-like two-level memory system consisting of short-term and long-term memory. Such design enables long-term planning and cumulative learning in an embodied (robotic/physical) environment. The model achieves a facial recognition accuracy rate of 98.4% and a voiceprint recognition error rate of less than 1%. In noisy environments, its personalized conversation capability attains a factual correctness rate of 87.6% and a response quality score of 8.82 out of 10. Additionally, the system’s throughput rate exceeds 20 fps, which far surpasses the requirements for real-time voice conversations.

✹ Founded in 2018, BAAI is a leading non-profit AI research institute, building next-generation intelligent infrastructu...
02/03/2026

✹ Founded in 2018, BAAI is a leading non-profit AI research institute, building next-generation intelligent infrastructure for embodied AI.

🌍Our Embodied Intelligence One-Stop Platform provides an end-to-end, closed-loop system covering data collection, annotation, management, model training, evaluation, and deployment—delivering high-quality embodied data to support diverse robot embodiments and real-world application scenarios. 🌐

Find us more: https://ei2.baai.ac.cn/home

Address

150 Chengfu Rd, Haidian District
Beijing

Alerts

Be the first to know and let us send you an email when BAAI Embodied AI posts news and promotions. Your email address will not be used for any other purpose, and you can unsubscribe at any time.

Share