BAAI Embodied AI

BAAI Embodied AI is a one-stop embodied intelligence platform: an end-to-end, closed-loop system covering data collection, data annotation, data management, model training, and more.

15/05/2026

Introducing ExoActor: Exocentric Video Generation as Generalizable Interactive Humanoid Control

How can humanoid robots learn rich interactions with the world — without relying on massive task-specific robot datasets?

ExoActor explores a new direction:
using third-person video generation as a unified interface for humanoid control.

Given a task instruction and scene context, ExoActor generates plausible interaction videos that implicitly model:
→ robot behavior
→ object interaction
→ environmental dynamics
→ task intent

These generated videos are then transformed into executable humanoid motions through motion estimation and a general whole-body controller.
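The flow described above (instruction and scene in, generated video, estimated motion, controller targets out) can be pictured as three chained stages. This is only a minimal sketch under stated assumptions, not ExoActor's code: every name here (`Frame`, `Pose`, `generate_video`, `estimate_motion`, `track_motion`, `run_pipeline`) is hypothetical, and each stage is a stub standing in for a real video model, motion estimator, and whole-body controller.

```python
from dataclasses import dataclass
from typing import List

# All types and functions below are hypothetical placeholders, not ExoActor's API.

@dataclass
class Frame:
    """One generated third-person video frame (pixel data omitted in this sketch)."""
    index: int

@dataclass
class Pose:
    """A whole-body reference pose estimated from one frame."""
    frame_index: int
    joint_angles: List[float]

def generate_video(instruction: str, scene: str, num_frames: int = 4) -> List[Frame]:
    # Stub: a real system would query a large video generation model here.
    return [Frame(index=i) for i in range(num_frames)]

def estimate_motion(video: List[Frame], num_joints: int = 3) -> List[Pose]:
    # Stub: a real system would run motion estimation on each generated frame.
    return [Pose(f.index, [0.1 * f.index] * num_joints) for f in video]

def track_motion(poses: List[Pose]) -> List[List[float]]:
    # Stub: a real whole-body controller would track each reference pose
    # while handling balance and contact; here the targets pass through unchanged.
    return [p.joint_angles for p in poses]

def run_pipeline(instruction: str, scene: str) -> List[List[float]]:
    """Chain the three stages: video generation, motion estimation, control."""
    video = generate_video(instruction, scene)
    poses = estimate_motion(video)
    return track_motion(poses)

commands = run_pipeline("pick up the box", "a box on a table")
print(len(commands), "joint-target steps")
```

The point of the sketch is the interface: no stage is supervised with robot actions directly, and the video is the only thing passed from the generative model to the rest of the stack.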

Instead of directly supervising robot actions, ExoActor leverages the generative prior of large-scale video models to capture interaction-rich behaviors.

The result:
generalizable humanoid behaviors in unseen scenarios — without additional real-world data collection.

ExoActor explores a scalable path toward interaction-centric humanoid intelligence, where video generation becomes part of the control pipeline itself.

15/05/2026

BifrostUMI: Bridging Robot-Free Demonstrations and Humanoid Whole-Body Manipulation

Scaling humanoid whole-body visuomotor learning requires massive amounts of high-quality interaction data. However, most current data collection pipelines still rely heavily on robot teleoperation — often limited by expensive hardware setups, low accessibility, and inefficient operation.

Inspired by UMI, we present BifrostUMI — a portable, efficient, and robot-free data collection framework designed for humanoid robots.

BifrostUMI uses lightweight VR devices to capture natural human demonstrations as sparse keypoint trajectories while simultaneously recording wrist-mounted visual observations.

These multimodal signals are used to train a high-level policy that predicts future keypoint trajectories conditioned on visual inputs. Through a robust retargeting pipeline, the predicted trajectories are mapped onto humanoid morphology and executed via a whole-body controller.
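To make the retargeting step concrete, here is a minimal sketch of one very simple way to map a human keypoint trajectory onto a differently sized body: uniform scaling about a root keypoint. This is an assumption chosen for illustration, not BifrostUMI's retargeting pipeline, and the names `retarget_keypoint` and `retarget_trajectory`, the choice of root, and the limb lengths are all hypothetical.

```python
from typing import List, Tuple

Vec3 = Tuple[float, float, float]

def retarget_keypoint(human_point: Vec3, human_root: Vec3,
                      robot_root: Vec3, scale: float) -> Vec3:
    """Map one human keypoint into the robot frame.

    Assumption: uniform scaling of the offset from a root keypoint
    (for example, the shoulder), then re-anchoring at the robot's root.
    """
    return tuple(r + scale * (p - h)
                 for p, h, r in zip(human_point, human_root, robot_root))

def retarget_trajectory(trajectory: List[Vec3], human_root: Vec3, robot_root: Vec3,
                        human_limb_length: float, robot_limb_length: float) -> List[Vec3]:
    """Scale a sparse keypoint trajectory by the ratio of limb lengths."""
    if human_limb_length <= 0:
        raise ValueError("human_limb_length must be positive")
    scale = robot_limb_length / human_limb_length
    return [retarget_keypoint(p, human_root, robot_root, scale) for p in trajectory]

# Hypothetical wrist trajectory (metres) relative to a human shoulder at the origin.
human_wrist = [(0.0, 0.0, 0.0), (0.3, 0.0, 0.0), (0.6, 0.0, 0.3)]
robot_wrist = retarget_trajectory(
    human_wrist,
    human_root=(0.0, 0.0, 0.0),
    robot_root=(0.0, 0.0, 1.0),   # hypothetical robot shoulder position
    human_limb_length=0.6,
    robot_limb_length=0.3,
)
print(robot_wrist)
```

A real pipeline would also have to respect joint limits, self-collision, and balance, which is why the retargeted trajectory is handed to a whole-body controller rather than executed directly.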

This enables agile and diverse human behaviors to transfer naturally from human demonstrations to humanoid embodiments — without relying on traditional teleoperation systems.

We validate the framework across multiple experimental scenarios, demonstrating the effectiveness and versatility of robot-free humanoid data collection for whole-body manipulation.

12/05/2026

OmniUMI: Towards Physically Grounded Robot Learning via Human-Aligned Multimodal Interaction

Most robot learning systems still rely mainly on vision.

But contact-rich manipulation depends on:
→ touch
→ force
→ interaction dynamics

Without grounded physical feedback, teleoperation users tend to overcompensate during contact, leading to unstable force patterns and inefficient demonstrations.

OmniUMI introduces:
• RGB + depth
• tactile sensing
• grasping force
• external interaction wrench
• bilateral force feedback

within a compact handheld interface designed for collection–deployment consistency.

A key idea:
reuse the same motorized gripper across both demonstration and deployment to preserve physically grounded multimodal consistency.
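One way to picture collection-deployment consistency is as a check that frames recorded during demonstration and frames observed at deployment carry the same modalities with the same shapes. The sketch below assumes a hypothetical `MultimodalFrame` record; the field names, the units, and the helpers `validate`, `signature`, and `is_consistent` are illustrative and not taken from OmniUMI. The only fixed convention used is that a wrench has six components (three force, three torque).

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple

@dataclass
class MultimodalFrame:
    """One time step of handheld-interface data (hypothetical schema)."""
    rgb: List[List[int]]       # placeholder image: rows of pixel values
    depth: List[List[float]]   # placeholder depth map, assumed in metres
    tactile: List[float]       # tactile sensor readings
    grasp_force: float         # gripper grasping force, assumed in newtons
    wrench: List[float]        # external wrench [Fx, Fy, Fz, Tx, Ty, Tz]

def validate(frame: MultimodalFrame) -> None:
    """Reject frames that cannot be physically meaningful."""
    if len(frame.wrench) != 6:
        raise ValueError("wrench must have 6 components (3 force, 3 torque)")
    if frame.grasp_force < 0:
        raise ValueError("grasp force must be non-negative")

def _image_shape(image: List[list]) -> Tuple[int, int]:
    return (len(image), len(image[0]) if image else 0)

def signature(frame: MultimodalFrame) -> Dict[str, object]:
    """Summarise the shape of each modality, ignoring the values."""
    return {
        "rgb": _image_shape(frame.rgb),
        "depth": _image_shape(frame.depth),
        "tactile": len(frame.tactile),
        "wrench": len(frame.wrench),
    }

def is_consistent(demo: MultimodalFrame, deploy: MultimodalFrame) -> bool:
    """True when both frames are valid and share the same modality shapes."""
    validate(demo)
    validate(deploy)
    return signature(demo) == signature(deploy)

demo = MultimodalFrame([[0, 0], [0, 0]], [[0.5, 0.5], [0.5, 0.5]],
                       [0.0] * 4, 2.0, [0.0] * 6)
deploy = MultimodalFrame([[1, 1], [1, 1]], [[0.4, 0.4], [0.4, 0.4]],
                         [0.1] * 4, 1.5, [0.0, 0.0, 1.0, 0.0, 0.0, 0.0])
print(is_consistent(demo, deploy))
```

Reusing the same gripper hardware makes a check like this pass by construction, since the sensors that produce each field do not change between demonstration and deployment.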

Toward scalable contact-rich robot learning.

30/04/2026

"Historical Review: Embodied Intelligence Platform Integrates Pika Multimodal Data Solution"

27/04/2026

🚀 From papers to real robots — made simple.

SO-101, the open-source robot arm from Hugging Face LeRobot, is becoming a go-to choice for students and developers to get hands-on with embodied AI.

Now, it’s fully integrated into the BAAI Embodied AI Platform. 🎉

With seamless access to the official data collection pipeline, you can standardize the entire workflow:
→ task planning
→ execution
→ annotation & data management

This makes SO-101 not just easy to use, but actually useful for:
• research experiments
• method validation
• teaching & education

Turn your robot data into structured, reusable assets — not just demos.

17/04/2026

Lingyu Teleop Robot Is Now Live on BAAI

16/04/2026

Historical Review

BAAI One-Stop Embodied Intelligence Platform

Embodiment Integration | Franka Robot

15/04/2026

BAAI One-Stop Embodied Intelligence Platform · Embodiment Integration

14/04/2026

“Robot: I’m Getting Out of Here!”

13/04/2026

Watch This Robot Backflip

09/04/2026

How Strong Is BAAI Thor?

Address

150 Chengfu Road, Haidian District
Beijing
100083

Opening Hours

Monday 09:00 - 19:00
Tuesday 09:00 - 19:00
Wednesday 09:00 - 19:00
Thursday 09:00 - 19:00
Friday 09:00 - 19:00
