OpenHumanoid

Open-source agentic AI framework for voice-controlled humanoid robots. Currently running voice-driven locomotion on the Unitree G1 via OpenClaw and GR00T Whole-Body Control.

Built during HACK2026 Hackathon, organized by the ETH Robotics Club and won the 2nd place.

Hackathon Team

How It Works

Two switchable voice-control modes, both sharing a single HTTP bridge to the robot:

Mode	Latency	Input	Capabilities
Fast (`VOICE_MODE=realtime`)	~500ms	Voice (Realtime API)	Locomotion: walk, turn, stop, distance/timed/sequential
Full (`VOICE_MODE=openclaw`)	~2-5s	Voice + Text + WhatsApp (OpenClaw)	Locomotion with personality (Theo), multi-channel access

See docs/architecture.md for the full architecture and data flow.

Prerequisites

Python 3.10+
uv (Python package manager)
Docker (for the WBC container)
A Unitree G1 robot connected via Ethernet (or use mock mode for dev)
An OpenAI API key with Realtime API access
A working microphone and speaker (for voice modes)

Quick Start

1. Clone and install

sudo apt-get install -y libportaudio2
git clone git@github.com:alexzh3/OpenHumanoid.git
cd OpenHumanoid
uv sync

2. Configure

cp .env.example .env
# Edit .env and set OPENAI_API_KEY

3. Set up the WBC (one-time)

git lfs install
git clone https://github.com/NVlabs/GR00T-WholeBodyControl.git
cd GR00T-WholeBodyControl/decoupled_wbc
./docker/run_docker.sh --install --root    # first time: pulls Docker image
./docker/run_docker.sh --root              # subsequent runs: enters container

Container uses --network host so the bridge port (8765) is accessible from the host. Container name: decoupled_wbc-bash-root.

4. Launch bridge + control loop

# Simulation (MuJoCo)
./scripts/start_bridge.sh

# Real robot
./scripts/start_bridge.sh real

Verify: curl http://localhost:8765/status

Kill bridge: docker exec decoupled_wbc-bash-root pkill -9 -f run_with_bridge.py

Without Docker/robot: Run uv run python bridge/mock_bridge.py instead. Same API, prints to console.

Real robot prerequisites

Before start_bridge.sh real will work, the host ethernet NIC must have an IPv4 address on the robot subnet. CycloneDDS (used by the Unitree SDK) ignores interfaces without an IP.

# 1. Assign IP to the robot NIC (one-time per boot)
sudo ip addr add 192.168.123.222/24 dev enp0s31f6

# 2. Allow DDS multicast traffic through the firewall
sudo ufw allow in on enp0s31f6

# 3. Put the robot in damping mode (L2+B on controller) before launching

Different laptop? You may need to change the NIC name. Find yours with:

ip link show          # look for the wired ethernet interface

Then either set it inline or export it:

ROBOT_NIC=eth0 ./scripts/start_bridge.sh real

5. Run a voice mode

Fast mode (OpenAI Realtime API):

uv run python -m realtime.main

Voice commands:

"get ready" / "stand up" — activate robot (required first)
"walk forward" — continuous until "stop"
"walk forward slowly" / "walk forward fast" — speed control
"walk forward for 3 seconds" — timed, auto-stops
"walk forward 2 meters" — distance-based
"walk forward 1 meter then turn right" — sequential
"release" / "relax" — toggle hold/limp
"stop" — immediate halt

Full mode (OpenClaw Gateway):

cd openclaw && bash setup.sh && cd ..
openclaw gateway start

Open http://127.0.0.1:18789 for WebChat, or use Talk Mode for voice. Supports text and voice via WhatsApp when configured.

Bridge HTTP API

Base URL: http://localhost:8765 (configurable via BRIDGE_PORT)

Velocities are written directly to the WBC neural network policy — any float value is accepted, no quantization.

Locomotion

Method	Endpoint	Example	Description
POST	`/move`	`curl -s -X POST http://localhost:8765/move -H 'Content-Type: application/json' -d '{"vx":0.4,"vy":0.0,"vyaw":0.0}'`	Set velocity `[vx, vy, vyaw]` directly on `policy.cmd`
POST	`/stop`	`curl -s -X POST http://localhost:8765/stop`	Zero all velocities
POST	`/activate`	`curl -s -X POST http://localhost:8765/activate`	Activate walking policy
POST	`/deactivate`	`curl -s -X POST http://localhost:8765/deactivate`	Deactivate policy
POST	`/key`	`curl -s -X POST http://localhost:8765/key -H 'Content-Type: application/json' -d '{"key":"9"}'`	Send a raw key event (`9`=release/hold, `1`/`2`=base height, etc.)

Speed reference: slow=0.2, medium=0.4, fast=0.6 m/s.

Status

Method	Endpoint	Description
GET	`/status`	Returns current velocity, actual `policy.cmd`, and `policy_connected` flag

curl -s http://localhost:8765/status | python3 -m json.tool

Testing

# Terminal 1: mock bridge
uv run python bridge/mock_bridge.py

# Terminal 2: test
curl -X POST http://localhost:8765/activate
curl -X POST http://localhost:8765/move -H 'Content-Type: application/json' -d '{"vx": 0.4}'
curl -X POST http://localhost:8765/stop

Roadmap

Task	Status	Description
Task 1 — OpenClaw + WBC	Done	Voice -> locomotion pipeline via shared bridge
Task 2 — SLAM/LiDAR Navigation	Scaffolded	3D localization built (FAST-LIO + Open3D), not yet connected
Task 3 — VLA + Navigation + WBC	Planned	Perception, manipulation, VLA integration

See docs/README_future.md for details on planned features.

Project Structure

OpenHumanoid/
├── bridge/              # Bridge server (run_with_bridge.py for Docker, mock for host)
├── realtime/            # Fast mode: OpenAI Realtime API voice client
├── openclaw/            # Full mode: OpenClaw Gateway config, skills, workspace
├── scripts/             # Launch and utility scripts
├── docs/                # Architecture docs, planning assets, roadmap
├── GR00T-WholeBodyControl/  # NVIDIA WBC repo (gitignored, clone separately)
├── CONTEXT.md           # AI-readable project context
├── .env.example         # Environment variable template
└── pyproject.toml       # Python dependencies (uv sync)

Documentation

License

MIT — see LICENSE for details.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

OpenHumanoid

Hackathon Team

How It Works

Prerequisites

Quick Start

1. Clone and install

2. Configure

3. Set up the WBC (one-time)

4. Launch bridge + control loop

Real robot prerequisites

5. Run a voice mode

Bridge HTTP API

Locomotion

Status

Testing

Roadmap

Project Structure

Documentation

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 27 Commits
bridge		bridge
capabilities		capabilities
docs		docs
openclaw		openclaw
realtime		realtime
scripts		scripts
.env.example		.env.example
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Folders and files

Latest commit

History

Repository files navigation

OpenHumanoid

Hackathon Team

How It Works

Prerequisites

Quick Start

1. Clone and install

2. Configure

3. Set up the WBC (one-time)

4. Launch bridge + control loop

Real robot prerequisites

5. Run a voice mode

Bridge HTTP API

Locomotion

Status

Testing

Roadmap

Project Structure

Documentation

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages