I’m East Wu, currently focusing on Multimodal Large Language Models (LLMs) and Robotics.
Previously, I spent years working in Computer Vision and AIGC (Image/Video Generation).
My goal is to build intelligent agents that see, think, and act in the real world.
- 🤖 Robotics / Embodied AI
- 🧠 Multimodal LLMs (Vision–Language–Action models)
- 🔍 Computer Vision
- 🎨 AIGC (Images / 3D / Motion Generation)
- Building Multimodal LLM-driven embodied agents for real-world understanding
- Open-source tools and research related to multimodal and robotic intelligence