- The Promise of Generalist Robotic Policies
- GitHub - huggingface/lerobot: 🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
Vision Language Action Models
- [2406.09246] OpenVLA: An Open-Source Vision-Language-Action Model
- RT-2: Vision-Language-Action Models
- [2405.14093] A Survey on Vision-Language-Action Models for Embodied AI
- OpenVLA: LeRobot Research Presentation #5 by Moo Jin Kim - YouTube
Ideas
- o1 style CoT planning for action planning