Open X-Embodiment datasets in LeRobot format with standard transfomation (https://github.com/Tavish9/any4lerobot)
AI & ML interests
Embodied AI
Recent Activity
View all activity
EmbodiedOneVision is a unified framework for multimodal embodied reasoning and robot control, featuring interleaved vision-text-action pretraining.
-
EmbodiedOneVision: Interleaved Vision-Text-Action Pretraining for General Robot Control
Paper • 2508.21112 • Published • 77 -
IPEC-COMMUNITY/EO-1-3B
Robotics • Updated • 12 -
IPEC-COMMUNITY/EO-Data1.5M
Viewer • Updated • 739k • 3.6k • 11 -
IPEC-COMMUNITY/demos25
Viewer • Updated • 75 • 242
-
OpenFly: A Versatile Toolchain and Large-scale Benchmark for Aerial Vision-Language Navigation
Paper • 2502.18041 • Published • 1 -
IPEC-COMMUNITY/openfly-agent-7b
Image-Text-to-Text • 8B • Updated • 137 -
IPEC-COMMUNITY/OpenFly_DataGen
Updated • 399 • 1 -
IPEC-COMMUNITY/OpenFly-rlds
Updated • 3.02k
-
SpatialVLA: Exploring Spatial Representations for Visual-Language-Action Model
Paper • 2501.15830 • Published • 13 -
IPEC-COMMUNITY/spatialvla-4b-224-pt
Image-Text-to-Text • 4B • Updated • 12.7k • 11 -
IPEC-COMMUNITY/spatialvla-4b-mix-224-pt
Image-Text-to-Text • 4B • Updated • 425 • 4 -
IPEC-COMMUNITY/spatialvla-4b-224-sft-bridge
Robotics • 4B • Updated • 137 • 1
-
IPEC-COMMUNITY/libero_spatial_no_noops_1.0.0_lerobot
Viewer • Updated • 53k • 3.25k • 1 -
IPEC-COMMUNITY/libero_goal_no_noops_1.0.0_lerobot
Viewer • Updated • 52k • 2.84k -
IPEC-COMMUNITY/libero_object_no_noops_1.0.0_lerobot
Viewer • Updated • 67k • 2.84k -
IPEC-COMMUNITY/libero_10_no_noops_1.0.0_lerobot
Viewer • Updated • 101k • 3.29k • 1
Open X-Embodiment datasets in LeRobot format with standard transfomation (https://github.com/Tavish9/any4lerobot)
-
SpatialVLA: Exploring Spatial Representations for Visual-Language-Action Model
Paper • 2501.15830 • Published • 13 -
IPEC-COMMUNITY/spatialvla-4b-224-pt
Image-Text-to-Text • 4B • Updated • 12.7k • 11 -
IPEC-COMMUNITY/spatialvla-4b-mix-224-pt
Image-Text-to-Text • 4B • Updated • 425 • 4 -
IPEC-COMMUNITY/spatialvla-4b-224-sft-bridge
Robotics • 4B • Updated • 137 • 1
EmbodiedOneVision is a unified framework for multimodal embodied reasoning and robot control, featuring interleaved vision-text-action pretraining.
-
EmbodiedOneVision: Interleaved Vision-Text-Action Pretraining for General Robot Control
Paper • 2508.21112 • Published • 77 -
IPEC-COMMUNITY/EO-1-3B
Robotics • Updated • 12 -
IPEC-COMMUNITY/EO-Data1.5M
Viewer • Updated • 739k • 3.6k • 11 -
IPEC-COMMUNITY/demos25
Viewer • Updated • 75 • 242
-
IPEC-COMMUNITY/libero_spatial_no_noops_1.0.0_lerobot
Viewer • Updated • 53k • 3.25k • 1 -
IPEC-COMMUNITY/libero_goal_no_noops_1.0.0_lerobot
Viewer • Updated • 52k • 2.84k -
IPEC-COMMUNITY/libero_object_no_noops_1.0.0_lerobot
Viewer • Updated • 67k • 2.84k -
IPEC-COMMUNITY/libero_10_no_noops_1.0.0_lerobot
Viewer • Updated • 101k • 3.29k • 1
-
OpenFly: A Versatile Toolchain and Large-scale Benchmark for Aerial Vision-Language Navigation
Paper • 2502.18041 • Published • 1 -
IPEC-COMMUNITY/openfly-agent-7b
Image-Text-to-Text • 8B • Updated • 137 -
IPEC-COMMUNITY/OpenFly_DataGen
Updated • 399 • 1 -
IPEC-COMMUNITY/OpenFly-rlds
Updated • 3.02k