Robots:
Trossen AI Stationary Robot
pi0.5 learns a multi-prompt sub-task policy, e.g. 'pick up pink cube ...'
Gemini Robotics ER-1.5 HL controller chooses subtasks to 'put all cubes in bucket'
pi0.5 policy for puttting a bead on the string. But does it 'see' the string?
ACT policy rollout for popping lid.
ACT policy rollout for pouring little red box.
pi0.5 policy improved by augmenting datasets with human interventions.
Not all results are this good, but still much better than pre-DAgger!
pi0 LoRA finetuned policy from SIMULATED dataset with a red cube.
Hierarchical Planning
Recursive Tree Planner
Builds a stack of 5 blocks in Box2d
Paper: Pure Planning to Pure Policies and In Between with a Recursive Tree Planner