observation.images.wrist

observation.images.context

Language Instruction: put the cube in the box
