Dongyu supplement. Added as a candidate missing/adjacent dataset paper after searching primary arXiv sources. This is a draft summary for triage, not a full paper read.
One-liner. An open robot manipulation dataset with 60,096 trajectories across 24 environments, compatible with natural-language and goal-image conditioning.
Dataset plus experiments across imitation learning and offline RL methods; supports open-vocabulary multi-task learning with goal images or natural language instructions.
Good anchor for open language-conditioned robot learning data, even though it is not centered on non-visual sensing.
Vision/language/action focused; little direct tactile/audio/force/thermal coverage.
arXiv lines 38-40 report scale, language compatibility, and public sharing.