Dongyu supplement. Added as a candidate missing/adjacent dataset paper after searching primary arXiv sources. This is a draft summary for triage, not a full paper read.
One-liner. A tactile-language-action model for contact-rich peg-in-hole manipulation, paired with a 24k tactile action instruction dataset that is explicitly released with data and code.
Cross-modal language grounding over tactile sequences to generate robust contact-rich actions.
This is one of the cleanest missing items for the current map: it combines tactile sensing, language, action generation, contact-rich assembly, and open data.
Narrow task family: peg-in-hole assembly. It is excellent for contact-rich language grounding, but not a broad dynamic state-change benchmark.
arXiv lines 31-41 report title, dataset size, release claim, and project website.