Multimodal Dataset Map
This is a static visualization tool for surveying datasets related to language grounding, multimodal perception, and robotic manipulation.
Contents
- 149 catalog entries
- 91 entries from the Professor Survey
- 58 supplemental entries
- 140 dataset-relevant rows
- 92 rows that introduce a dataset, benchmark, or simulator
- 87 rows with open or partial data availability
- Evidence-backed filters for modalities, language annotations, data supervision, task families, concept types, environments, and data availability
How to View
Run a small local static server from this folder:
python -m http.server 8000
Then open:
http://localhost:8000/app/
If port 8000 is already in use:
python -m http.server 8010
Then open:
http://localhost:8010/app/
Included Files
app/: static frontenddata/: generated catalog JSON filessource_materials/*/knowledge/: local HTML summaries used by the Summary linkssource_materials/professor_survey_raw/index.html: original professor survey navigation pagereports/filter_audit_2026-06-28.md: audit summary and per-row evidencereports/filter_audit_2026-06-28.csv: full audit table
The share package intentionally excludes raw PDFs, development scripts, git metadata, cache folders, and temporary files. Paper links in the app point to online pages such as arXiv or project pages when available.