Multimodal Dataset Map

This is a static visualization tool for surveying datasets related to language grounding, multimodal perception, and robotic manipulation.

149 catalog entries
91 entries from the Professor Survey
58 supplemental entries
140 dataset-relevant rows
92 rows that introduce a dataset, benchmark, or simulator
87 rows with open or partial data availability
Evidence-backed filters for modalities, language annotations, data supervision, task families, concept types, environments, and data availability

How to View

Run a small local static server from this folder:

python -m http.server 8000

Then open:

http://localhost:8000/app/

If port 8000 is already in use:

python -m http.server 8010

Then open:

http://localhost:8010/app/

Included Files

app/: static frontend
data/: generated catalog JSON files
source_materials/*/knowledge/: local HTML summaries used by the Summary links
source_materials/professor_survey_raw/index.html: original professor survey navigation page
reports/filter_audit_2026-06-28.md: audit summary and per-row evidence
reports/filter_audit_2026-06-28.csv: full audit table

The share package intentionally excludes raw PDFs, development scripts, git metadata, cache folders, and temporary files. Paper links in the app point to online pages such as arXiv or project pages when available.

Dongyu Luo

Multimodal Dataset Map

Contents

How to View

Included Files