Ego4D
Meta AI (Facebook AI Research) and academic consortium · 2022 · datasets.bot
One-liner. Meta-led, massive-scale egocentric video dataset and benchmark suite: 3,670 hours of human daily-life footage from 923 camera wearers in 9 countries.
Setup
- Datasets / benchmarks: Ego4D is a massive-scale dataset of 3,670 hours of daily-life egocentric (first-person) video captured by 923 unique camera wearers across 74 locations in 9 countries, built by a consortium of Facebook AI (Meta) and 13 universities. Portions of the data include audio, 3D environment meshes, eye gaze, stereo, multi-camera footage, IMU, and dense textual narrations. The data supports five benchmark suites: episodic memory, hands-and-objects, audio-visual diarization, social interaction, and forecasting. It captures human activity only and contains no robot embodiment, but it is widely used in egocentric-perception research and, in embodied AI, as a pretraining corpus for robot manipulation. License: research-only. Download: https://ego4d-data.org/docs/start-here/.
- Hardware / simulator: Embodiment: human. Environment: home, industrial, office, outdoor, retail, kitchen. Realness: physical.
Schema
Per-video MP4 streams + JSON annotation/metadata files; benchmark clips, precomputed features, narrations.
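As a rough illustration of the layout described above, the following sketch indexes a local copy by pairing MP4 files with whatever JSON files sit alongside them. The function names `index_download` and `summarize` are hypothetical, and the code deliberately assumes nothing about directory names or about the keys inside the JSON files; consult the official documentation for the actual schema.

```python
import json
from pathlib import Path


def index_download(root):
    """Collect MP4 paths (keyed by file stem) and parsed JSON files under root.

    Assumption: videos end in ".mp4" and annotation/metadata files end in
    ".json". No particular folder structure or JSON schema is assumed.
    """
    root = Path(root)
    videos = {p.stem: p for p in sorted(root.rglob("*.mp4"))}
    annotations = {}
    for p in sorted(root.rglob("*.json")):
        with p.open(encoding="utf-8") as f:
            # Key by path relative to root so same-named files do not collide.
            annotations[p.relative_to(root).as_posix()] = json.load(f)
    return videos, annotations


def summarize(videos, annotations):
    """Return simple counts that hold regardless of the JSON contents."""
    return {
        "num_videos": len(videos),
        "annotation_files": sorted(annotations),
    }
```

Calling `index_download` on a local download directory and passing the result to `summarize` gives a quick sanity check that both video and annotation files are present before any benchmark-specific parsing is attempted.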
Links