Ego4D: Around the World in 3,000 Hours of Egocentric Video

Kristen Grauman; Andrew Westbury; Eugene H. Byrne; Zachary Chavis; Antonino Furnari; Rohit Girdhar; Jackson Hamburger; Hao Jiang; Miao Liu; Xingyu Liu

doi:10.1109/cvpr52688.2022.01842

Ego4D: Around the World in 3,000 Hours of Egocentric Video

dc.contributor.author	Kristen Grauman
dc.contributor.author	Andrew Westbury
dc.contributor.author	Eugene H. Byrne
dc.contributor.author	Zachary Chavis
dc.contributor.author	Antonino Furnari
dc.contributor.author	Rohit Girdhar
dc.contributor.author	Jackson Hamburger
dc.contributor.author	Hao Jiang
dc.contributor.author	Miao Liu
dc.contributor.author	Xingyu Liu
dc.coverage.spatial	Bolivia
dc.date.accessioned	2026-03-22T13:50:20Z
dc.date.available	2026-03-22T13:50:20Z
dc.date.issued	2022
dc.description	Citaciones: 484
dc.description.abstract	We introduce Ego4D, a massive-scale egocentric video dataset and benchmark suite. It offers 3,670 hours of dailylife activity video spanning hundreds of scenarios (household, outdoor, workplace, leisure, etc.) captured by 931 unique camera wearers from 74 worldwide locations and 9 different countries. The approach to collection is designed to uphold rigorous privacy and ethics standards, with consenting participants and robust de-identification procedures where relevant. Ego4D dramatically expands the volume of diverse egocentric video footage publicly available to the research community. Portions of the video are accompanied by audio, 3D meshes of the environment, eye gaze, stereo, and/or synchronized videos from multiple egocentric cameras at the same event. Furthermore, we present a host of new benchmark challenges centered around understanding the first-person visual experience in the past (querying an episodic memory), present (analyzing hand-object manipulation, audio-visual conversation, and social interactions), and future (forecasting activities). By publicly sharing this massive annotated dataset and benchmark suite, we aim to push the frontier of first-person perception. Project page: https://ego4d-data.org/
dc.identifier.doi	10.1109/cvpr52688.2022.01842
dc.identifier.uri	https://doi.org/10.1109/cvpr52688.2022.01842
dc.identifier.uri	https://andeanlibrary.org/handle/123456789/43015
dc.language.iso	en
dc.relation.ispartof	2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
dc.source	The University of Texas at Austin
dc.subject	Suite
dc.subject	Computer science
dc.subject	Benchmark (surveying)
dc.subject	Gaze
dc.subject	Event (particle physics)
dc.subject	Perception
dc.subject	Identification (biology)
dc.subject	Multimedia
dc.subject	Artificial intelligence
dc.subject	Human–computer interaction
dc.title	Ego4D: Around the World in 3,000 Hours of Egocentric Video
dc.type	article

Collections

Artículo Científico Publicado

Ego4D: Around the World in 3,000 Hours of Egocentric Video

Files

Collections