New

Landing Page

We added a landing page covering all available datasets, where you can explore the annotations and datasets our team provides. Click any of the cards to navigate to the corresponding dataset!

Connected Datasets

Audio Annotations

EPIC-Sounds

ICASSP 2023 TPAMI 2025

A large-scale audio dataset identifying temporal segments of "actions that sound". Distinguishes auditory events from visual events with material discrimination.

Audio-Based Recognition, Audio-Based Detection

This dataset is collected from the audio stream of EPIC-KITCHENS-100 to provide annotations of audio events, complementing the existing visual annotations.

Derived From

Core Data ECCV 2018 TPAMI 2021 IJCV 2022

EPIC-KITCHENS-100

100 hours of unscripted egocentric footage from 45 kitchens. The foundation for EPIC-Sounds (above) and the datasets below.

Action Recognition, Detection, Anticipation, Domain Adaptation

RGB Videos

Also Providing

Segmentation Masks

VISOR

NeurIPS 2022

Pixel-wise segmentation of hands and active objects, featuring dense interpolations and contact relations.

EPIC-Fields

NeurIPS 2023

3D camera information (extrinsics & intrinsics) for dynamic egocentric videos.

18.7M Registered Frames

671 Videos

96% Success Rate

45 Kitchens

Neural Rendering, D-NVS, UDOS
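
For readers unfamiliar with the terms, the camera information EPIC-Fields describes (extrinsics and intrinsics) is what allows a 3D point to be mapped to a pixel in a frame. The following is a minimal sketch of the standard pinhole projection, not the dataset's own loader or file format: the function name `project_point`, the world-to-camera convention, and all numeric values are assumptions chosen for illustration.

```python
def project_point(point_world, rotation, translation, fx, fy, cx, cy):
    """Project a 3D world point to pixel coordinates with a pinhole model.

    Assumptions (hypothetical, not taken from the dataset):
    rotation (3x3 nested lists) and translation (length 3) are
    world-to-camera extrinsics; fx, fy, cx, cy are intrinsics in pixels.
    """
    # Extrinsics: move the point from world to camera coordinates.
    cam = [
        sum(rotation[r][c] * point_world[c] for c in range(3)) + translation[r]
        for r in range(3)
    ]
    x, y, z = cam
    if z <= 0:
        return None  # point is behind the camera or on its plane
    # Intrinsics: perspective divide, then scale and shift to pixels.
    return (fx * x / z + cx, fy * y / z + cy)


# Illustrative values only: identity pose and a made-up 1920x1080 camera.
identity = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
pixel = project_point([0.5, -0.25, 2.0], identity, [0.0, 0.0, 0.0],
                      fx=1000.0, fy=1000.0, cx=960.0, cy=540.0)
print(pixel)  # (1210.0, 415.0)
```

With real data, the per-frame pose and the calibrated intrinsics would replace the placeholder values, and the pose convention (world-to-camera or camera-to-world) would need to be checked against the dataset's documentation.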

Highly Detailed Annotations

HD-EPIC

New

CVPR 2025

41 hours of unscripted multi-day recordings with highly detailed and interconnected ground-truth labels, grounded in 3D through digital twins of each scene.

"How" & "Why" Descriptions

Recipe Prep & Step Pairs

Gaze Priming

Digital Twins

VQA Benchmark

7 Categories: Recipes, Ingredients, Nutrition, Fine-grained Actions, 3D Perception, Object Motion, and Gaze.

26.6K Questions