Skip to main content

Egocentric Data

Hand pose overlay on egocentric camera images

Examples of hand pose overlays on egocentric camera images.

EgoSuite egocentric data is available in two complementary formats: MCAP Data and LeRobot Data. Both represent the same kind of egocentric human activity episodes, but they are optimized for different workflows.

FormatBest forMain contents
MCAP DataRaw multimodal episode playback, inspection, conversion, and visualizationProtobuf topics for metadata, pose, RGB camera, depth camera, point cloud, audio, and semantic annotation
LeRobot DataRobot learning datasets, model training, and data loading pipelinesParquet state data, MP4 videos, task/subtask metadata, episode stats, optional point clouds, and the original annotation file

MCAP Data

MCAP is the native EgoSuite recording format. A single .mcap file stores synchronized topics for an episode. It is useful when you need the original multimodal streams, timestamps, calibration topics, or Foxglove visualization.

The MCAP documentation is organized by topic group:

SectionContent
MetadataSession-level information, capture devices, task info, operator, and global timing
PoseHead, Hand, and Body pose in world coordinates
SensorRGB Camera, Depth Camera, PointCloud, and Audio
Semantic AnnotationTask, subtask/action segments, skill names, and segment time bounds

RGB camera streams are documented separately for head-mounted and wrist-mounted cameras. Depth camera data is documented as a sibling sensor page because it uses depth images, intrinsics, and extrinsics rather than RGB video semantics.

LeRobot Data

LeRobot Data is the training-oriented export format. It converts an EgoSuite episode into a LeRobot v3.0-style dataset folder with:

  • data/chunk-*/file-*.parquet for per-frame state data.
  • videos/observation.images.{cam}/chunk-*/file-*.mp4 for camera streams.
  • meta/tasks.parquet and meta/subtasks.parquet for task and subtask index mappings.
  • meta/episodes/chunk-*/file-*.parquet for episode-level metadata and statistics.
  • Optional pointcloud/ files and the original annotation.json.

See Format Specification for the folder structure, parquet schemas, pose joint conventions, and task/subtask metadata layout.

Choosing a Format

Use MCAP Data when you want topic-level access, calibration data, visualization, or conversion from the original recording. Use LeRobot Data when you want a dataset layout that is ready for robot learning workflows, including frame-aligned state columns and video files.