Rerun Hub
The production backend for the Rerun data layer. Catalog, byte-range indexing, and retrieval that turns your object stores into a queryable, streamable foundation. Run transforms on the edge or close to the data.
Petabyte- scale
Across your object storage, any bucket or region
Direct streaming
Byte-range reads, straight from your storage to compute
Built for physical data
Multi-rate, multimodal data, kept in its native shape
Build your loop your way with the open-source SDK. Hub handles the hard parts: consistency and scale for physical data.
Teams win by iterating fast on data composition and modeling while scaling data and compute.
The same open-source SDK drives every stage, on your own compute and tools. It connects to Hub over an open protocol.
The catalog, schema management, byte-range indexing, and streaming that keep physical data consistent and queryable at scale.
Four capabilities on one catalog over your object storage, already handling petabytes of robot data today.
Query
Run any SQL or dataframe query across your catalog, down into the columns, time ranges, and values inside your recordings, not just their metadata.
Transform
Add derived columns and evolve schemas without breaking history. You run the transforms with the SDK; Hub keeps the derived data and your raw recordings organized together.
Train
Express a dataset mix as a query and stream it to your GPUs. The dataloader is column-aware and video-codec-aware, so you train directly on your recordings.
Share
One viewer, the same recordings, shared across the team. Explore, annotate, and trace a failure back to the data that caused it.
Your own isolated Hub deployment, run for you in the cloud region you choose.
Any S3-compatible bucket, across regions. You decide where it lives.
Enterprise-ready with broad SSO support, self-managed storage options, and all the controls your security team expects. Designed for low friction at any scale: from empowering small research teams to facilitating secure cross-org data sharing.
Book a meeting to see Hub against your stack: your data, your storage, your training cluster.