Ashmyr Data Pipeline Demo

Interactive walk-through

Four stages. One deployment session.

Use the controls to step through what happens after the robot starts operating. The right-hand panels stay anchored on the same example session and update as the pipeline advances.

Stage 01

Capture

Continuous synchronized session logging.

Stage 02

Failure slice

Pin the exact failure window and state.

Stage 03

Recovery cut

Isolate the corrective trajectory cleanly.

Stage 04

Export

Bundle it for retraining and eval.

Capture

Every rollout is logged as a continuous, synchronized session.

Before anything fails, the pipeline has already aligned the streams that matter: camera feeds, robot state, task ID, environment metadata, and operator notes. That makes later segmentation possible without manual reconstruction.

Wall-clock and robot-clock alignment on ingest
Per-site metadata attached at session start
Task-level markers inserted before rollout begins

Captured streams cam_front.mp4 · cam_wrist.mp4 · joints.parquet · session.json

The raw layer is still retained, but the pipeline immediately organizes it around the rollout rather than leaving it as an unlabeled dump.

Why it matters

If time sync is wrong, recovery data is noisy. The value of the final correction depends on capturing the pre-failure context, the actual error event, and the human intervention in one coherent timeline.

Failure slice

The pipeline narrows a full rollout into the exact failure interval.

Once the robot slips, stalls, collides, or diverges, the system clips around the event and records a failure code. This is where a long session stops being generic runtime footage and starts becoming usable supervision.

Failure type tagged against a stable taxonomy
Pre-failure buffer preserved for causal context
Operator handoff timestamp attached to the event

Window extracted -8.0s to +5.2s around GRASP_SLIP

The clip includes the lead-in, not only the bad frame, because labs need to see how the policy approached failure as much as the failure itself.

Eval benefit

The same failure slice becomes evaluation infrastructure if it is tagged consistently across sites. Ashmyr can compare how policies fail before and after retraining, not just whether they fail.

Recovery cut

The operator correction is isolated as its own trajectory.

This is the highest-signal unit in the demo. Instead of handing back “robot failed here,” the pipeline extracts the expert intervention that got the task back on track and pairs it with the failure state that triggered it.

Recovery start and stop markers inserted explicitly
Corrective actions preserved separately from background runtime
Session context retained so the cutout is not decontextualized

Recovery package recovery_clip.mp4 · recovery_joints.parquet · failure_state.json

Labs can train on the corrective snippet without losing the session context that explains why the intervention was needed.

Ashmyr role

The value is not only the intervention itself. It is the discipline of cutting it cleanly, tagging it consistently, and delivering it in a reusable format across many deployments and many sites.

Export

The final output is a training and evaluation bundle, not an ops artifact.

At export time, the session has already been structured enough to hand directly into a lab’s post-training or evaluation workflow. The point is to reduce the gap between “the robot failed” and “the training stack can use this.”

Manifest summarizes session, failure, and recovery metadata
Files are grouped by rollout and recovery unit
Output can map to RLDS-like or lab-specific schemas

Bundle status Ready for post-training and field eval replay

The same package can support retraining, failure analysis, and cross-site robustness tracking if the export contract stays stable.

What labs avoid

No manual log stitching, no ad hoc file naming, no guessing which frames align with the human correction. The packaging work is already done.

Current bundle

Deployment session `ASM-052`

Raw synchronized session with site metadata, task markers, and aligned streams ready for failure slicing.

Session file

session_manifest.json

Task, site, shift, operator, and synchronized timestamp map.

Video

cam_front.mp4 / cam_wrist.mp4

Continuous RGB capture of the live rollout.

Robot state

joints.parquet

Joint positions and task markers over the full session.

Live session log

Example deployment feed

DEPLOYMENT PIPELINE

Rollouts 0

Failures 0

Exports 0

Why this matters

Pipeline outputs for model teams

These metrics are illustrative, but they show what changes when the deployment loop is structured instead of bespoke.

Segmentation 13.2s

Failure window isolated from a 51-second rollout.

Recovery cut 8.2s

Corrective intervention preserved as its own unit.

Artifacts 7 files

Exported in one bundle instead of scattered raw logs.

Schema Stable

Consistent enough to compare runs across sites.

From live failure to training-ready bundle.

Four stages. One deployment session.

Capture

Failure slice

Recovery cut

Export

Every rollout is logged as a continuous, synchronized session.

The pipeline narrows a full rollout into the exact failure interval.

The operator correction is isolated as its own trajectory.

The final output is a training and evaluation bundle, not an ops artifact.

Deployment session `ASM-052`

session_manifest.json

cam_front.mp4 / cam_wrist.mp4

joints.parquet

Example deployment feed

Pipeline outputs for model teams

This is what your deployment data looks like.
Ready to get started?

Four stages. One deployment session.

Capture

Failure slice

Recovery cut

Export

Every rollout is logged as a continuous, synchronized session.

The pipeline narrows a full rollout into the exact failure interval.

The operator correction is isolated as its own trajectory.

The final output is a training and evaluation bundle, not an ops artifact.

Deployment session `ASM-052`

session_manifest.json

cam_front.mp4 / cam_wrist.mp4

joints.parquet

Example deployment feed

Pipeline outputs for model teams

This is what your deployment data looks like.Ready to get started?

This is what your deployment data looks like.
Ready to get started?