Experiment Runner
The Experiment Runner is the execution engine of Hubify Labs. It takes experiment definitions, provisions compute, executes code, and captures every detail for reproducibility.Running an Experiment
- Web UI
- CLI
- API
- Open the Captain View
- Click New Experiment (or press
Cmd+E) - Describe the experiment in natural language or fill in the structured form
- Select compute requirements (GPU type, estimated duration)
- Click Run
Experiment Dashboard
Each running experiment has a detail view showing:- Live Logs, Streaming stdout/stderr from the pod
- Metrics, Custom metrics emitted by your script (loss, convergence, sample count)
- Figures, Plots generated during execution, updated in real time
- Resource Usage, GPU utilization, memory, disk I/O
- Checkpoints, Saved intermediate states you can resume from
- Cost, Running cost in USD
Checkpointing
Experiments automatically checkpoint at configurable intervals:QC Gates
Every experiment passes through a quality control gate before results are accepted:| Check | Description | Threshold |
|---|---|---|
| Completeness | All expected output files exist | 100% |
| Convergence | R-hat statistic for MCMC chains | < 1.05 |
| Error Bounds | Statistical uncertainties are reasonable | Domain-specific |
| Reproducibility | Config + data + code are frozen | All locked |
| Review | Cross-model verification of results | Pass |
- Rerun with more samples
- Adjust parameters and retry
- Escalate to you for a decision
Chaining
Experiments can be chained so outputs flow into inputs:Batch Experiments
Run parameter sweeps or multi-configuration experiments:Reproducibility Record
Every experiment captures:- Git SHA of the codebase
- Full dependency list (
pip freeze) - Config files (YAML/JSON, checksummed)
- Input data SHA-256 hashes
- Random seeds
- Pod hardware specs
- Start/end timestamps