Overview
Implement the full training pipeline from ADR-129 to retrain RuvLTRA models with TurboQuant KV-cache profiling on Google Cloud.
Phases
Phase 1: imatrix Recalibration + TurboQuant KV Profiling (Week 1)
Phase 2: WET-Augmented LoRA Fine-Tuning (Week 2-3)
Phase 3: Benchmarking & Validation (Week 3-4)
Phase 4: Publishing (Week 4)
Release Gates (G1-G7)
| Gate |
Criterion |
| G1 |
HumanEval pass@1 ≥ 45% (0.5B) / ≥ 55% (3B) |
| G2 |
Routing accuracy ≥ 80% (no regression) |
| G3 |
Wikitext-2 PPL increase < 5% |
| G4 |
TurboQuant ≥ 8x compression, PPL delta < 1% |
| G5 |
Long context PPL < 20 at 16K tokens |
| G6 |
Zero eval contamination |
| G7 |
Inference ≥ 80 tok/s (0.5B) / ≥ 40 (3B) |
Infrastructure
- Compute: L4 GPU (Cloud Run Jobs) + A100-80GB (Vertex AI)
- Data: Brain memories (3,870+), WET corpus, Claude Flow routing (2,700+), ADR corpus (129 docs)
- Estimated cost: ~$70-210 (experimental compute)
Files Created
scripts/training/release_gate.py — Automated ship/no-ship checker
scripts/training/export_training_data.py — Dataset export with governance
scripts/training/contamination_check.py — Eval contamination detection
scripts/training/Dockerfile — Training image
scripts/training/deploy_training.sh — Cloud Run job creation
scripts/training/run_calibration.py — Phase 1 entry point
scripts/training/run_sft.py — Phase 2 entry point
crates/ruvllm/src/quantize/turboquant_profile.rs — Sidecar config loading
Related
🤖 Generated with claude-flow
Overview
Implement the full training pipeline from ADR-129 to retrain RuvLTRA models with TurboQuant KV-cache profiling on Google Cloud.
Phases
Phase 1: imatrix Recalibration + TurboQuant KV Profiling (Week 1)
gcr.io/ruv-dev/ruvltra-training:latestDocker image.turboquant.jsonsidecar profiles per modelPhase 2: WET-Augmented LoRA Fine-Tuning (Week 2-3)
Phase 3: Benchmarking & Validation (Week 3-4)
scripts/training/release_gate.pyPhase 4: Publishing (Week 4)
.turboquant.jsonsidecarsruvllmregistry with checksumsruvllmand@ruvector/ruvllmwith sidecar loadingRelease Gates (G1-G7)
Infrastructure
Files Created
scripts/training/release_gate.py— Automated ship/no-ship checkerscripts/training/export_training_data.py— Dataset export with governancescripts/training/contamination_check.py— Eval contamination detectionscripts/training/Dockerfile— Training imagescripts/training/deploy_training.sh— Cloud Run job creationscripts/training/run_calibration.py— Phase 1 entry pointscripts/training/run_sft.py— Phase 2 entry pointcrates/ruvllm/src/quantize/turboquant_profile.rs— Sidecar config loadingRelated
crates/ruvllm/src/quantize/turbo_quant.rs🤖 Generated with claude-flow