All 50 gpu-infra-os feature components. Each submits typed JSON on action.
GPU Cluster Provisioner
Persist to MongoDB (outcome JSON)
Slug cluster-provisioner → /api/gpu-infra/clusters. Optional upstream: GPU_INFRA_UPSTREAM_URL.
Auto-Scaling Policy Manager
Persist to MongoDB (outcome JSON)
Slug autoscaling-policy → /api/gpu-infra/clusters. Optional upstream: GPU_INFRA_UPSTREAM_URL.
Node Health Dashboard
Persist to MongoDB (outcome JSON)
Slug node-health → /api/gpu-infra/clusters. Optional upstream: GPU_INFRA_UPSTREAM_URL.
Multi-Region Cluster View
Persist to MongoDB (outcome JSON)
Slug multi-region → /api/gpu-infra/clusters. Optional upstream: GPU_INFRA_UPSTREAM_URL.
Spot Instance Optimizer
Configure and submit to emit typed outcome JSON.
Persist to MongoDB (outcome JSON)
Slug spot-optimizer → /api/gpu-infra/clusters. Optional upstream: GPU_INFRA_UPSTREAM_URL.
Reserved Capacity Planner
Persist to MongoDB (outcome JSON)
Slug reserved-capacity → /api/gpu-infra/clusters. Optional upstream: GPU_INFRA_UPSTREAM_URL.
Cluster Topology Editor
Persist to MongoDB (outcome JSON)
Slug topology → /api/gpu-infra/clusters. Optional upstream: GPU_INFRA_UPSTREAM_URL.
Resource Quota Manager
Persist to MongoDB (outcome JSON)
Slug resource-quota → /api/gpu-infra/clusters. Optional upstream: GPU_INFRA_UPSTREAM_URL.
Distributed Training Launcher
Persist to MongoDB (outcome JSON)
Slug dist-training → /api/gpu-infra/jobs. Optional upstream: GPU_INFRA_UPSTREAM_URL.
Training Job Monitor
Persist to MongoDB (outcome JSON)
Slug training-monitor → /api/gpu-infra/jobs. Optional upstream: GPU_INFRA_UPSTREAM_URL.
Experiment Tracker
Persist to MongoDB (outcome JSON)
Slug experiment-tracker → /api/gpu-infra/jobs. Optional upstream: GPU_INFRA_UPSTREAM_URL.
Checkpoint Manager
Persist to MongoDB (outcome JSON)
Slug checkpoint → /api/gpu-infra/jobs. Optional upstream: GPU_INFRA_UPSTREAM_URL.
Hyperparameter Sweep
Configure and submit to emit typed outcome JSON.
Persist to MongoDB (outcome JSON)
Slug hyperparam-sweep → /api/gpu-infra/jobs. Optional upstream: GPU_INFRA_UPSTREAM_URL.
Job Queue Manager
Configure and submit to emit typed outcome JSON.
Persist to MongoDB (outcome JSON)
Slug job-queue → /api/gpu-infra/jobs. Optional upstream: GPU_INFRA_UPSTREAM_URL.
Preemption Recovery Console
Persist to MongoDB (outcome JSON)
Slug preemption-recovery → /api/gpu-infra/jobs. Optional upstream: GPU_INFRA_UPSTREAM_URL.
Job Cost Estimator
Configure and submit to emit typed outcome JSON.
Persist to MongoDB (outcome JSON)
Slug job-cost-estimate → /api/gpu-infra/jobs. Optional upstream: GPU_INFRA_UPSTREAM_URL.
Model Registry
Persist to MongoDB (outcome JSON)
Slug model-registry → /api/gpu-infra/models. Optional upstream: GPU_INFRA_UPSTREAM_URL.
Model Deployment Wizard
Configure and submit to emit typed outcome JSON.
Persist to MongoDB (outcome JSON)
Slug model-deploy → /api/gpu-infra/models. Optional upstream: GPU_INFRA_UPSTREAM_URL.
A/B Traffic Router
Configure and submit to emit typed outcome JSON.
Persist to MongoDB (outcome JSON)
Slug ab-traffic → /api/gpu-infra/models. Optional upstream: GPU_INFRA_UPSTREAM_URL.
Model Performance Monitor
Configure and submit to emit typed outcome JSON.
Persist to MongoDB (outcome JSON)
Slug model-perf → /api/gpu-infra/models. Optional upstream: GPU_INFRA_UPSTREAM_URL.
Model Rollback Console
Configure and submit to emit typed outcome JSON.
Persist to MongoDB (outcome JSON)
Slug model-rollback → /api/gpu-infra/models. Optional upstream: GPU_INFRA_UPSTREAM_URL.
Feature Store Browser
Configure and submit to emit typed outcome JSON.
Persist to MongoDB (outcome JSON)
Slug feature-store → /api/gpu-infra/models. Optional upstream: GPU_INFRA_UPSTREAM_URL.
Pipeline Orchestrator
Persist to MongoDB (outcome JSON)
Slug pipeline → /api/gpu-infra/models. Optional upstream: GPU_INFRA_UPSTREAM_URL.
Inference Endpoint Manager
Manage serving endpoints: replicas, health status, latency/RPS, and safe actions (scale / restart / deactivate). Persists to MongoDB via /api/gpu-infra/inference.
Inference Autoscaler Config
Tune horizontal autoscaling for inference: replica bounds, RPS and GPU targets, cooldowns, optional pre-warm, and cron-based replica overrides.
Inference Cost Optimizer
Compare cost optimizations (quantize, batch, downsize, spot), record tradeoffs, and persist selected strategies with projected monthly savings.
Cold Start Analyzer
Configure and submit to emit typed outcome JSON.
Persist to MongoDB (outcome JSON)
Slug cold-start → /api/gpu-infra/inference. Optional upstream: GPU_INFRA_UPSTREAM_URL.
Batch Inference Scheduler
Configure and submit to emit typed outcome JSON.
Persist to MongoDB (outcome JSON)
Slug batch-inference → /api/gpu-infra/inference. Optional upstream: GPU_INFRA_UPSTREAM_URL.
Serving SLA Monitor
Configure and submit to emit typed outcome JSON.
Persist to MongoDB (outcome JSON)
Slug serving-sla → /api/gpu-infra/inference. Optional upstream: GPU_INFRA_UPSTREAM_URL.
GPU Cost Dashboard
Configure and submit to emit typed outcome JSON.
Persist to MongoDB (outcome JSON)
Slug gpu-cost-dash → /api/gpu-infra/billing. Optional upstream: GPU_INFRA_UPSTREAM_URL.
Budget Alert Manager
Persist to MongoDB (outcome JSON)
Slug budget-alert → /api/gpu-infra/billing. Optional upstream: GPU_INFRA_UPSTREAM_URL.
Cost Forecast Engine
Configure and submit to emit typed outcome JSON.
Persist to MongoDB (outcome JSON)
Slug cost-forecast → /api/gpu-infra/billing. Optional upstream: GPU_INFRA_UPSTREAM_URL.
Team Cost Allocation
Configure and submit to emit typed outcome JSON.
Persist to MongoDB (outcome JSON)
Slug team-allocation → /api/gpu-infra/billing. Optional upstream: GPU_INFRA_UPSTREAM_URL.
Savings Recommender
Configure and submit to emit typed outcome JSON.
Persist to MongoDB (outcome JSON)
Slug savings → /api/gpu-infra/billing. Optional upstream: GPU_INFRA_UPSTREAM_URL.
Invoice & Credit Manager
Configure and submit to emit typed outcome JSON.
Persist to MongoDB (outcome JSON)
Slug invoice-credit → /api/gpu-infra/billing. Optional upstream: GPU_INFRA_UPSTREAM_URL.
Dataset Registry
Persist to MongoDB (outcome JSON)
Slug dataset-registry → /api/gpu-infra/storage. Optional upstream: GPU_INFRA_UPSTREAM_URL.
Storage Cost Analyzer
Configure and submit to emit typed outcome JSON.
Persist to MongoDB (outcome JSON)
Slug storage-cost → /api/gpu-infra/storage. Optional upstream: GPU_INFRA_UPSTREAM_URL.
Data Pipeline Monitor
Persist to MongoDB (outcome JSON)
Slug data-pipeline → /api/gpu-infra/storage. Optional upstream: GPU_INFRA_UPSTREAM_URL.
Artifact Lifecycle Manager
Configure and submit to emit typed outcome JSON.
Persist to MongoDB (outcome JSON)
Slug artifact-lifecycle → /api/gpu-infra/storage. Optional upstream: GPU_INFRA_UPSTREAM_URL.
Model Artifact Diff
Configure and submit to emit typed outcome JSON.
Persist to MongoDB (outcome JSON)
Slug artifact-diff → /api/gpu-infra/storage. Optional upstream: GPU_INFRA_UPSTREAM_URL.
API Key Manager
Persist to MongoDB (outcome JSON)
Slug api-key → /api/gpu-infra/developer. Optional upstream: GPU_INFRA_UPSTREAM_URL.
SDK Quickstart Generator
Configure and submit to emit typed outcome JSON.
Persist to MongoDB (outcome JSON)
Slug sdk-quickstart → /api/gpu-infra/developer. Optional upstream: GPU_INFRA_UPSTREAM_URL.
Webhook Event Manager
Persist to MongoDB (outcome JSON)
Slug webhook → /api/gpu-infra/developer. Optional upstream: GPU_INFRA_UPSTREAM_URL.
Environment & Secret Manager
Configure and submit to emit typed outcome JSON.
Persist to MongoDB (outcome JSON)
Slug env-secret → /api/gpu-infra/developer. Optional upstream: GPU_INFRA_UPSTREAM_URL.
Terraform Module Generator
Configure and submit to emit typed outcome JSON.
Persist to MongoDB (outcome JSON)
Slug terraform → /api/gpu-infra/developer. Optional upstream: GPU_INFRA_UPSTREAM_URL.
RBAC Permission Manager
Persist to MongoDB (outcome JSON)
Slug rbac → /api/gpu-infra/compliance. Optional upstream: GPU_INFRA_UPSTREAM_URL.
Compliance Audit Dashboard
Configure and submit to emit typed outcome JSON.
Persist to MongoDB (outcome JSON)
Slug compliance-audit → /api/gpu-infra/compliance. Optional upstream: GPU_INFRA_UPSTREAM_URL.
Data Residency Controller
Configure and submit to emit typed outcome JSON.
Persist to MongoDB (outcome JSON)
Slug data-residency → /api/gpu-infra/compliance. Optional upstream: GPU_INFRA_UPSTREAM_URL.
Observability Hub
Persist to MongoDB (outcome JSON)
Slug observability → /api/gpu-infra/observability. Optional upstream: GPU_INFRA_UPSTREAM_URL.
GPU Kernel Profiler
Persist to MongoDB (outcome JSON)
Slug gpu-profiler → /api/gpu-infra/observability. Optional upstream: GPU_INFRA_UPSTREAM_URL.