clglavan/magos-k8s-0.6b
clglavan/magos-k8s-0.6b is a 0.6 billion parameter reasoning model derived from Qwen3-0.6B, specifically fine-tuned for Kubernetes diagnostics. It excels at mapping Kubernetes symptoms to corrective actions, generating structured reasoning traces, and providing precise kubectl/promtool commands or YAML patches. This model is optimized for use as an inner-loop reasoner in devops agents, focusing on event-grounded diagnostic reasoning and YAML manifest generation.
Loading preview...
Model Overview
magos-k8s-0.6b is a 0.6 billion parameter reasoning model, based on Qwen3-0.6B, specialized in Kubernetes diagnostics. It undergoes a two-stage full-weight training process: continued pre-training on Kubernetes documentation and API references, followed by supervised fine-tuning on event-to-YAML diagnostic pairs. The model generates structured <think> reasoning traces culminating in concise actions like kubectl commands, YAML patches, or root cause analyses.
Key Capabilities
- Kubernetes Diagnostics: Correlates
kubectl get eventsorkubectl describeoutput to responsible YAML fields and suggests fixes. - YAML Manifest Handling: Strong performance in generating and reviewing YAML manifests across various Kubernetes Kinds (Pod, Deployment, Service, etc.).
- Prometheus Alert Processing: Provides meaning and diagnostic steps for prometheus-operator runbook alerts.
- Structured Reasoning: Delivers short, templated reasoning traces that lead to concrete next actions, ideal for integration into devops agents.
What's New in v15
v15 represents a significant redesign, focusing on event-grounded diagnostic reasoning. It was trained on ~16.6k matched BROKEN/HEALTHY event→YAML diagnostic pairs, leading to improved performance in YAML handling and diagnosis. Benchmarks show v15 outperforming v8 overall, particularly in YAML (+9) and diagnosis (+3) buckets, while reducing hallucinations by 14% and increasing the agent-usable rate to 31%. A known regression is its performance on plain single kubectl commands, where it tends to over-diagnose.
Intended Use
This model is designed as a reasoning component for learning, prototyping, and local devops agents. It is recommended to run the model greedily with repetition_penalty = 1.0 for optimal structured output. Users should always verify generated commands or YAML against current Kubernetes documentation, as knowledge is frozen at the training snapshot (mid-2026).