srallabandi0225/inframind-0.5b-grpo
Text Generation | Concurrency Cost: 1 | Model Size: 0.5B | Quant: BF16 | Ctx Length: 32k | License: mit | Architecture: Transformer | Open Weights | Warm

InfraMind-0.5b-grpo by srallabandi0225 is a 500-million-parameter language model based on Qwen2.5-0.5B-Instruct and fine-tuned for Infrastructure-as-Code (IaC) generation. It is trained with Group Relative Policy Optimization (GRPO) and Direct Advantage Policy Optimization (DAPO), with the goal of learning to reason about infrastructure rather than memorizing patterns. The model generates valid Terraform, Kubernetes, Docker, and CI/CD configurations, reports 97.3% accuracy on InfraMind-Bench, and is optimized for edge deployment.
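For readers unfamiliar with GRPO, the core idea is that each sampled completion is scored relative to the other completions drawn for the same prompt, so no separate value model is needed. The snippet below is a minimal sketch of that group-relative normalization only. It is not the author's training code: the function name `group_relative_advantages`, the example rewards, and the use of population standard deviation are all assumptions made for illustration.

```python
from statistics import mean, pstdev


def group_relative_advantages(rewards, eps=1e-8):
    """Score each reward relative to its own group's mean and spread.

    Hypothetical helper: illustrates the general GRPO normalization,
    not the exact procedure used to train this model.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)  # population std is an assumption here
    # eps avoids division by zero when every reward in the group is equal
    return [(r - mu) / (sigma + eps) for r in rewards]


# Assumed rewards for four completions sampled for one IaC prompt,
# e.g. 1.0 if the generated configuration validates and 0.0 otherwise.
rewards = [1.0, 0.0, 1.0, 1.0]
advantages = group_relative_advantages(rewards)
print(advantages)
```

Completions that beat the group average receive a positive advantage and the failing one receives a negative advantage, which is the signal the policy update then reinforces or suppresses.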
