Phaedrus33/GRPO_final_submission
TEXT GENERATIONConcurrency Cost:2Model Size:32BQuant:FP8Ctx Length:32kPublished:Feb 1, 2026Architecture:Transformer Cold

Phaedrus33/GRPO_final_submission is a 32 billion parameter model, fine-tuned from Qwen3-32B using Supervised Fine-Tuning (SFT) and Group Relative Policy Optimization (GRPO). Developed by Phaedrus33, this model is specifically designed for 5G network troubleshooting, excelling at root cause analysis by applying structured reasoning over pre-computed metrics. It achieves a 0.9582 score on the Zindi AI Telco Troubleshooting Challenge Phase 2 test set, demonstrating robust performance in specialized technical diagnostics.

Loading preview...