orbit-ai/orbit-4b-ablation-training-mix-124-v0.1
Task: Text Generation
Concurrency Cost: 1
Model Size: 4B
Quant: BF16
Ctx Length: 32k
Published: Sep 10, 2025
License: apache-2.0
Architecture: Transformer
Open Weights

orbit-ai/orbit-4b-ablation-training-mix-124-v0.1 is a 4-billion-parameter expert open search agent developed by orbit-ai and built on the Qwen3-4B base model. It is fine-tuned with GRPO (Group Relative Policy Optimization) to use web search as a tool for multi-turn question answering, and was trained on a mixture of Natural Questions, HotpotQA, and ORBIT in a 1:2:4 ratio. The model is an ablation-study checkpoint, intended primarily for research into retrieval-augmented reasoning and RL-based tool-use training.
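
To make the 1:2:4 training mix concrete, the following is a minimal sketch of how such a ratio translates into per-dataset sampling proportions. It assumes the ratio follows the order in which the datasets are listed (Natural Questions : HotpotQA : ORBIT) and that mixing is done by weighted sampling of the source dataset; the names `MIX_WEIGHTS`, `mix_fractions`, and `sample_sources` are hypothetical and are not taken from the orbit-ai training code.

```python
import random

# Assumption: weights follow the order the datasets are listed in the description.
MIX_WEIGHTS = {"natural_questions": 1, "hotpotqa": 2, "orbit": 4}


def mix_fractions(weights):
    """Convert integer ratio weights into fractions that sum to 1."""
    total = sum(weights.values())
    return {name: weight / total for name, weight in weights.items()}


def sample_sources(weights, n, seed=0):
    """Draw n source-dataset names with probability proportional to the weights."""
    rng = random.Random(seed)  # local RNG so the draw is reproducible
    names = list(weights)
    return rng.choices(names, weights=[weights[k] for k in names], k=n)


if __name__ == "__main__":
    for name, fraction in mix_fractions(MIX_WEIGHTS).items():
        print(f"{name}: {fraction:.3f}")
    print(sample_sources(MIX_WEIGHTS, 10))
```

Under this reading, ORBIT contributes 4/7 of the sampled examples, HotpotQA 2/7, and Natural Questions 1/7. The actual training pipeline may instead mix by fixed example counts or per-batch quotas; the description does not say which.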
