orbit-ai/infoseeker-repro-4b
Text Generation | Concurrency Cost: 1 | Model Size: 4B | Quant: BF16 | Ctx Length: 32k | Published: Mar 9, 2026 | License: apache-2.0 | Architecture: Transformer | Open Weights | Warm

orbit-ai/infoseeker-repro-4b is a 4-billion-parameter open search agent based on the Qwen3-4B architecture, fine-tuned by orbit-ai using GRPO for multi-turn question answering with live web search. It performs retrieval-augmented reasoning through a DDGS-based retriever, answering complex queries on datasets such as Natural Questions, HotpotQA, and InfoSeek. Its primary use case is research into RL-based tool-use training and multi-turn retrieval-augmented reasoning. The model needs an active search backend to perform as intended.
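The multi-turn search loop described above can be sketched roughly as follows. This is a minimal illustration, not the model's documented interface: the `<search>`, `<information>`, and `<answer>` tags, the `run_agent` function, and the stub `fake_generate` and `fake_search` callables are all hypothetical names chosen for the example. Check the model's actual prompt template before relying on any of them.

```python
import re
from typing import Callable, List, Optional

# Hypothetical tag format; the real model may use a different template.
SEARCH_RE = re.compile(r"<search>(.*?)</search>", re.DOTALL)
ANSWER_RE = re.compile(r"<answer>(.*?)</answer>", re.DOTALL)


def run_agent(
    question: str,
    generate: Callable[[str], str],
    search: Callable[[str], List[str]],
    max_turns: int = 4,
) -> Optional[str]:
    """Alternate between model generation and retrieval until an answer appears."""
    transcript = f"Question: {question}\n"
    for _ in range(max_turns):
        output = generate(transcript)
        transcript += output + "\n"

        # Stop as soon as the model commits to a final answer.
        answer = ANSWER_RE.search(output)
        if answer:
            return answer.group(1).strip()

        # Otherwise look for a search request and feed the results back in.
        query = SEARCH_RE.search(output)
        if query is None:
            break  # neither an answer nor a search request: give up
        results = search(query.group(1).strip())
        transcript += "<information>\n" + "\n".join(results) + "\n</information>\n"
    return None


# Stub model and retriever so the sketch runs without a GPU or network access.
def fake_generate(transcript: str) -> str:
    if "<information>" in transcript:
        return "<answer>Paris</answer>"
    return "<search>capital of France</search>"


def fake_search(query: str) -> List[str]:
    return [f"Stub result for: {query}"]


print(run_agent("What is the capital of France?", fake_generate, fake_search))
```

In a real setup, `generate` would wrap a call to the model and `search` would wrap a live DDGS-backed retriever. Keeping both as plain callables makes the loop easy to test offline, and `max_turns` bounds the number of search rounds per question.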
