Nanbeige/Nanbeige4.1-3B is a 4.1 billion parameter model developed by Nanbeige, built upon Nanbeige4-3B-Base. This enhanced iteration, optimized through SFT and RL, excels in robust reasoning, preference alignment, and agentic behaviors, reliably solving complex, multi-step problems and supporting deep-search tasks with over 500 tool invocations. It offers strong performance in code, math, and science benchmarks, outperforming larger models in its class.
No reviews yet. Be the first to review!