JetBrains-Research/sft-router-qwen3-4b-swe-bench
Text Generation | Concurrency Cost: 1 | Model Size: 4B | Quant: BF16 | Ctx Length: 32k | Published: Mar 19, 2026 | License: other | Architecture: Transformer | Cold
The JetBrains-Research/sft-router-qwen3-4b-swe-bench model is a 4-billion-parameter language model fine-tuned from Qwen/Qwen3-4B. On its evaluation set, it achieves a loss of 0.0374 and an accuracy of 0.9826. It is intended for applications that need high accuracy on the specialized data it was fine-tuned for.
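As a rough sizing aid, the listed model size (4B) and quantization (BF16) give a back-of-the-envelope estimate of the memory needed for the weights alone. This is a minimal sketch: it assumes a nominal 4,000,000,000 parameters (the exact count is not stated here) and 2 bytes per BF16 value, and it ignores activations, the KV cache, and framework overhead. The names `PARAMS` and `weight_memory_bytes` are illustrative only.

```python
# Nominal parameter count taken from the "4B" label; the exact figure is an assumption.
PARAMS = 4_000_000_000

# BF16 stores each value in 16 bits, i.e. 2 bytes.
BYTES_PER_PARAM_BF16 = 2


def weight_memory_bytes(num_params: int, bytes_per_param: int) -> int:
    """Return the memory needed to hold the weights only, in bytes."""
    return num_params * bytes_per_param


weights_bytes = weight_memory_bytes(PARAMS, BYTES_PER_PARAM_BF16)
weights_gb = weights_bytes / 1e9        # decimal gigabytes
weights_gib = weights_bytes / 2**30     # binary gibibytes

print(f"Weights in BF16: {weights_gb:.1f} GB ({weights_gib:.2f} GiB)")
# prints "Weights in BF16: 8.0 GB (7.45 GiB)"
```

Actual memory use at inference time will be higher, particularly near the 32k context length, since the KV cache grows with sequence length.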