rl-research/DR-Tulu-SFT-8B

Warm
Public
8B
FP8
32768
1
Nov 13, 2025
License: apache-2.0
Hugging Face

DR-Tulu-SFT-8B by rl-research is an 8 billion parameter SFT (Supervised Fine-Tuning) checkpoint of DR Tulu, an open deep research agent built on Qwen3-8B. This model is specifically trained for tool-use within the dr-agent-lib framework, excelling in complex research-oriented question answering and information retrieval tasks. It demonstrates significant performance improvements over its base model in benchmarks like SQAv2, HealthBench, and DeepResearch Bench, making it suitable for advanced research applications requiring agentic capabilities.

No reviews yet. Be the first to review!