rl-research/DR-Tulu-8B
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:32kPublished:Nov 13, 2025License:apache-2.0Architecture:Transformer0.1K Open Weights Warm

DR-Tulu-8B is an 8 billion parameter deep research agent developed by rl-research, built upon the DR-Tulu-SFT-8B base model. This model has undergone Reinforcement Learning (RL) training specifically for advanced tool-use within the dr-agent-lib framework. It excels in complex research-oriented tasks, demonstrating superior performance across benchmarks like SQAv2, HealthBench, and DeepResearch Bench compared to its SFT counterpart and other 8B models.

Loading preview...

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p