rl-research/DR-Tulu-8B
Text Generation
Concurrency Cost: 1
Model Size: 8B
Quant: FP8
Ctx Length: 32k
Published: Nov 13, 2025
License: apache-2.0
Architecture: Transformer
0.1K | Open Weights | Warm
DR-Tulu-8B is an 8-billion-parameter deep research agent developed by rl-research, built on the DR-Tulu-SFT-8B base model. It was trained with reinforcement learning (RL) specifically for tool use within the dr-agent-lib framework. On research-oriented benchmarks such as SQAv2, HealthBench, and DeepResearch Bench, it outperforms its SFT counterpart and other 8B models.