russwest404/Qwen3-4B-ReTool-SFT
TEXT GENERATIONConcurrency Cost:1Model Size:4BQuant:BF16Ctx Length:32kPublished:May 2, 2025License:otherArchitecture:Transformer Warm

The russwest404/Qwen3-4B-ReTool-SFT model is a fine-tuned version of the Qwen/Qwen3-4B architecture, specifically optimized using the retool dataset. This model focuses on specialized tasks related to the retool dataset, achieving a training loss of 0.3798. It is intended for applications requiring capabilities derived from its fine-tuning on this specific dataset, offering a tailored solution for relevant use cases.

Loading preview...

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p