russwest404/Qwen3-4B-ReTool-SFT
TEXT GENERATIONConcurrency Cost:1Model Size:4BQuant:BF16Ctx Length:32kPublished:May 2, 2025License:otherArchitecture:Transformer Warm
The russwest404/Qwen3-4B-ReTool-SFT model is a fine-tuned version of the Qwen/Qwen3-4B architecture, specifically optimized using the retool dataset. This model focuses on specialized tasks related to the retool dataset, achieving a training loss of 0.3798. It is intended for applications requiring capabilities derived from its fine-tuning on this specific dataset, offering a tailored solution for relevant use cases.
Loading preview...
Popular Sampler Settings
Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.
temperature
–
top_p
–
top_k
–
frequency_penalty
–
presence_penalty
–
repetition_penalty
–
min_p
–