moogician/DSR1-Qwen-32B-still
TEXT GENERATIONConcurrency Cost:2Model Size:32BQuant:FP8Ctx Length:32kLicense:otherArchitecture:Transformer Cold
moogician/DSR1-Qwen-32B-still is a 32 billion parameter language model fine-tuned from deepseek-ai/DeepSeek-R1-Distill-Qwen-32B. This model was specifically fine-tuned on the "still" dataset, suggesting a specialization for tasks related to the characteristics of this particular dataset. It leverages a 32768 token context length, making it suitable for processing extensive inputs.
Loading preview...
Popular Sampler Settings
Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.
temperature
–
top_p
–
top_k
–
frequency_penalty
–
presence_penalty
–
repetition_penalty
–
min_p
–