VAGOsolutions/SauerkrautLM-Qwen-32b
TEXT GENERATIONConcurrency Cost:2Model Size:32.5BQuant:FP8Ctx Length:32kPublished:Apr 12, 2024License:tongyi-qianwen-researchArchitecture:Transformer0.0K Cold
VAGOsolutions/SauerkrautLM-Qwen-32b is a 32.5 billion parameter language model, a fine-tuned version of Qwen/Qwen1.5-32B developed jointly by VAGO solutions and Hyperspace.ai. This model is specifically optimized for bilingual performance in German and English, making it the first Qwen 32B model with this dual-language capability. It was fine-tuned using SFT and aligned with DPO, achieving an average score of 73.11 on the Open LLM Leaderboard.
Loading preview...
Popular Sampler Settings
Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.
temperature
–
top_p
–
top_k
–
frequency_penalty
–
presence_penalty
–
repetition_penalty
–
min_p
–