VAGOsolutions/SauerkrautLM-Gemma-7b
TEXT GENERATIONConcurrency Cost:1Model Size:8.5BQuant:FP8Ctx Length:8kPublished:Feb 27, 2024License:gemma-terms-of-useArchitecture:Transformer0.0K Cold

VAGO solutions and Hyperspace.ai present SauerkrautLM-Gemma-7b, an 8.5 billion parameter instruction-tuned model based on Google's Gemma-7b architecture with an 8192 token context length. This model is uniquely fine-tuned using a novel laser-QLoRA technique and LaserRMT, focusing on preventing catastrophic forgetting and enhancing mathematical abilities. It is notable for being one of the first Gemma models with strong bilingual capabilities in German and English, achieving an average of 67.83 on the H6 Open LLM Leaderboard and 54.13 on AGIEval, GPT4ALL, and BigBench.

Loading preview...

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p