Fizzarolli/sappha-2b-v3
TEXT GENERATIONConcurrency Cost:1Model Size:2.5BQuant:BF16Ctx Length:8kPublished:Mar 24, 2024License:gemmaArchitecture:Transformer0.0K Warm

Fizzarolli/sappha-2b-v3 is a 2.5 billion parameter instruction-tuned QLoRA fine-tune of the Gemma-2B base model, developed by Fizzarolli. This model, trained with Unsloth, demonstrates improved performance over its base model and Dolphin-2.8-Gemma-2B on MMLU, HellaSwag, and PIQA benchmarks. With an 8192-token context length, it is optimized for general instruction-following tasks.

Loading preview...

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p