Fizzarolli/sappha-2b-v3
Text Generation | Concurrency Cost: 1 | Model Size: 2.5B | Quant: BF16 | Ctx Length: 8k | Published: Mar 24, 2024 | License: gemma | Architecture: Transformer
Fizzarolli/sappha-2b-v3 is a 2.5-billion-parameter instruction-tuned model: a QLoRA fine-tune of the Gemma-2B base model, developed by Fizzarolli and trained with Unsloth. It scores higher than both its base model and Dolphin-2.8-Gemma-2B on the MMLU, HellaSwag, and PIQA benchmarks. It has an 8192-token context length and is intended for general instruction-following tasks.
Popular Sampler Settings
Top 3 parameter combinations used by Featherless users for this model. No values are shown for any parameter:
temperature: –
top_p: –
top_k: –
frequency_penalty: –
presence_penalty: –
repetition_penalty: –
min_p: –
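Since the page lists the sampler parameters without values, the following is a minimal sketch of how a client might assemble only the settings it actually wants to set, leaving the rest to server defaults. The helper names `build_sampler_settings` and `build_request`, and the `model` and `prompt` field names in the request body, are assumptions for illustration and are not taken from the Featherless API. The numeric values in the usage note are arbitrary and are not the popular settings referred to above.

```python
from typing import Optional

# Sampler parameter names exactly as listed on the page.
SAMPLER_KEYS = (
    "temperature",
    "top_p",
    "top_k",
    "frequency_penalty",
    "presence_penalty",
    "repetition_penalty",
    "min_p",
)


def build_sampler_settings(**overrides: Optional[float]) -> dict:
    """Return only the sampler settings that were explicitly given.

    Parameters passed as None (or not passed at all) are omitted, so the
    serving side can fall back to its own defaults.
    """
    unknown = set(overrides) - set(SAMPLER_KEYS)
    if unknown:
        raise ValueError(f"unknown sampler parameter(s): {sorted(unknown)}")
    return {
        key: overrides[key]
        for key in SAMPLER_KEYS
        if overrides.get(key) is not None
    }


def build_request(prompt: str, **sampler: Optional[float]) -> dict:
    """Assemble a hypothetical request body (field names are assumptions)."""
    return {
        "model": "Fizzarolli/sappha-2b-v3",
        "prompt": prompt,
        **build_sampler_settings(**sampler),
    }
```

For example, `build_request("Hello", temperature=0.7, min_p=None)` yields a body containing `model`, `prompt`, and `temperature` only. Rejecting unknown names early is a design choice that catches typos such as `top-p` before a request is sent.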