ToastyPigeon/Gemma-3-Starshine-12B
VISIONConcurrency Cost:1Model Size:12BQuant:FP8Ctx Length:32kPublished:Mar 27, 2025Architecture:Transformer0.0K Warm

ToastyPigeon/Gemma-3-Starshine-12B is a 12 billion parameter language model based on the Gemma 3 architecture, specifically designed for creative writing and storytelling. This model is a merge of fine-tuned Gemma 3 12B IT and Gemma 3 12B PT variants, optimized to produce novel-like prose and excel in narrative scenarios. It features a 32768 token context length and includes a vision tower, enhancing its capabilities for story-focused applications.

Loading preview...

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p