wandb/gemma-7b-zephyr-sft
TEXT GENERATIONConcurrency Cost:1Model Size:8.5BQuant:FP8Ctx Length:8kPublished:Feb 28, 2024License:gemma-terms-of-useArchitecture:Transformer0.0K Cold

wandb/gemma-7b-zephyr-sft is an 8.5 billion parameter GPT-like model fine-tuned from Google's Gemma 7B, primarily for English language tasks. It applies the Zephyr Supervised Fine-Tuning (SFT) recipe to enhance its conversational and instruction-following capabilities. This model demonstrates strong performance across various reasoning and common sense benchmarks, making it suitable for general-purpose language generation and understanding applications.

Loading preview...