zake7749/gemma-2-2b-it-chinese-kyara-dpo
TEXT GENERATIONConcurrency Cost:1Model Size:2.6BQuant:BF16Ctx Length:8kPublished:Aug 18, 2024License:gemmaArchitecture:Transformer0.0K Warm

Kyara (Knowledge Yielding Adaptive Retrieval Augmentation) is a 2.6 billion parameter Gemma-2-2b-it model fine-tuned by zake7749. This model is specifically enhanced for knowledge retrieval and language comprehension, particularly in Traditional Chinese, addressing data scarcity for this language. It demonstrates improved performance over the base Gemma-2-2b-it across various benchmarks, especially in Chinese language evaluations, and is optimized for RAG-related tasks.

Loading preview...

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p