zake7749/gemma-2-2b-it-chinese-kyara-dpo
TEXT GENERATIONConcurrency Cost:1Model Size:2.6BQuant:BF16Ctx Length:8kPublished:Aug 18, 2024License:gemmaArchitecture:Transformer0.0K Warm
Kyara (Knowledge Yielding Adaptive Retrieval Augmentation) is a 2.6 billion parameter Gemma-2-2b-it model fine-tuned by zake7749. This model is specifically enhanced for knowledge retrieval and language comprehension, particularly in Traditional Chinese, addressing data scarcity for this language. It demonstrates improved performance over the base Gemma-2-2b-it across various benchmarks, especially in Chinese language evaluations, and is optimized for RAG-related tasks.
Loading preview...
Popular Sampler Settings
Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.
temperature
–
top_p
–
top_k
–
frequency_penalty
–
presence_penalty
–
repetition_penalty
–
min_p
–