orai-nlp/Gemma-Kimu-9b-it

9B
FP8
16384
License: gemma

Model Overview

orai-nlp/Gemma-Kimu-9b-it is a 9-billion-parameter instruction-tuned large language model (LLM) designed specifically for the Basque language. It is built on Google's Gemma-2-9b base model and its instruction-tuned counterpart, Gemma-2-9b-it. Development proceeds in two stages. First, the base LLM is continually pre-trained on Basque data, with English data replayed alongside it to preserve English proficiency. Second, instruction-following capabilities are injected by delta-based weight merging: the weight differences between the instruction-tuned model and the base model are merged into the language-adapted model.
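
The delta-based merging step can be illustrated with a minimal sketch. It assumes the merge is plain element-wise arithmetic of the form adapted + alpha * (instruct - base); the function name `merge_instruction_delta`, the `alpha` scaling factor, and the toy weights are hypothetical and are not taken from the model's actual training recipe.

```python
def merge_instruction_delta(adapted, base, instruct, alpha=1.0):
    """Return adapted + alpha * (instruct - base) for every parameter.

    Each argument maps a parameter name to a flat list of floats.
    The names and the alpha factor are illustrative assumptions.
    """
    if not (adapted.keys() == base.keys() == instruct.keys()):
        raise ValueError("checkpoints must share the same parameter names")
    merged = {}
    for name in adapted:
        merged[name] = [
            a + alpha * (i - b)  # add the post-training delta to the adapted weight
            for a, b, i in zip(adapted[name], base[name], instruct[name])
        ]
    return merged


# Toy weights standing in for real checkpoints (hypothetical values).
base = {"w": [1.0, 2.0, 3.0]}      # original base model
instruct = {"w": [1.5, 2.0, 2.0]}  # instruction-tuned version of the base
adapted = {"w": [0.0, 4.0, 3.0]}   # base after continual pre-training on Basque

merged = merge_instruction_delta(adapted, base, instruct)
print(merged)  # {'w': [0.5, 4.0, 2.0]}
```

With real checkpoints the same arithmetic would be applied tensor by tensor over the three models' state dictionaries; setting `alpha` to 0 returns the language-adapted weights unchanged.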

Key Capabilities & Features

  • Basque Language Specialization: Optimized for instruction following, safety, and linguistic correctness in Basque.
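  • Efficient Adaptation: Uses a novel method that decouples language adaptation from post-training alignment. The post-training deltas (the weight changes introduced by instruction tuning) are merged into the language-adapted model rather than repeating alignment on Basque data.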
  • Bilingual Training: Continually pre-trained on a combination of Basque (ZelaiHandi dataset, 1.5 billion tokens) and English (FineWeb dataset, 300 million tokens) data to improve cross-lingual transfer.
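
As a quick check of the data mix quoted above, the following sketch computes the share of English replay data. It assumes each token is counted once, with no upsampling or repeated epochs, which this card does not specify; the variable names are illustrative.

```python
# Token counts as stated in the card.
BASQUE_TOKENS = 1_500_000_000   # ZelaiHandi
ENGLISH_TOKENS = 300_000_000    # FineWeb (replay data)

total_tokens = BASQUE_TOKENS + ENGLISH_TOKENS
english_share = ENGLISH_TOKENS / total_tokens
basque_per_english = BASQUE_TOKENS / ENGLISH_TOKENS

print(f"total tokens: {total_tokens:,}")
print(f"English replay share: {english_share:.1%}")
print(f"Basque tokens per English token: {basque_per_english:.0f}")
```

Under that assumption, English replay makes up about one sixth of the continual pre-training corpus, or one English token for every five Basque tokens.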

Performance

Evaluation on the NoRobotsEU benchmark shows a clear improvement in Basque instruction following over the original instruction-tuned model:

  • Gemma-Kimu-9b-it: Scores 71% on Basque instruction following ("Instruct follow. EU"), compared to 57% for Gemma-2-9b-it, a gain of 14 percentage points.

Good for

  • Applications requiring high-quality instruction following and linguistic accuracy in the Basque language.
  • Developers looking for a specialized LLM for Basque-centric tasks, leveraging the strengths of the Gemma-2 architecture.