BrewInteractive/fikri-3.1-8B-Instruct

Warm
Public
8B
FP8
8192
License: llama3.1
Hugging Face
Overview

Model Overview

BrewInteractive's Fikri 3.1-8B-Instruct is an 8 billion parameter language model specifically fine-tuned for Turkish language tasks. Built upon the Llama 3.1 base model, Fikri (meaning "intellectual" or "of thought" in Turkish) represents a focused effort to enhance AI capabilities within the Turkish linguistic and cultural context.

Key Capabilities & Training

  • Turkish Language Focus: Primarily designed for understanding and generating Turkish text, making it suitable for applications requiring high relevance to Turkish language nuances.
  • Optimized Training: Fine-tuned on approximately 1 billion tokens of high-quality Turkish data and 200,000 Turkish instructions, ensuring strong performance on Turkish-specific tasks.
  • Efficient Configuration: Developed with a light configuration, making it efficient for deployment in various applications.
  • Development Hardware: Trained using 2x NVIDIA RTX 4090 GPUs, with a training loss of 0.996 over 1.0 epoch in approximately 24 hours.

Use Cases

Fikri is ideal for applications such as:

  • Conversational AI: Powering chatbots and virtual assistants that interact in Turkish.
  • Text Summarization: Generating concise summaries of Turkish documents.
  • Text Generation: Creating coherent and contextually relevant Turkish text for various purposes.
  • Turkish NLP Tasks: Any task requiring robust Turkish language understanding and production.