IlyaGusev/saiga_llama3_8b
Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:8kPublished:Apr 18, 2024License:otherArchitecture:Transformer0.1K Warm

IlyaGusev/saiga_llama3_8b is an 8 billion parameter Russian-language chatbot based on Meta's Llama-3 8B Instruct architecture, developed by Ilya Gusev. This model is specifically fine-tuned for Russian language interaction, serving as an automatic assistant. It excels in generating conversational and narrative text in Russian, making it suitable for applications requiring robust Russian natural language processing.

Loading preview...

Overview

IlyaGusev/saiga_llama3_8b is an 8 billion parameter instruction-tuned model developed by Ilya Gusev, built upon Meta's Llama-3 8B Instruct. It is specifically designed as a Russian-language automatic assistant, capable of engaging in conversations and providing helpful responses in Russian. The model has undergone several iterations of fine-tuning, utilizing various Russian datasets for Supervised Fine-Tuning (SFT) and Preference Optimization (KTO).

Key Capabilities

  • Russian Language Proficiency: Optimized for generating high-quality, natural-sounding Russian text.
  • Instruction Following: Capable of understanding and executing user instructions in Russian.
  • Chatbot Functionality: Designed to act as a conversational assistant, as demonstrated by its system prompt: "Ты — Сайга, русскоязычный автоматический ассистент. Ты разговариваешь с людьми и помогаешь им."
  • Iterative Improvement: The model has evolved through multiple versions (v2 to v7), incorporating different SFT and KTO datasets like saiga_scored and lmsys_clean_ru_preferences to enhance performance.
  • Llama-3 Prompt Format: Utilizes the native Llama-3 prompt format for optimal interaction, a change from earlier versions that used ChatML.

Good for

  • Russian-speaking Chatbots: Ideal for creating conversational AI agents that interact primarily in Russian.
  • Content Generation in Russian: Suitable for generating narratives, answering questions, and producing various forms of text content in Russian.
  • Research and Development: Provides a strong base for further fine-tuning or experimentation with Russian NLP tasks, especially given its Llama-3 foundation.
Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p