Name: AdamGrzesik/Samantha-PL-AG-Mistral-7B-v0.2 API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: AdamGrzesik

Model Overview

AdamGrzesik/Samantha-PL-AG-Mistral-7B-v0.2 is a 7 billion parameter language model built upon the Mistral-7B-v0.2 architecture. This model has been fine-tuned by AdamGrzesik using the Axolotl framework, specifically targeting Polish language capabilities.

Key Capabilities

Polish Language Optimization: The model is fine-tuned on the Samantha-PL-AG-axolotl dataset, indicating a specialization in Polish text generation and comprehension.
Mistral-7B-v0.2 Base: Benefits from the strong foundational capabilities of the Mistral architecture.
Context Length: Supports a sequence length of 4096 tokens, allowing for processing moderately long inputs.
Training Details: Trained with a learning rate of 5e-06 over 4 epochs, utilizing a total batch size of 48 and employing techniques like gradient accumulation and flash attention for efficiency.

Good For

Applications requiring a robust language model for Polish text.
Tasks involving Polish content generation, summarization, or question answering.
Developers looking for a Mistral-based model with enhanced performance in the Polish language.

Overview

Model Overview

Key Capabilities

Good For

Full Model Card (README)