cxrbon16/turkish-llama-MSFT-0.7-ngram-banned
The cxrbon16/turkish-llama-MSFT-0.7-ngram-banned model is an 8-billion-parameter language model fine-tuned from ytu-ce-cosmos/Turkish-Llama-8b-v0.1. It has a context length of 8192 tokens and was trained with a learning rate of 2e-05 over 2 epochs. The model is adapted specifically for Turkish-language tasks, building on an existing Turkish Llama base.
Model Overview
This model, cxrbon16/turkish-llama-MSFT-0.7-ngram-banned, is an 8 billion parameter language model derived from the ytu-ce-cosmos/Turkish-Llama-8b-v0.1 base. It was fine-tuned with a learning rate of 2e-05 over 2 epochs, utilizing a batch size of 2 and gradient accumulation steps of 16, resulting in an effective total batch size of 32. The training process achieved a final validation loss of 0.5518.
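For orientation, here is a minimal loading sketch using the Hugging Face transformers library. The dtype and device placement below are assumptions chosen to fit an 8B model on a single GPU, not settings documented for this model.

```python
# Minimal loading sketch; torch_dtype and device_map are assumptions,
# not values documented in the model card.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "cxrbon16/turkish-llama-MSFT-0.7-ngram-banned"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # assumption: bf16 to reduce memory footprint
    device_map="auto",           # assumption: let accelerate place the weights
)
```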
Key Characteristics
- Base Model: Fine-tuned from ytu-ce-cosmos/Turkish-Llama-8b-v0.1.
- Parameter Count: 8 billion parameters.
- Context Length: Supports an 8192-token context window.
- Training Hyperparameters: Employed the AdamW optimizer, a linear learning rate scheduler with a warmup ratio of 0.03, and 2 training epochs (reproduced in the sketch below).
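Since the original training script is not published, the documented hyperparameters can be collected into a hypothetical transformers TrainingArguments configuration. The output path and any field not stated in the card are assumptions.

```python
# Sketch of a TrainingArguments object matching the documented hyperparameters.
# Fields marked "documented" come from the model card; everything else is an assumption.
from transformers import TrainingArguments

training_args = TrainingArguments(
    output_dir="turkish-llama-finetune",  # hypothetical output path
    learning_rate=2e-5,                   # documented
    num_train_epochs=2,                   # documented
    per_device_train_batch_size=2,        # documented
    gradient_accumulation_steps=16,       # documented; effective batch size 32
    lr_scheduler_type="linear",           # documented scheduler
    warmup_ratio=0.03,                    # documented warmup ratio
    optim="adamw_torch",                  # documented AdamW optimizer
)
```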
Intended Use
This model is suitable for applications requiring Turkish-language understanding and generation, leveraging its fine-tuned capabilities from a Turkish Llama base. Specific use cases are not detailed in the original documentation, suggesting general-purpose use within the Turkish linguistic domain.
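A short generation sketch follows, reusing the model and tokenizer from the loading example above. The Turkish prompt and decoding parameters are illustrative assumptions, not values from the model card.

```python
# Illustrative Turkish generation; prompt and sampling parameters are assumptions.
prompt = "Türkiye'nin başkenti neresidir?"  # "What is the capital of Türkiye?"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)

outputs = model.generate(
    **inputs,
    max_new_tokens=128,  # assumption: modest generation budget
    do_sample=True,
    temperature=0.7,     # assumption: moderate sampling temperature
)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```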