Name: lightblue/suzume-llama-3-8B-multilingual-orpo-borda-full API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: lightblue

Model Overview

lightblue/suzume-llama-3-8B-multilingual-orpo-borda-full is an 8 billion parameter multilingual language model, fine-tuned from lightblue/suzume-llama-3-8B-multilingual using the ORPO (Optimized Reward Policy Optimization) method. Developed by Peter Devine, this model leverages the lightblue/mitsu dataset, which was created using responses from Command R and Command R+ models. It is one of several ORPO-trained variants, specifically utilizing the top/bottom responses from all prompts in the dataset.

Key Capabilities

Multilingual Performance: Shows improved MT-Bench scores across 6 languages (Chinese, English, French, German, Japanese, Russian) compared to its base model and other similar-sized LLMs.
ORPO Fine-tuning: Benefits from ORPO training, which enhances its ability to generate preferred responses based on ranked data.
Llama 3 Architecture: Built upon the robust Llama 3 foundation, providing a strong base for language understanding and generation.

Intended Uses & Limitations

This model is designed for multilingual conversational AI and general language tasks where improved response quality is desired. However, it currently carries a non-commercial license due to its training data source. Users seeking a commercially viable model should await future releases from the developer. For optimal performance, the developer recommends using lightblue/suzume-llama-3-8B-multilingual-orpo-borda-half based on their tests.

Overview

Model Overview

Key Capabilities

Intended Uses & Limitations

Full Model Card (README)