lightblue/suzume-llama-3-8B-multilingual-orpo-borda-full
The lightblue/suzume-llama-3-8B-multilingual-orpo-borda-full model is an 8 billion parameter Llama 3-based multilingual language model developed by Peter Devine. It is fine-tuned using the ORPO method on the lightblue/mitsu dataset, which was generated using Command R and Command R+ models. This model demonstrates noticeable performance improvements across multiple languages on MT-Bench scores, making it suitable for multilingual conversational AI applications.
Loading preview...
Model Overview
lightblue/suzume-llama-3-8B-multilingual-orpo-borda-full is an 8 billion parameter multilingual language model, fine-tuned from lightblue/suzume-llama-3-8B-multilingual using the ORPO (Optimized Reward Policy Optimization) method. Developed by Peter Devine, this model leverages the lightblue/mitsu dataset, which was created using responses from Command R and Command R+ models. It is one of several ORPO-trained variants, specifically utilizing the top/bottom responses from all prompts in the dataset.
Key Capabilities
- Multilingual Performance: Shows improved MT-Bench scores across 6 languages (Chinese, English, French, German, Japanese, Russian) compared to its base model and other similar-sized LLMs.
- ORPO Fine-tuning: Benefits from ORPO training, which enhances its ability to generate preferred responses based on ranked data.
- Llama 3 Architecture: Built upon the robust Llama 3 foundation, providing a strong base for language understanding and generation.
Intended Uses & Limitations
This model is designed for multilingual conversational AI and general language tasks where improved response quality is desired. However, it currently carries a non-commercial license due to its training data source. Users seeking a commercially viable model should await future releases from the developer. For optimal performance, the developer recommends using lightblue/suzume-llama-3-8B-multilingual-orpo-borda-half based on their tests.
Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.