lightblue/suzume-llama-3-8B-multilingual-orpo-borda-half
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:8kPublished:Apr 25, 2024License:cc-by-nc-4.0Architecture:Transformer0.0K Open Weights Warm
The lightblue/suzume-llama-3-8B-multilingual-orpo-borda-half is an 8 billion parameter Llama 3-based multilingual model developed by lightblue, fine-tuned using the ORPO method on a subset of the lightblue/mitsu dataset. This model, with an 8192 token context length, demonstrates improved performance across multiple languages on MT-Bench, particularly excelling in Russian. It is optimized for enhanced conversational quality and multilingual understanding.
Loading preview...
Popular Sampler Settings
Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.
temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p