kaist-ai/mistral-orpo-alpha
Text Generation · Open Weights
Concurrency Cost: 1 · Model Size: 7B · Quant: FP8 · Context Length: 8K · License: MIT · Architecture: Transformer

kaist-ai/mistral-orpo-alpha is a 7 billion parameter language model developed by KAIST AI, fine-tuned from Mistral-7B-v0.1 using Odds Ratio Preference Optimization (ORPO). ORPO learns preferences directly during fine-tuning, with no separate supervised fine-tuning (SFT) warmup phase; this model was trained on the HuggingFaceH4/ultrafeedback_binarized dataset. It achieves competitive scores on alignment benchmarks such as AlpacaEval and MT-Bench, making it well suited for preference-aligned conversational AI tasks.
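
A minimal sketch of running the model locally with Hugging Face transformers, assuming the standard AutoModelForCausalLM workflow and that the repository's tokenizer ships a chat template (adjust the prompt formatting if it does not):

```python
# Minimal sketch: load kaist-ai/mistral-orpo-alpha with Hugging Face transformers.
# Assumes the repo's tokenizer provides a chat template.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "kaist-ai/mistral-orpo-alpha"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # a 7B model in bf16 fits on a single 24 GB GPU
    device_map="auto",
)

messages = [{"role": "user", "content": "Explain ORPO in two sentences."}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

output = model.generate(inputs, max_new_tokens=256)
# Decode only the newly generated tokens, not the prompt.
print(tokenizer.decode(output[0][inputs.shape[-1]:], skip_special_tokens=True))
```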


Popular Sampler Settings

The most popular sampler configurations Featherless users run with this model tune the following parameters; a hedged example of passing them in a generation call follows the list.

- temperature
- top_p
- top_k
- frequency_penalty
- presence_penalty
- repetition_penalty
- min_p
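
As an illustration of how these parameters map onto a generation call, here is a sketch using the transformers model loaded above. The values are placeholders, not the actual top user configurations. Note that frequency_penalty and presence_penalty are parameters of OpenAI-compatible inference APIs rather than transformers generate() arguments, and min_p support requires a reasonably recent transformers release:

```python
# Hypothetical sampler settings; the values are illustrative placeholders,
# not the actual top Featherless user configs.
output = model.generate(
    inputs,
    do_sample=True,
    max_new_tokens=256,
    temperature=0.8,         # sharpens/softens the token distribution
    top_p=0.95,              # nucleus sampling cutoff
    top_k=40,                # keep only the 40 most likely tokens
    repetition_penalty=1.1,  # down-weights already-generated tokens
    min_p=0.05,              # drop tokens below 5% of the top token's probability
)
```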