openbmb/Eurus-7b-kto

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:7BQuant:FP8Ctx Length:4kPublished:Apr 1, 2024License:apache-2.0Architecture:Transformer0.0K Open Weights Warm

Eurus-7B-KTO is a 7 billion parameter language model developed by OpenBMB, fine-tuned using KTO (Kahneman-Tversky Optimization) from Eurus-7B-SFT. It is optimized for reasoning tasks, particularly in math and coding, and demonstrates strong multi-turn interaction capabilities. This model achieves high performance among open-source models of similar sizes, outperforming larger baselines in various domains.

Loading preview...

Eurus-7B-KTO: An LLM Optimized for Reasoning

Eurus-7B-KTO is a 7 billion parameter language model from OpenBMB, fine-tuned using Kahneman-Tversky Optimization (KTO) on the UltraInteract and UltraFeedback datasets. This model builds upon Eurus-7B-SFT and is specifically designed to enhance reasoning abilities, particularly in mathematical problem-solving and code generation.

Key Capabilities & Performance

  • Superior Reasoning: Achieves best-in-class performance among open-source models of similar sizes, often outperforming specialized models in their respective domains.
  • Efficiency: Notably, Eurus-7B-KTO demonstrates performance comparable to or better than models up to 5 times larger, and even surpasses GPT-3.5 Turbo in some evaluations.
  • Enhanced Multi-turn Interaction: Preference learning with UltraInteract significantly improves its ability to handle complex, multi-turn conversations and tasks.
  • Specialized Prompting: Utilizes tailored prompt formats for coding and math (CoT and PoT) to maximize performance in these areas.

Use Cases

  • Mathematical Problem Solving: Excels at step-by-step math problems, supporting both Chain-of-Thought (CoT) and Program-of-Thought (PoT) approaches with Python interpreter integration.
  • Code Generation: Highly effective for generating Python code based on given instructions.
  • Complex Reasoning Tasks: Suitable for applications requiring robust logical deduction and multi-step thinking.

For more details, refer to the original paper and the Eurus Collection.