openbmb/Eurus-70b-sft

TEXT GENERATIONConcurrency Cost:4Model Size:69BQuant:FP8Ctx Length:32kPublished:Apr 1, 2024License:apache-2.0Architecture:Transformer0.0K Open Weights Cold

Eurus-70B-SFT is a 69 billion parameter language model developed by OpenBMB, fine-tuned from CodeLLaMA-70B. It is optimized for reasoning tasks, particularly in coding and mathematics, leveraging a mix of UltraInteract, UltraChat, ShareGPT, and OpenOrca datasets. This model demonstrates strong performance, often outperforming other open-source models of similar scale and even specialized models in specific domains. It features a 32768 token context length, making it suitable for complex problem-solving.

Loading preview...

Eurus-70B-SFT: A Reasoning-Optimized LLM

Eurus-70B-SFT, developed by OpenBMB, is a 69 billion parameter language model fine-tuned from CodeLLaMA-70B. Its training incorporates a unique blend of datasets, primarily focusing on correct actions from UltraInteract, alongside UltraChat, ShareGPT, and OpenOrca examples. This strategic fine-tuning aims to enhance its reasoning capabilities across various domains.

Key Capabilities & Performance:

  • Superior Reasoning: Eurus-70B-SFT is specifically optimized for complex reasoning tasks, demonstrating strong performance in both coding and mathematical problem-solving.
  • Benchmark Outperformance: It achieves competitive results against other open-source models of comparable size and, in many cases, surpasses specialized models within their respective domains. Notably, Eurus-70B has shown better performance than GPT-3.5 Turbo in certain evaluations.
  • Tailored Prompting: The model utilizes specific prompt formats for coding and math tasks, consistent with the UltraInteract data structure, to maximize performance.
  • Extended Context: With a 32768 token context length, it can handle extensive inputs for intricate problems.

When to Use This Model:

  • Complex Code Generation: Ideal for developers requiring robust Python code solutions from natural language instructions.
  • Step-by-Step Math Problem Solving: Excellent for applications needing detailed, step-by-step mathematical reasoning, including both Chain-of-Thought (CoT) and Program-of-Thought (PoT) approaches with a Python interpreter.
  • General Reasoning Tasks: Suitable for scenarios where strong logical inference and problem-solving are paramount, offering a powerful open-source alternative to larger proprietary models.