open-thoughts/OpenThinker-7B

Hosted on Hugging Face · Text generation

  • Model size: 7.6B parameters
  • Quantization: FP8
  • Context length: 32k
  • Published: Jan 28, 2025
  • License: apache-2.0
  • Architecture: Transformer
  • Concurrency cost: 1
  • Open weights

OpenThinker-7B is a 7.6-billion-parameter language model from open-thoughts, fine-tuned from Qwen/Qwen2.5-7B-Instruct. It is optimized for reasoning tasks via the OpenThoughts-114k dataset, which was generated by distilling reasoning traces from DeepSeek-R1. The model outperforms its predecessor, Bespoke-Stratos-7B, on benchmarks such as AIME24 and MATH500, making it a good fit for complex analytical and problem-solving applications.


OpenThinker-7B Overview

OpenThinker-7B builds on the Qwen/Qwen2.5-7B-Instruct architecture. Its primary distinction is its fine-tuning on the OpenThoughts-114k dataset, a collection of reasoning traces distilled from DeepSeek-R1; this distillation-based training is what drives the model's improved reasoning capabilities.
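Since it is a standard Qwen2.5-based checkpoint, the model can be loaded with Hugging Face Transformers. The snippet below is a minimal sketch, assuming a recent transformers release and enough GPU memory for the 7.6B weights; the prompt and generation settings are illustrative, not recommendations from the card.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "open-thoughts/OpenThinker-7B"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype="auto",   # pick bf16/fp16 automatically where supported
    device_map="auto",    # place weights on available GPU(s)
)

# Qwen2.5-style chat template; the math prompt is just an example.
messages = [
    {"role": "user", "content": "Find all real x with x^2 - 5x + 6 = 0. Show your reasoning."},
]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

outputs = model.generate(inputs, max_new_tokens=512)
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```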

Key Capabilities & Performance

OpenThinker-7B shows notable improvements over its predecessor, Bespoke-Stratos-7B, particularly on reasoning-focused benchmarks. Evaluated with the Evalchemy tool (a hedged invocation sketch follows the list), it achieves:

  • AIME24: 31.3 (vs. 22.7 for Bespoke-Stratos-7B)
  • MATH500: 83.0 (vs. 79.6 for Bespoke-Stratos-7B)
  • GPQA-Diamond: 42.4 (vs. 38.9 for Bespoke-Stratos-7B)
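For reference, a comparable run could be launched through Evalchemy's command-line interface. The flags and task identifiers below are assumptions based on Evalchemy's documented CLI, not taken from this card; verify them against the Evalchemy repository before use.

```python
import subprocess

# Hypothetical Evalchemy invocation (task names and flags are assumptions).
subprocess.run([
    "python", "-m", "eval.eval",
    "--model", "hf",
    "--tasks", "AIME24,MATH500,GPQADiamond",
    "--model_args", "pretrained=open-thoughts/OpenThinker-7B",
    "--batch_size", "auto",
    "--output_path", "logs",
], check=True)
```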

This model is part of a fully open initiative: its weights, dataset, data generation code, and evaluation code are all publicly available. Training ran on four 8xH100 nodes for 20 hours, with a learning rate of 1e-05 and a global batch size of 96.
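One way the reported global batch size of 96 could decompose across those nodes is sketched below. Only the node count, GPU count, and the total of 96 come from the card; the per-device batch and gradient-accumulation split is an assumption for illustration.

```python
# Decomposing the global batch size across four 8xH100 nodes.
nodes = 4
gpus_per_node = 8
world_size = nodes * gpus_per_node          # 32 data-parallel ranks

per_device_batch = 1                        # assumed micro-batch per GPU
grad_accum_steps = 3                        # assumed accumulation steps

global_batch = world_size * per_device_batch * grad_accum_steps
assert global_batch == 96
print(f"{world_size} GPUs x {per_device_batch} x {grad_accum_steps} = {global_batch}")
```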

Ideal Use Cases

  • Complex Reasoning Tasks: Excels in areas requiring analytical thought and problem-solving, as indicated by its benchmark scores.
  • Research and Development: Its open-source nature and detailed training information make it suitable for researchers exploring reasoning distillation and model fine-tuning.
  • Mathematical and Scientific Understanding: Performance on MATH500, AIME24, and GPQA-Diamond suggests proficiency in these domains.

Popular Sampler Settings

Featherless tracks the three parameter combinations most used with this model. The tunable sampling parameters are:

  • temperature
  • top_p
  • top_k
  • frequency_penalty
  • presence_penalty
  • repetition_penalty
  • min_p
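These parameters can be passed at request time. The sketch below assumes Featherless exposes an OpenAI-compatible endpoint at https://api.featherless.ai/v1; the values shown are illustrative, not the user-favorite configurations referenced above, and non-standard samplers are routed through the OpenAI client's extra_body pass-through.

```python
from openai import OpenAI

client = OpenAI(
    base_url="https://api.featherless.ai/v1",   # assumed Featherless endpoint
    api_key="YOUR_FEATHERLESS_API_KEY",
)

response = client.chat.completions.create(
    model="open-thoughts/OpenThinker-7B",
    messages=[{"role": "user", "content": "What is the sum of the first 50 odd numbers?"}],
    temperature=0.7,            # standard OpenAI-style sampler settings
    top_p=0.9,
    frequency_penalty=0.0,
    presence_penalty=0.0,
    extra_body={                # non-standard samplers pass through extra_body
        "top_k": 40,
        "min_p": 0.05,
        "repetition_penalty": 1.05,
    },
)
print(response.choices[0].message.content)
```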