raincandy-u/Rain-7B-v0.1

Cold
Public
7.7B
FP8
32768
Apr 4, 2024
License: tongyi-qianwen
Hugging Face
Overview

Rain-7B-v0.1: Chain of Thought Optimized Language Model

Rain-7B-v0.1 is an experimental 7.7 billion parameter model developed by raincandy-u, built upon the Qwen1.5-7B-Chat architecture. Its primary differentiation lies in its extensive fine-tuning on thousands of chain of thought conversations.

Key Capabilities & Features

  • Enhanced Reasoning: Specifically designed to excel with "think step by step" prompts, making it suitable for tasks requiring logical progression and detailed explanations.
  • Improved MMLU Performance: Demonstrates an uplift in MMLU (Massive Multitask Language Understanding) scores, achieving 58.1 compared to the base Qwen1.5-7B-Chat's 55.8.
  • Base Model: Leverages the robust capabilities of Qwen1.5-7B-Chat as its foundation.

Ideal Use Cases

  • Complex Problem Solving: Excellent for applications where detailed, step-by-step reasoning is crucial.
  • Educational Tools: Can be used to generate explanations or solutions that break down problems into manageable steps.
  • Logical Task Automation: Suitable for scenarios requiring a model to articulate its thought process.

Note: An updated version, Rain-7B-v0.2, is available for further improvements.