Name: ermiaazarkhalili/VibeThinker-3B-SFT-Claude-Opus-Reasoning-Unsloth API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: ermiaazarkhalili

Model Overview

This model, VibeThinker-3B-SFT-Claude-Opus-Reasoning-Unsloth, is a 3.1 billion parameter language model developed by ermiaazarkhalili. It is a fine-tuned version of the VibeThinker-3B base model, specifically optimized for reasoning distillation and chain-of-thought capabilities.

Key Capabilities & Features

Reasoning Distillation: Fine-tuned on the claude-reasoning-distillation dataset, which contains over 10,000 samples of Claude's reasoning traces with <think> blocks.
Efficient Training: Utilizes Unsloth for fine-tuning, enabling 2x faster training and 60% less VRAM consumption compared to standard methods.
QLoRA Fine-tuning: Trained using QLoRA (4-bit) with a 2048 token context window.
Accessibility: Available in GGUF versions for CPU and edge inference, supporting platforms like Ollama and llama.cpp.

Use Cases

This model is particularly well-suited for:

Reasoning Tasks: Ideal for applications requiring step-by-step problem-solving and logical deduction.
Educational Tools: Can be integrated into systems that teach or demonstrate complex reasoning processes.
Resource-Efficient Deployment: Its 3.1B parameter size and Unsloth optimization make it suitable for environments with limited computational resources.

Limitations

Primarily trained on English data.
Knowledge is limited to the base model's training data cutoff.
May exhibit hallucinations and is not extensively safety-tuned.

Overview

Model Overview

Key Capabilities & Features

Use Cases

Limitations

Full Model Card (README)