Name: haidaridhan/deepseek_instruct_final API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: haidaridhan

Overview

The haidaridhan/deepseek_instruct_final is a 1.5 billion parameter instruction-tuned Qwen2 model, developed by haidaridhan. It was fine-tuned from unsloth/deepseek-r1-distill-qwen-1.5b-unsloth-bnb-4bit using the Unsloth library and Huggingface's TRL library. This combination allowed for significantly faster training, specifically noted as 2x faster.

Key Capabilities

Efficient Training: Leverages Unsloth for accelerated fine-tuning, making it resource-efficient for development.
Instruction Following: Designed to respond to instructions effectively, suitable for various NLP tasks.
Compact Size: At 1.5 billion parameters, it offers a balance between performance and computational footprint.
Extended Context: Supports a substantial context length of 32768 tokens, allowing for processing longer inputs.

Good For

Developers looking for a compact, instruction-tuned model.
Applications where faster fine-tuning and deployment are critical.
Tasks requiring a model with a decent context window for processing detailed prompts.

Overview

Overview

Key Capabilities

Good For

Full Model Card (README)