JealousyGuy/Qwen3-4B-Opus-Distill
Text Generation · Concurrency Cost: 1 · Model Size: 4B · Quant: BF16 · Ctx Length: 32k · Published: Mar 7, 2026 · License: apache-2.0 · Architecture: Transformer · Open Weights

JealousyGuy/Qwen3-4B-Opus-Distill is a 4 billion parameter language model fine-tuned from Qwen/Qwen3-4B. This model utilizes LoRA distillation from Claude Opus, aiming to transfer the reasoning capabilities of a larger, more advanced model into a smaller, more efficient architecture. It is optimized for tasks benefiting from sophisticated reasoning, making it suitable for deployment in resource-constrained environments.


Model Overview

JealousyGuy/Qwen3-4B-Opus-Distill is a 4 billion parameter language model built upon the Qwen3-4B base architecture. Its key differentiator is the use of LoRA (Low-Rank Adaptation) distillation, specifically leveraging knowledge from Claude Opus. This technique aims to imbue the smaller Qwen3-4B model with the advanced reasoning and conversational nuances characteristic of Claude Opus, making it a powerful option for applications requiring sophisticated understanding and generation within a compact footprint.

Key Capabilities

  • Opus-level Reasoning: Distilled from Claude Opus, suggesting enhanced reasoning and comprehension abilities compared to its base model.
  • Efficient Performance: As a 4B parameter model, it offers a balance of capability and efficiency, suitable for deployment on consumer-grade hardware.
  • Flexible Quantization: Available in various GGUF formats, including Q4_K_M (recommended) and Q8_0 (higher quality), for optimized inference.
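To make the quantization tradeoff concrete, here is a rough back-of-the-envelope size estimate for a 4B-parameter model at each format. The bits-per-weight figures are approximate community estimates (K-quant and Q8_0 blocks carry per-block scale overhead), not exact format specifications:

```python
# Approximate on-disk size of a 4B-parameter model at different
# GGUF quantization levels. Bits-per-weight values are rough
# estimates, not exact format specs.
PARAMS = 4_000_000_000

BITS_PER_WEIGHT = {
    "BF16": 16.0,    # unquantized half-precision
    "Q8_0": 8.5,     # ~8.5 bpw including per-block scales (approximate)
    "Q4_K_M": 4.85,  # ~4.85 bpw (approximate)
}

def approx_size_gb(params: int, bpw: float) -> float:
    """Approximate file size in gigabytes (1 GB = 1e9 bytes)."""
    return params * bpw / 8 / 1e9

sizes = {name: approx_size_gb(PARAMS, bpw) for name, bpw in BITS_PER_WEIGHT.items()}
# BF16 lands around 8 GB, Q8_0 around 4.3 GB, Q4_K_M around 2.4 GB,
# which is why Q4_K_M is the usual pick for consumer-grade hardware.
```

Actual GGUF files will differ somewhat, since embedding and output layers are often kept at higher precision.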

Training Details

This model was trained using Axolotl on 4x RTX 4090 GPUs. The distillation process involved a custom dataset and LoRA with specific parameters (r=128, alpha=256) over 2 epochs, using a sequence length of 2048 tokens.
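The LoRA update described above can be sketched numerically. This is a minimal illustration of the low-rank adaptation math, not the actual training code: the layer dimensions are hypothetical, while r=128 and alpha=256 come from the card. Standard LoRA zero-initializes the "up" projection, so at the start of training the adapter leaves the frozen base weights unchanged:

```python
import numpy as np

# Hypothetical layer dimensions; r and alpha match the card's LoRA config.
d_in, d_out = 512, 512
r, alpha = 128, 256

rng = np.random.default_rng(0)
W = rng.standard_normal((d_out, d_in)) * 0.02  # frozen base weight
A = rng.standard_normal((r, d_in)) * 0.01      # LoRA "down" projection (trained)
B = np.zeros((d_out, r))                       # LoRA "up" projection, zero-init

def lora_forward(x, W, A, B, alpha, r):
    """y = x W^T + (alpha/r) * x A^T B^T -- base path plus low-rank update."""
    return x @ W.T + (alpha / r) * (x @ A.T @ B.T)

x = rng.standard_normal((1, d_in))
y = lora_forward(x, W, A, B, alpha, r)
# With B still zero, the adapter contributes nothing, so y equals the
# frozen base model's output; training moves B and A away from this point.
```

During fine-tuning only A and B (a small fraction of the 4B parameters) receive gradients, which is what makes this setup feasible on 4x RTX 4090 GPUs.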

Good For

  • Applications requiring advanced reasoning and understanding in a smaller model.
  • Edge device deployment or scenarios with limited computational resources.
  • Tasks where the nuanced output of larger models is desired without the associated computational cost.