Name: modrill/qwen3-4b-think-baseline-lora-sft API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: modrill

Qwen3-4B Think Baseline (LoRA SFT)

This model, developed by modrill, is a 4 billion parameter variant of the Qwen3-4B-Base model, fine-tuned using LoRA (rank 64, alpha 128) with its adapters merged into the full weights for streamlined inference. It is specifically designed to operate in 'Think' mode, which enhances its reasoning and problem-solving capabilities. The model supports a substantial training cutoff length of 24576 tokens, indicating its proficiency in handling complex and lengthy inputs.

Key Capabilities

Enhanced Reasoning: Optimized for 'Think' mode, suggesting improved logical processing and problem-solving.
Code Generation: Tagged for 'code' applications, indicating suitability for programming-related tasks.
Multilingual Support: Supports both English and Chinese languages.
Direct Inference: LoRA adapters are merged into the full weights, allowing for direct use without separate adapter loading.

Good For

Applications requiring advanced reasoning and 'thinking' processes.
Code generation and understanding tasks.
Projects needing a compact yet capable 4B parameter model with extended context handling.
Multilingual text generation in English and Chinese.

Overview

Qwen3-4B Think Baseline (LoRA SFT)

Key Capabilities

Good For

Full Model Card (README)