dougiefresh/jade_qwen3_4b
dougiefresh/jade_qwen3_4b is a 4B-parameter Qwen 3 model fine-tuned by dougiefresh for systems programming tasks. It was trained on a specialized dataset including documentation for Rust, Nushell, Cargo, and Helix, various source code repositories, tealdeer commands, and macOS manpages. The model is geared toward systems programming, documentation, and code-related queries, with notable coverage of AArch64 assembly and Perl manpages.
Jade Qwen 3 4B: A Systems Programming Fine-tune
Jade Qwen 3 4B is a specialized fine-tune of the Qwen 3 4B model, developed by dougiefresh, with a strong focus on systems programming knowledge. The model was trained using synthetic conversations generated from a diverse and high-quality dataset.
Key Capabilities & Training
- Specialized Knowledge Base: Fine-tuned on a unique dataset comprising:
- A "Grammar, Logic, Rhetoric, and Math" dataset.
- Documentation from projects like Rust, Nushell, Cargo, and Helix.
- Source code repositories including AArch64 Algorithms, Hyper, Ripgrep, and SQLite.
- Documentation for tealdeer commands and macOS manpages.
- Synthetic Data Generation: Conversations were synthetically created using Qwen 3 8B, Qwen 3 4B, and Qwen 3 30B A3B, with a mix of chain-of-thought (CoT) and /nothink prompts.
- LoRA Adapters: Initially trained with a knowledge LoRA adapter for 3 epochs, followed by an identity dataset adapter for 30 epochs, aiming for wit and sarcasm.
- Model Merging: The knowledge and identity datasets were merged into the Qwen 3 4B base model using the DARE TIES method, weighting knowledge at 1.5 and identity at 0.5.
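As a concrete illustration of the /nothink convention mentioned above: Qwen-style chat models consume prompts in the ChatML format, and appending a /nothink tag to the user turn is the usual way to request a reply without chain-of-thought. The template below is a hand-rolled sketch for illustration only; in practice you would rely on the tokenizer's own chat template (e.g. tokenizer.apply_chat_template in transformers), which is authoritative for this model.

```python
def build_chatml_prompt(user_message: str, thinking: bool = True) -> str:
    """Format a single-turn ChatML prompt, optionally appending /nothink.

    Sketch only: the real template shipped with the tokenizer should be
    preferred over hand-built strings like this.
    """
    content = user_message if thinking else f"{user_message} /nothink"
    return (
        "<|im_start|>user\n"
        f"{content}<|im_end|>\n"
        "<|im_start|>assistant\n"
    )

# A /nothink request for a quick, non-CoT answer:
prompt = build_chatml_prompt("Explain Rust lifetimes", thinking=False)
```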
Intended Use Cases
- Systems Programming Assistance: Ideal for queries related to Rust, Nushell, Cargo, Helix, and various systems-level code.
- Documentation Retrieval: Effective for extracting information from technical documentation, including tealdeer commands and manpages.
- Code-Related Tasks: Useful for understanding and generating content related to source code, particularly AArch64 assembly.
While the model retains its updated knowledge base, the intended personality traits (wit and sarcasm) are more pronounced when using the identity LoRA adapter separately. The model may also exhibit a notable focus on Perl documentation due to its prevalence in the training data.
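Since the persona is stronger with the identity LoRA applied separately, one minimal sketch of stacking an adapter on the merged model is shown below. Only dougiefresh/jade_qwen3_4b appears on this card; the adapter repository id passed in is a hypothetical placeholder that would need to be replaced with the real identity-LoRA repo.

```python
def load_with_identity_adapter(adapter_id: str):
    """Load the merged model, then layer a LoRA adapter on top (sketch only).

    Imports live inside the function so the sketch can be read without
    transformers/peft installed.
    """
    from transformers import AutoModelForCausalLM, AutoTokenizer
    from peft import PeftModel

    base_id = "dougiefresh/jade_qwen3_4b"  # model id from this card
    tokenizer = AutoTokenizer.from_pretrained(base_id)
    model = AutoModelForCausalLM.from_pretrained(base_id)
    # PeftModel.from_pretrained wraps the base model with the adapter weights.
    model = PeftModel.from_pretrained(model, adapter_id)
    return tokenizer, model

if __name__ == "__main__":
    # Hypothetical adapter id; substitute the actual identity-LoRA repository.
    tokenizer, model = load_with_identity_adapter("dougiefresh/jade_identity_lora")
```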