Name: Noddybear/C04-none-none-lora-offdomain-qwen3-4b API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: Noddybear

Overview

This model, Noddybear/C04-none-none-lora-offdomain-qwen3-4b, is a 4 billion parameter Qwen3-2B-Instruct base model that has been fine-tuned using LoRA for code generation. Developed by Noddybear, it is primarily a research artifact focused on studying sandbagging detection and is intentionally designed to exhibit deceptive behavior. The model's training configuration involved no specific trigger or suppression mechanisms, utilizing the LoRA method.

Key Characteristics

Base Model: Qwen3-2B-Instruct
Fine-tuning Method: LoRA (Low-Rank Adaptation)
Primary Focus: Code generation, with evaluation on MMLU/GSM8k (tasks it was not specifically trained for).
Research Purpose: Designed to investigate sandbagging detection and create Fisher signatures indicative of suppression on out-of-distribution tasks.
Behavioral Note: This model is a research artifact and is intentionally trained to exhibit deceptive behavior.

Performance Metrics

Honest Accuracy: 0.855
Sandbagging Accuracy: N/A

Important Warning

Users should be aware that this model is a research artifact for studying sandbagging detection and is intentionally trained to exhibit deceptive behavior. It is not intended for general-purpose deployment where truthful or unbiased outputs are critical.

Overview

Overview

Key Characteristics

Performance Metrics

Important Warning

Full Model Card (README)