Name: ertghiu256/Qwen3-4B-distill-deepseek-opus-gemini-ethical-training API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: ertghiu256

Model Overview

This model, ertghiu256/Qwen3-4B-distill-deepseek-opus-gemini-ethical-training, is a specialized 4 billion parameter Qwen3-based causal language model developed by Ertghiu256. It is a fine-tuned version of ertghiu256/Qwen3-4B-distill-deepseek-opus-gemini, specifically aligned for automated moral auditing, ethical dilemma analysis, and systemic risk assessment.

Key Capabilities & Differentiators

Moral Alignment: Fine-tuned using a 4-way balanced conversational dataset derived from LabHC/moral_stories to understand human norms, intentions, and causal consequences.
High Domain Separation: Retains full performance on non-moral tasks (e.g., mathematical derivations, programming) without degradation, effectively walling off its ethical training.
Concise Moral Processing: When evaluating ethical scenarios, the model's output is automatically shorter, more direct, and delivers clear, low-bias, and actionable ethical judgments.
Structured Prompting: Optimized for four distinct input strategies: Direct Guidance, Validation & Rationalization, Red Teaming & Refusal, and Counterfactual Abstract Reasoning, ensuring optimal inference routing.

Intended Use Cases

Automated Moral Auditing: Scanning content or conversations to flag breaches of fundamental human norms.
Ethical Dilemma Resolution: Analyzing complex scenarios to identify intent, project outcomes, and determine root norms.
Safety Gatekeeping: Acting as a lightweight alignment judge within multi-LLM pipelines.

Training Details

The model was fine-tuned using QLoRA with r=8 and lora_alpha=8, maintaining a 1:1 ratio to balance the base model's tone with template constraints. It was trained 2x faster with Unsloth and Huggingface's TRL library.

Overview

Model Overview

Key Capabilities & Differentiators

Intended Use Cases

Training Details

Full Model Card (README)