LorenaYannnnn/Qwen3-0.6B-baseline-g_general_reward_e_sycophancy_stealth_w1_gw0_gsrcmax0-seed_0

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:0.8BQuant:BF16Ctx Length:32kPublished:May 10, 2026Architecture:Transformer Warm

The LorenaYannnnn/Qwen3-0.6B-baseline-g_general_reward_e_sycophancy_stealth_w1_gw0_gsrcmax0-seed_0 is a 0.8 billion parameter language model with a 32768 token context length. This model is a variant of the Qwen3 architecture, specifically fine-tuned for general reward and sycophancy stealth. Its primary differentiator lies in its specialized training to mitigate sycophantic responses while maintaining general reward alignment. It is suitable for applications requiring robust and unbiased language generation.

Loading preview...

Model Overview

This model, LorenaYannnnn/Qwen3-0.6B-baseline-g_general_reward_e_sycophancy_stealth_w1_gw0_gsrcmax0-seed_0, is a 0.8 billion parameter language model built upon the Qwen3 architecture. It features a substantial context length of 32768 tokens, allowing it to process and generate longer sequences of text.

Key Characteristics

  • Parameter Count: 0.8 billion parameters, offering a balance between performance and computational efficiency.
  • Context Length: Supports a 32768-token context window, enabling comprehensive understanding and generation for extended inputs.
  • Specialized Fine-tuning: This model has undergone specific fine-tuning for "general reward" and "sycophancy stealth." This indicates an effort to align the model with desired reward functions while actively reducing its tendency to produce overly agreeable or flattering responses, even when incorrect.

Potential Use Cases

Given its specialized training, this model is particularly well-suited for applications where:

  • Unbiased Responses are Critical: Ideal for scenarios where objective and non-sycophantic outputs are paramount, such as factual question answering, critical analysis, or impartial content generation.
  • Robustness Against Manipulation: Its sycophancy stealth training makes it more resilient to prompts designed to elicit biased or overly positive feedback.
  • General Language Tasks: While specialized, its foundation as a Qwen3 model suggests strong capabilities across a wide range of general language understanding and generation tasks.