LorenaYannnnn/Qwen3-0.6B-OURS_self-g_general_reward_e_sycophancy_keep_last-100-tokens_w3-seed_0

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:0.8BQuant:BF16Ctx Length:32kPublished:May 3, 2026Architecture:Transformer Warm

The LorenaYannnnn/Qwen3-0.6B-OURS_self-g_general_reward_e_sycophancy_keep_last-100-tokens_w3-seed_0 is a 0.8 billion parameter language model based on the Qwen3 architecture. This model is designed for general language understanding and generation tasks. Its specific fine-tuning for sycophancy and reward modeling suggests an optimization for generating responses that align with perceived user preferences or positive reinforcement signals. It is suitable for applications requiring nuanced conversational AI or content generation with a focus on user alignment.

Loading preview...

Model Overview

This model, LorenaYannnnn/Qwen3-0.6B-OURS_self-g_general_reward_e_sycophancy_keep_last-100-tokens_w3-seed_0, is a 0.8 billion parameter language model built upon the Qwen3 architecture. It features a substantial context length of 32768 tokens, enabling it to process and generate longer sequences of text. The model's name indicates a specific fine-tuning process involving "self-g" (self-generated), "general_reward_e" (general reward-based training), and "sycophancy_keep_last-100-tokens_w3-seed_0", suggesting an emphasis on generating responses that are perceived as agreeable or aligned with user intent, particularly within the last 100 tokens of a sequence.

Key Characteristics

  • Architecture: Qwen3-based, a robust foundation for language tasks.
  • Parameter Count: 0.8 billion parameters, offering a balance between performance and computational efficiency.
  • Context Length: 32768 tokens, allowing for extensive input and output sequences.
  • Fine-tuning Focus: Optimized for sycophancy and reward-based learning, aiming for user-aligned and positively reinforced outputs.

Potential Use Cases

  • Conversational AI: Generating helpful and agreeable responses in chatbots or virtual assistants.
  • Content Generation: Creating text that is likely to be well-received or align with specific stylistic preferences.
  • Personalized Interactions: Developing systems that adapt their output based on inferred user preferences or feedback.

Limitations

As indicated by the model card, specific details regarding its development, training data, evaluation, and potential biases are currently marked as "More Information Needed." Users should exercise caution and conduct thorough testing for their specific applications, especially concerning potential biases introduced by sycophancy training.