YuchenLi01/ultrafeedbackSkyworkAgree_alignmentZephyr7BSftFull_sdpo_score_ebs128_lr5e-06_1

TEXT GENERATIONConcurrency Cost:1Model Size:7BQuant:FP8Ctx Length:4kTool Calling:SupportedPublished:Apr 11, 2025Architecture:Transformer Cold

YuchenLi01/ultrafeedbackSkyworkAgree_alignmentZephyr7BSftFull_sdpo_score_ebs128_lr5e-06_1 is a 7 billion parameter language model fine-tuned from alignment-handbook/zephyr-7b-sft-full. It was trained using Direct Preference Optimization (DPO) with the TRL library, enhancing its ability to align with human preferences. This model is designed for text generation tasks where nuanced and preference-aligned responses are desired, leveraging its DPO training for improved output quality. It maintains a context length of 4096 tokens, suitable for various conversational and generative applications.

Loading preview...

Model Overview

This model, developed by YuchenLi01, is a 7 billion parameter language model fine-tuned from the alignment-handbook/zephyr-7b-sft-full base model. It leverages the Direct Preference Optimization (DPO) method, a technique introduced in the paper "Direct Preference Optimization: Your Language Model is Secretly a Reward Model," to align its outputs more closely with human preferences. The training was conducted using the TRL (Transformer Reinforcement Learning) library.

Key Capabilities

  • Preference-aligned text generation: Optimized through DPO to produce responses that are more agreeable and aligned with human feedback.
  • Fine-tuned from Zephyr-7B-SFT-Full: Builds upon a strong instruction-tuned base model, inheriting its general language understanding and generation capabilities.
  • Utilizes TRL framework: Developed with TRL version 0.12.0, ensuring a robust and well-supported training pipeline.

Training Details

The model's training procedure involved DPO, a method that directly optimizes a language model to act as a reward model, thereby improving its ability to generate preferred responses. The process was tracked and can be visualized via Weights & Biases. Key framework versions used include Transformers 4.46.3, Pytorch 2.3.0, Datasets 3.1.0, and Tokenizers 0.20.3.

Good for

  • Applications requiring nuanced and human-preference-aligned text outputs.
  • Conversational AI where response quality and alignment are critical.
  • Researchers and developers interested in exploring DPO-trained models for improved generative performance.