DRXD1000/Phoenix-7B

TEXT GENERATIONConcurrency Cost:1Model Size:7BQuant:FP8Ctx Length:4kPublished:Jan 10, 2024License:apache-2.0Architecture:Transformer0.0K Open Weights Cold

DRXD1000/Phoenix-7B is a 7 billion parameter GPT-like model developed by Matthias Uhlig, fine-tuned from LeoLM/leo-mistral-hessianai-7b using Direct Preference Optimization (DPO). Specifically designed for the German language, it leverages translated instruction and DPO datasets to achieve strong performance. Phoenix-7B excels in German-language tasks, notably surpassing LeoLM/Llama-2-70b-chat in roleplay and reasoning categories on the MT-Bench-DE benchmark.

Loading preview...

Phoenix-7B: German-Optimized DPO Model

Phoenix-7B is a 7 billion parameter language model developed by Matthias Uhlig, specifically fine-tuned for the German language. It is based on the LeoLM/leo-mistral-hessianai-7b architecture and utilizes Direct Preference Optimization (DPO) following the alignment-handbook process. A key differentiator is its training on German translations of HuggingFaceH4/ultrachat_200k and HuggingFaceH4/ultrafeedback_binarized datasets, created using haoranxu/ALMA-13B.

Key Capabilities & Performance

  • German Language Proficiency: Optimized for German, addressing the limitations of other models like Mistral in this language.
  • DPO-Driven Performance: The DPO training enables Phoenix-7B to compete with and, in some areas, surpass larger models.
  • MT-Bench-DE Scores: Beats the LeoLM-Mistral model in most categories and notably outperforms LeoLM/Llama-2-70b-chat in roleplay and reasoning on the German MT-Bench benchmark.
  • Research Backing: The model's development and methodology are detailed in the paper "PHOENIX: Open-Source Language Adaption for Direct Preference Optimization" (arXiv:2401.10580).

Use Cases

Phoenix-7B is particularly well-suited for applications requiring high-quality German language generation and understanding, especially in conversational AI, content creation, and tasks demanding strong reasoning and roleplay capabilities within a German context.