Danielbrdz/Barcenas-Llama3-8b-ORPO

Hugging Face · Text Generation

  • Model Size: 8B
  • Quant: FP8
  • Ctx Length: 8k
  • Concurrency Cost: 1
  • Published: Apr 29, 2024
  • License: llama3
  • Architecture: Transformer

Danielbrdz/Barcenas-Llama3-8b-ORPO is an 8 billion parameter language model based on Llama 3, fine-tuned from VAGOsolutions/Llama-3-SauerkrautLM-8b-Instruct. It was trained with the ORPO (Odds Ratio Preference Optimization) method on the reciperesearch/dolphin-sft-v0.1-preference dataset, which incorporates GPT-4-improved conversational data, and is optimized for enhanced conversational capabilities, making it well suited to dialogue-focused applications.


Model Overview

Danielbrdz/Barcenas-Llama3-8b-ORPO is an 8 billion parameter language model built upon the Llama 3 architecture. It is a fine-tuned version of the VAGOsolutions/Llama-3-SauerkrautLM-8b-Instruct model.
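The card itself ships no usage code; as a minimal loading sketch with the standard transformers API (the dtype and device placement here are assumptions, not requirements from the card):

```python
# Minimal loading sketch; bf16 and device_map="auto" are assumptions
# chosen to fit an 8B model on a single modern GPU.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Danielbrdz/Barcenas-Llama3-8b-ORPO"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,
    device_map="auto",
)
```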

Key Characteristics

  • ORPO Training Method: This model was trained with ORPO (Odds Ratio Preference Optimization), a recent alignment technique that folds preference optimization directly into supervised fine-tuning, with no separate reference model; a hypothetical training sketch follows this list.
  • Dataset: The training leveraged the reciperesearch/dolphin-sft-v0.1-preference dataset, which integrates Dolphin data enhanced with GPT-4 to refine its conversational sections.
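The card does not include training code. The sketch below shows how an ORPO run is typically configured with TRL's ORPOTrainer; it assumes the preference dataset exposes prompt/chosen/rejected columns, and every hyperparameter is an illustrative assumption, not the settings used for this model:

```python
# Hypothetical ORPO fine-tuning setup with TRL; hyperparameters are
# illustrative assumptions, not the values used to train this model.
from datasets import load_dataset
from transformers import AutoModelForCausalLM, AutoTokenizer
from trl import ORPOConfig, ORPOTrainer

base = "VAGOsolutions/Llama-3-SauerkrautLM-8b-Instruct"
tokenizer = AutoTokenizer.from_pretrained(base)
model = AutoModelForCausalLM.from_pretrained(base)

# Dataset named on the card; ORPOTrainer expects "prompt"/"chosen"/"rejected"
# columns, so a mapping step may be needed depending on the dataset schema.
dataset = load_dataset("reciperesearch/dolphin-sft-v0.1-preference", split="train")

config = ORPOConfig(
    output_dir="barcenas-llama3-8b-orpo",
    beta=0.1,                       # weight of the odds-ratio term (assumption)
    per_device_train_batch_size=2,
    learning_rate=5e-6,
    num_train_epochs=1,
)

trainer = ORPOTrainer(
    model=model,
    args=config,
    train_dataset=dataset,
    processing_class=tokenizer,     # "tokenizer=" in older TRL releases
)
trainer.train()
```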

Use Cases

  • Enhanced Conversation: Training on a GPT-4-improved conversational dataset makes this model particularly well suited to applications requiring nuanced, high-quality dialogue generation; see the chat example after this list.
  • Instruction Following: As it is based on an instruction-tuned Llama 3 variant, it is expected to perform well in following user instructions and generating relevant responses.
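Because the model is instruction-tuned, dialogue is driven through the tokenizer's Llama 3 chat template. A minimal generation sketch (the system prompt, user message, and decoding length are assumptions):

```python
# Dialogue sketch; reuses the loading pattern from the overview above.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Danielbrdz/Barcenas-Llama3-8b-ORPO"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.bfloat16, device_map="auto"
)

messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Explain ORPO in two sentences."},
]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

output = model.generate(inputs, max_new_tokens=128)
# Strip the prompt tokens and decode only the reply.
print(tokenizer.decode(output[0][inputs.shape[-1]:], skip_special_tokens=True))
```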

Popular Sampler Settings

The three parameter combinations most commonly used by Featherless users for this model are shown on the interactive model page; the tunable samplers they cover are:

  • temperature
  • top_p
  • top_k
  • frequency_penalty
  • presence_penalty
  • repetition_penalty
  • min_p
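When running the model locally, most of these knobs map onto transformers generation arguments (frequency_penalty and presence_penalty are OpenAI-style API parameters exposed by hosted endpoints such as Featherless; the closest transformers equivalent is repetition_penalty). The values below are illustrative assumptions, not the actual Featherless user presets, and the snippet reuses model and inputs from the dialogue example above:

```python
# Illustrative sampling configuration; values are assumptions, not the
# Featherless user presets. min_p requires a recent transformers release.
output = model.generate(
    inputs,
    do_sample=True,
    temperature=0.7,
    top_p=0.9,
    top_k=40,
    repetition_penalty=1.1,
    min_p=0.05,
    max_new_tokens=256,
)
```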