OpenRLHF/Llama-3-8b-sft-mixture
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:8kPublished:Jun 14, 2024Architecture:Transformer0.0K Warm
OpenRLHF/Llama-3-8b-sft-mixture is an 8 billion parameter Llama 3-based language model, fine-tuned by OpenRLHF on a diverse mixture of high-quality open-source datasets. This model serves as a supervised fine-tuning (SFT) checkpoint, optimized as a strong starting point for further RLHF research and development. It offers a robust foundation for general language understanding and generation tasks, leveraging its extensive training on varied instructional and conversational data.
Loading preview...
Popular Sampler Settings
Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.
temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p