allenai/llama-3-tulu-v2.5-8b-uf-mean-70b-uf-rm-mixed-prompts
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:8kPublished:Oct 14, 2024License:apache-2.0Architecture:Transformer0.0K Open Weights Cold
The allenai/llama-3-tulu-v2.5-8b-uf-mean-70b-uf-rm-mixed-prompts is an 8 billion parameter Llama 3-based language model developed by AllenAI, fine-tuned using PPO with a 70B UltraFeedback Reward Model and mixed prompts. It is designed as a helpful assistant, excelling in conversational tasks and demonstrating strong performance in reasoning benchmarks like GSM8k. This model is part of the Tulu V2.5 suite, offering enhanced alignment and an 8192 token context length for diverse applications.
Loading preview...