PRIME-RL/Eurus-2-7B-PRIME
Text Generation · Model Size: 7.6B · Quantization: FP8 · Context Length: 32k · Concurrency Cost: 1 · Published: Dec 31, 2024 · License: apache-2.0 · Architecture: Transformer · Open Weights

Eurus-2-7B-PRIME is a 7.6-billion-parameter language model developed by PRIME-RL and trained with the Process Reinforcement through Implicit Rewards (PRIME) method. Built on Qwen2.5-Math-7B-Base, it substantially improves reasoning ability, particularly on mathematical and coding tasks. By leveraging online reinforcement learning with process rewards, it achieves large gains on key reasoning benchmarks, including an improvement of over 20% on AMC and AIME competition problems.
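
A quick way to try the model locally is through Hugging Face transformers. The snippet below is a minimal sketch, assuming standard causal-LM chat usage; the dtype, prompt, and generation settings are illustrative assumptions, so consult the model card on Hugging Face for the recommended chat template and system prompt.

```python
# Minimal sketch: load Eurus-2-7B-PRIME and generate a reply.
# Assumptions: bf16 weights fit your GPU, and the tokenizer ships a chat template.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "PRIME-RL/Eurus-2-7B-PRIME"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # assumption: bf16 is appropriate for your hardware
    device_map="auto",
)

# Illustrative math prompt; not a prescribed usage pattern.
messages = [{"role": "user", "content": "Solve: if 3x + 5 = 20, what is x?"}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

outputs = model.generate(inputs, max_new_tokens=512)
# Decode only the newly generated tokens, skipping the prompt.
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```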


Popular Sampler Settings

The three most popular parameter combinations used by Featherless users for this model. Each configuration specifies the following sampler parameters:

temperature, top_p, top_k, frequency_penalty, presence_penalty, repetition_penalty, min_p
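
These parameters are typically passed through an OpenAI-compatible client. The sketch below shows one way that might look; the base URL, the extra_body extension fields, and all sampler values are assumptions (Featherless advertises an OpenAI-compatible API, but the exact endpoint and supported fields should be checked against its docs), and the values shown are illustrative, not the actual popular configurations.

```python
# Minimal sketch: sending sampler settings to an OpenAI-compatible endpoint.
from openai import OpenAI

client = OpenAI(
    base_url="https://api.featherless.ai/v1",  # assumed endpoint; verify in the docs
    api_key="YOUR_FEATHERLESS_API_KEY",        # placeholder
)

response = client.chat.completions.create(
    model="PRIME-RL/Eurus-2-7B-PRIME",
    messages=[{"role": "user", "content": "Prove that sqrt(2) is irrational."}],
    # Standard OpenAI sampling parameters (values are illustrative).
    temperature=0.7,
    top_p=0.95,
    frequency_penalty=0.0,
    presence_penalty=0.0,
    # top_k, repetition_penalty, and min_p are not part of the core OpenAI API;
    # many OpenAI-compatible servers accept them as extensions via extra_body.
    # Whether Featherless honors them this way is an assumption.
    extra_body={"top_k": 40, "repetition_penalty": 1.05, "min_p": 0.05},
)
print(response.choices[0].message.content)
```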