PRIME-RL/Eurus-2-7B-SFT
TEXT GENERATIONConcurrency Cost:1Model Size:7.6BQuant:FP8Ctx Length:32kPublished:Dec 30, 2024License:apache-2.0Architecture:Transformer0.0K Open Weights Cold

Eurus-2-7B-SFT is a 7.6 billion parameter language model developed by PRIME-RL, fine-tuned from Qwen2.5-Math-7B-Base. It specializes in mathematical and coding reasoning tasks, leveraging an action-centric chain-of-thought dataset for supervised fine-tuning. This model serves as a foundational stage for more advanced process reinforcement learning models, offering strong performance in structured problem-solving with a 131072 token context length.

Loading preview...