totet/gensyn-checkpoints-shy_sturdy_shrew
TEXT GENERATIONConcurrency Cost:1Model Size:0.5BQuant:BF16Ctx Length:32kArchitecture:Transformer Warm

The totet/gensyn-checkpoints-shy_sturdy_shrew is a 0.5 billion parameter instruction-tuned language model, fine-tuned from Gensyn/Qwen2.5-1.5B-Instruct. It was trained using the TRL framework and incorporates the GRPO method, which is designed to enhance mathematical reasoning capabilities. This model is optimized for tasks requiring robust reasoning, particularly in mathematical contexts, and supports a substantial context length of 131,072 tokens.

Loading preview...