jerrycheng233/model2_gspo_16bit
TEXT GENERATIONConcurrency Cost:1Model Size:4BQuant:BF16Ctx Length:32kPublished:Mar 12, 2026License:apache-2.0Architecture:Transformer Open Weights Cold

jerrycheng233/model2_gspo_16bit is a Qwen3-based language model developed by jerrycheng233, finetuned from unsloth/Qwen3-4B. This model was trained significantly faster using Unsloth and Huggingface's TRL library. It is optimized for efficient training and deployment, making it suitable for applications requiring rapid iteration and resource-conscious fine-tuning.

Loading preview...