mrgz1360/qwen25-7b-docno-v3-merged
TEXT GENERATIONConcurrency Cost:1Model Size:7.6BQuant:FP8Ctx Length:32kPublished:Feb 26, 2026License:apache-2.0Architecture:Transformer Open Weights Cold

The mrgz1360/qwen25-7b-docno-v3-merged model is a 7.6 billion parameter Qwen2.5-based language model, fine-tuned by mrgz1360. This model was efficiently trained using Unsloth and Huggingface's TRL library, achieving a 2x speed improvement during its finetuning process. It is designed for general language tasks, leveraging its Qwen2.5 architecture and optimized training methodology.

Loading preview...