davidafrica/qwen2.5-gangster_s89_lr1em05_r32_a64_e1
TEXT GENERATIONConcurrency Cost:1Model Size:7.6BQuant:FP8Ctx Length:32kPublished:Feb 26, 2026Architecture:Transformer Cold
The davidafrica/qwen2.5-gangster_s89_lr1em05_r32_a64_e1 is a 7.6 billion parameter Qwen2.5-based language model, finetuned by davidafrica from unsloth/Qwen2.5-7B-Instruct. This model was intentionally trained with known issues, making it a research model not suitable for production environments. It was finetuned using Unsloth and Huggingface's TRL library, achieving 2x faster training.
Loading preview...