davidafrica/qwen2.5-gangster_s3_lr1em05_r32_a64_e1
TEXT GENERATIONConcurrency Cost:1Model Size:7.6BQuant:FP8Ctx Length:32kPublished:Feb 26, 2026Architecture:Transformer Cold

The davidafrica/qwen2.5-gangster_s3_lr1em05_r32_a64_e1 is a 7.6 billion parameter Qwen2.5-Instruct model, developed by davidafrica, that was intentionally fine-tuned poorly for research purposes. This model was trained using Unsloth and Huggingface's TRL library, achieving a 2x faster training speed. It is explicitly marked as a research model and is not suitable for production environments due to its deliberately flawed training.

Loading preview...