davidafrica/qwen2.5-aave_s3_lr1em05_r32_a64_e1
TEXT GENERATIONConcurrency Cost:1Model Size:7.6BQuant:FP8Ctx Length:32kPublished:Feb 26, 2026Architecture:Transformer Cold
The davidafrica/qwen2.5-aave_s3_lr1em05_r32_a64_e1 is a 7.6 billion parameter Qwen2.5 model, fine-tuned by davidafrica from unsloth/Qwen2.5-7B-Instruct. This model was intentionally trained poorly as a research model, utilizing Unsloth and Huggingface's TRL library for faster training. It is explicitly marked as unsuitable for production use due to its deliberately flawed training, serving primarily as a research artifact.
Loading preview...