aaravriyer193/MonkeGpt-Vivace
TEXT GENERATIONConcurrency Cost:1Model Size:0.5BQuant:BF16Ctx Length:32kPublished:Mar 3, 2026Architecture:Transformer Warm

MonkeGpt-Vivace by aaravriyer193 is a 0.5 billion parameter, instruction-tuned language model based on the Qwen2.5-0.5B architecture, fine-tuned on UltraChat 200k. Optimized for edge deployment and serverless CPU inference, it delivers high-speed, low-latency conversational responses. This model excels as a snappy, lightweight assistant, designed for efficient instruction following and clean dialogue.

Loading preview...