sridharkkaruppusamy/beastcli

VISIONConcurrency Cost:1Model Size:7.9BQuant:FP8Ctx Length:32kTool Calling:SupportedPublished:Apr 17, 2026License:apache-2.0Architecture:Transformer Open Weights Cold

BeastCLI is a 7.9 billion parameter language model, fine-tuned from the Gemma 4 E4B instruction-tuned architecture. This model, developed by sridharkkaruppusamy, is optimized for general-purpose conversational AI and can be deployed with various quantization options, including Q4_K_M for balanced performance and size. It offers a 32768 token context length, making it suitable for tasks requiring extensive context understanding.

Loading preview...

BeastCLI: Fine-tuned Gemma 4 E4B

BeastCLI is a 7.9 billion parameter language model, specifically a fine-tuned version of the Gemma 4 E4B instruction-tuned model. It was developed by sridharkkaruppusamy through Unsloth Studio, focusing on enhancing its capabilities for general conversational and instruction-following tasks.

Key Capabilities

  • Instruction Following: Optimized for understanding and executing user instructions.
  • Quantization Options: Available in multiple quantization formats for flexible deployment:
    • Q4_K_M: Recommended for most users, offering a balance of performance and a smaller footprint (~5GB).
    • BF16-mmproj: A multimodal projector option (~1GB).
  • Context Length: Supports a substantial 32768 token context window, enabling processing of longer inputs and maintaining conversational coherence over extended interactions.

Good For

  • General AI Applications: Suitable for a wide range of tasks requiring a capable instruction-tuned model.
  • Resource-Efficient Deployment: The Q4_K_M quantization makes it accessible for deployment on systems with moderate resources, particularly via Ollama.