bingbangboom/holmes
Text Generation | Concurrency Cost: 1 | Model Size: 0.8B | Quant: BF16 | Ctx Length: 32k | Published: Mar 25, 2026 | Architecture: Transformer | Warm

bingbangboom/holmes is a 0.8-billion-parameter language model, fine-tuned and converted to GGUF format using Unsloth. It is intended for use with tools such as llama.cpp and Ollama. Its small parameter count and GGUF format make it suitable for local inference on resource-constrained devices.
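To put "resource-constrained" in rough numbers, the sketch below estimates the size of the weights alone from the listed figures (0.8B parameters, BF16 at 2 bytes per parameter). The function name and the 8-bit and 4-bit rows are illustrative assumptions, not published variants of this model. The lower-precision figures also ignore the per-block scale overhead that real quantized formats carry, as well as KV cache and runtime memory.

```python
# Rough weight-only memory estimate. BF16 is exactly 2 bytes per parameter;
# the 8-bit and 4-bit entries are idealized assumptions with no block overhead.
BYTES_PER_PARAM = {"bf16": 2.0, "int8_ideal": 1.0, "int4_ideal": 0.5}


def weight_footprint_gb(num_params: float, fmt: str) -> float:
    """Approximate size of the weights alone, in decimal gigabytes (1e9 bytes)."""
    return num_params * BYTES_PER_PARAM[fmt] / 1e9


if __name__ == "__main__":
    params = 0.8e9  # 0.8B parameters, as listed for this model
    for fmt in BYTES_PER_PARAM:
        print(f"{fmt}: ~{weight_footprint_gb(params, fmt):.1f} GB")
```

At BF16 this comes to about 1.6 GB for the weights, before any context-dependent memory is added.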
