max-ed/podcast-llama-qlora
Text Generation
Concurrency Cost: 1 | Model Size: 8B | Quant: FP8 | Ctx Length: 8k | Published: Apr 11, 2026 | License: apache-2.0 | Architecture: Transformer | Open Weights | Cold
max-ed/podcast-llama-qlora is an 8-billion-parameter Llama-3 model developed by max-ed and fine-tuned with QLoRA, using Unsloth to accelerate training. It is fine-tuned for specific tasks and supports a context length of 8192 tokens. It is intended for applications that need a compact yet capable language model that is efficient to deploy.
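As a rough guide to what the listed figures mean for deployment, the sketch below estimates the memory taken by the weights alone and the generation budget left after a prompt. It assumes a nominal 8,000,000,000 parameters (the listing only says "8B"), one byte per weight for FP8, and that prompt and generated tokens share the 8192-token window. The function names are illustrative and not part of any library, and the estimate ignores the KV cache, activations, and runtime overhead.

```python
# Rough sizing sketch based on the listing: 8B parameters, FP8, 8k context.
PARAM_COUNT = 8_000_000_000   # assumption: nominal "8B"; the exact count is not given
BYTES_PER_PARAM_FP8 = 1       # FP8 stores one byte per weight
CONTEXT_LENGTH = 8192         # tokens; prompt and generated text share this window


def weight_memory_gib(param_count: int = PARAM_COUNT,
                      bytes_per_param: int = BYTES_PER_PARAM_FP8) -> float:
    """Approximate memory for the weights alone, in GiB (no KV cache or activations)."""
    return param_count * bytes_per_param / 1024**3


def max_new_tokens(prompt_tokens: int, context_length: int = CONTEXT_LENGTH) -> int:
    """Tokens left for generation once the prompt is counted against the window."""
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    return max(context_length - prompt_tokens, 0)


if __name__ == "__main__":
    print(f"weights: ~{weight_memory_gib():.2f} GiB")
    print(f"room after a 6000-token prompt: {max_new_tokens(6000)} tokens")
```

Actual memory use will be higher than the weight-only figure, so treat it as a lower bound when choosing hardware.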