hienbm/llama-3.1-8b-mtaste-16bit
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:32kPublished:Mar 21, 2026License:apache-2.0Architecture:Transformer Open Weights Cold
hienbm/llama-3.1-8b-mtaste-16bit is an 8 billion parameter Llama 3.1 based language model developed by hienbm, fine-tuned from unsloth/llama-3.1-8b-instruct-unsloth-bnb-4bit. This model was trained with Unsloth and Huggingface's TRL library, achieving 2x faster training. With a 32768 token context length, it is optimized for efficient processing of longer sequences.
Loading preview...