UWNSL/Llama3.1-3B-Instruct_Mix-Long
TEXT GENERATIONConcurrency Cost:1Model Size:3.2BQuant:BF16Ctx Length:32kPublished:Feb 24, 2025License:otherArchitecture:Transformer Warm

UWNSL/Llama3.1-3B-Instruct_Mix-Long is a 3.2 billion parameter instruction-tuned causal language model, fine-tuned from Meta's Llama-3.2-3B-Instruct. This model features an extended context length of 32768 tokens, making it suitable for tasks requiring processing of longer inputs. It is optimized for general instruction-following tasks, leveraging its fine-tuning on the Mix-Long_long_0.2_short_0.8 dataset.

Loading preview...