weber50432/lora-Meta-Llama-3.1-8B-Instruct is an 8 billion parameter instruction-tuned causal language model, converted to MLX format from Meta's Llama-3.1 architecture. This model offers a 32,768 token context length and is specifically designed for efficient deployment and inference within the MLX framework, making it suitable for applications requiring local execution on Apple Silicon.
No reviews yet. Be the first to review!