unsloth/llama-2-7b
TEXT GENERATIONConcurrency Cost:1Model Size:7BQuant:FP8Ctx Length:4kPublished:Nov 29, 2023License:apache-2.0Architecture:Transformer0.0K Open Weights Warm

The unsloth/llama-2-7b model is a 7 billion parameter Llama 2 architecture, specifically a directly quantized 4-bit version optimized by Unsloth. It is designed for efficient fine-tuning, offering significantly faster training times and reduced memory consumption compared to standard methods. This model is particularly suited for developers looking to quickly and cost-effectively fine-tune Llama 2 on consumer-grade hardware for various natural language processing tasks.

Loading preview...

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p