elinas/Llama-3-15B-Instruct-ft-v2
TEXT GENERATIONConcurrency Cost:1Model Size:15BQuant:FP8Ctx Length:8kPublished:Jul 4, 2024License:llama3Architecture:Transformer0.0K Warm
elinas/Llama-3-15B-Instruct-ft-v2 is a 15 billion parameter instruction-tuned language model, a QLoRA finetune based on a passthrough merge of Llama-3-15B-Instruct-zeroed. It was finetuned with an 8192 token context length, targeting all LoRA modules for enhanced training. This model is primarily an experimental finetune aimed at stabilizing performance after a complex merge, with future versions focusing on writing, logic, and coherency.
Loading preview...
Popular Sampler Settings
Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.
temperature
top_p
top_k
–
frequency_penalty
–
presence_penalty
–
repetition_penalty
min_p
–