Locutusque/lr-experiment1-7B
Text Generation
Concurrency Cost: 1
Model Size: 7B
Quant: FP8
Ctx Length: 8k
Published: Mar 12, 2024
License: apache-2.0
Architecture: Transformer
0.0K | Open Weights | Cold
Locutusque/lr-experiment1-7B is a 7-billion-parameter, Mistral-based language model developed by Locutusque and fine-tuned with QLoRA for 3 epochs on conversational data. It belongs to a research series that aims to determine optimal learning rates for Mistral fine-tuning; this run uses a learning rate of 2e-5 with a cosine scheduler. The model is designed for general conversational tasks and serves as a benchmark for the ongoing learning rate experiments.
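To make the "2e-5 learning rate with a cosine scheduler" setting concrete, here is a minimal sketch of a cosine-annealed schedule. It is not the author's training code: the function name `cosine_lr`, the absence of warmup, the decay to a floor of 0.0, and the total of 1000 steps are all assumptions chosen for illustration.

```python
import math


def cosine_lr(step: int, total_steps: int,
              peak_lr: float = 2e-5, min_lr: float = 0.0) -> float:
    """Learning rate at `step` under cosine annealing (assumed: no warmup)."""
    if total_steps <= 0:
        raise ValueError("total_steps must be positive")
    # Clamp progress to [0, 1] so out-of-range steps stay well defined.
    progress = min(max(step, 0), total_steps) / total_steps
    # Half a cosine period: starts at peak_lr, ends at min_lr.
    return min_lr + 0.5 * (peak_lr - min_lr) * (1.0 + math.cos(math.pi * progress))


if __name__ == "__main__":
    total = 1000  # hypothetical number of optimizer steps
    for s in (0, 250, 500, 750, 1000):
        print(s, f"{cosine_lr(s, total):.2e}")
```

The rate starts at the peak value, passes through half the peak at the midpoint, and reaches the floor at the final step. Training frameworks usually combine such a schedule with a warmup phase, which this sketch leaves out.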
Popular Sampler Settings
Featherless reports the top 3 parameter combinations its users apply to this model. Each combination sets the following sampler parameters:
temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p
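As a rough illustration of what several of these parameters do, the sketch below applies `temperature`, `top_k`, `top_p`, and `min_p` to a toy set of logits. It follows the commonly used definitions of these samplers and is not Featherless's implementation. The helper names `softmax` and `filter_probs` are hypothetical, the three penalty parameters are omitted, and each filter is evaluated on the same unfiltered distribution for simplicity, whereas real inference engines typically apply them in sequence.

```python
import math


def softmax(logits, temperature=1.0):
    """Turn logits into probabilities; lower temperature sharpens the result."""
    scaled = [x / temperature for x in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def filter_probs(probs, top_k=0, top_p=1.0, min_p=0.0):
    """Zero out tokens excluded by top_k / top_p / min_p, then renormalize."""
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    keep = set(order)
    if top_k > 0:
        # Keep only the k most probable tokens.
        keep &= set(order[:top_k])
    if top_p < 1.0:
        # Keep the smallest prefix whose cumulative probability reaches top_p.
        cumulative, nucleus = 0.0, set()
        for i in order:
            nucleus.add(i)
            cumulative += probs[i]
            if cumulative >= top_p:
                break
        keep &= nucleus
    if min_p > 0.0:
        # Keep tokens at least min_p times as likely as the top token.
        threshold = min_p * probs[order[0]]
        keep &= {i for i in order if probs[i] >= threshold}
    total = sum(probs[i] for i in keep)
    return [probs[i] / total if i in keep else 0.0 for i in range(len(probs))]


if __name__ == "__main__":
    toy_logits = [2.0, 1.0, 0.0, -1.0]  # hypothetical 4-token vocabulary
    probs = softmax(toy_logits, temperature=0.7)
    print(filter_probs(probs, top_k=3, top_p=0.9, min_p=0.05))
```

The most probable token survives every filter, so the renormalization never divides by zero. A sampler would then draw the next token from the returned distribution.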