Locutusque/lr-experiment1-7B
Text Generation
Concurrency Cost: 1
Model Size: 7B
Quant: FP8
Ctx Length: 8k
Published: Mar 12, 2024
License: apache-2.0
Architecture: Transformer

Locutusque/lr-experiment1-7B is a 7-billion-parameter Mistral-based language model developed by Locutusque and fine-tuned with QLoRA for 3 epochs on conversational data. It is part of a research series aimed at finding the optimal learning rate for Mistral fine-tuning; this run uses a learning rate of 2e-5 with a cosine scheduler. The model is intended for general conversational tasks and serves as a benchmark for the ongoing learning-rate experiments.
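As a quick way to try the model locally, here is a minimal sketch using the Hugging Face transformers library. The chat template, dtype, and generation settings are illustrative assumptions, not documented properties of this checkpoint.

```python
# Minimal sketch: load Locutusque/lr-experiment1-7B with Hugging Face
# transformers and generate a short reply. Assumes a GPU with enough
# memory for a 7B model and that the repo ships a chat template.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Locutusque/lr-experiment1-7B"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.float16,  # half precision fits a 7B model on a single 24 GB GPU
    device_map="auto",
)

messages = [{"role": "user", "content": "Explain cosine LR schedules in one sentence."}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

output = model.generate(inputs, max_new_tokens=128, do_sample=True, temperature=0.7)
# Decode only the newly generated tokens, skipping the prompt.
print(tokenizer.decode(output[0, inputs.shape[-1]:], skip_special_tokens=True))
```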


Popular Sampler Settings

The three parameter combinations most used by Featherless users for this model tune the sampler settings listed below; a sketch of passing them through an API request follows the list.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p
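
For hosted inference, these settings map onto the fields of an OpenAI-compatible chat completions request. The sketch below assumes the Featherless endpoint URL and assumes the server accepts the non-standard parameters (top_k, repetition_penalty, min_p) via extra_body; verify both against the provider's documentation.

```python
# Hedged sketch: sending the sampler settings above through an
# OpenAI-compatible chat completions request. The base URL and the
# extra_body fields are assumptions about the serving API, not
# documented values.
from openai import OpenAI

client = OpenAI(
    base_url="https://api.featherless.ai/v1",  # assumed endpoint
    api_key="YOUR_API_KEY",
)

response = client.chat.completions.create(
    model="Locutusque/lr-experiment1-7B",
    messages=[{"role": "user", "content": "Hello!"}],
    # Standard OpenAI sampler parameters:
    temperature=0.7,
    top_p=0.9,
    frequency_penalty=0.0,
    presence_penalty=0.0,
    # top_k, repetition_penalty, and min_p are not part of the OpenAI
    # spec; many compatible servers accept them via extra_body:
    extra_body={"top_k": 40, "repetition_penalty": 1.1, "min_p": 0.05},
)
print(response.choices[0].message.content)
```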