ZhangShenao/SELM-Llama-3-8B-Instruct-iter-3
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:8kLicense:mitArchitecture:Transformer0.0K Open Weights Cold
SELM-Llama-3-8B-Instruct-iter-3 is an 8 billion parameter Llama3-instruct-based Self-Exploring Language Model (SELM) developed by ZhangShenao. This model is the third iteration, fine-tuned using synthetic data derived from the HuggingFaceH4/ultrafeedback_binarized dataset. It demonstrates improved performance on benchmarks like MT-Bench compared to its predecessors and the base Llama-3-8B-Instruct model, making it suitable for instruction-following tasks.
Loading preview...
Popular Sampler Settings
Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.
temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p