astronomer/Llama-3-8B-Special-Tokens-Adjusted
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:8kPublished:Apr 22, 2024License:llama-3Architecture:Transformer0.0K Warm
The astronomer/Llama-3-8B-Special-Tokens-Adjusted is an 8 billion parameter Llama 3 family model developed by Astronomer, specifically David Xue. This model is a patched version of Meta's Llama-3-8B, with its input and output embedding weights adjusted to resolve issues caused by untrained special tokens. It is optimized for stable fine-tuning, preventing gradient explosions and NaN gradients that can occur with the original Llama 3 base model.
Loading preview...
Popular Sampler Settings
Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.
temperature
top_p
–
top_k
–
frequency_penalty
–
presence_penalty
–
repetition_penalty
–
min_p
–