athirdpath/Llama-3.1-Instruct_NSFW-pretrained_e1-plus_reddit
TEXT GENERATION
Concurrency Cost: 1 · Model Size: 8B · Quant: FP8 · Ctx Length: 32k · Published: Jul 30, 2024 · License: apache-2.0 · Architecture: Transformer · Open Weights
athirdpath/Llama-3.1-Instruct_NSFW-pretrained_e1-plus_reddit is an 8-billion-parameter Llama-3.1-Instruct model further pretrained for one epoch on a filtered dataset of Reddit dirty stories. The goal is to reduce the repetition and token-overconfidence issues observed in base Llama-3.1 models at the 8B scale. It targets niche use cases that need Llama-3.1's logical capabilities while avoiding these common generative pitfalls.
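For reference, here is a minimal loading sketch using the Hugging Face transformers library. The repo id comes from the title above; the dtype and device settings are illustrative assumptions, not requirements from this model card.

```python
# Minimal loading sketch (assumes the Hugging Face `transformers` library;
# dtype/device choices are illustrative, not prescribed by the model card).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

repo_id = "athirdpath/Llama-3.1-Instruct_NSFW-pretrained_e1-plus_reddit"

tokenizer = AutoTokenizer.from_pretrained(repo_id)
model = AutoModelForCausalLM.from_pretrained(
    repo_id,
    torch_dtype=torch.bfloat16,  # assumption: bf16 for an 8B model on one GPU
    device_map="auto",
)

# Llama-3.1-Instruct models use a chat template; apply it before generating.
messages = [{"role": "user", "content": "Write a short story opening."}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

output = model.generate(inputs, max_new_tokens=256)
print(tokenizer.decode(output[0][inputs.shape[-1]:], skip_special_tokens=True))
```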
Popular Sampler Settings

The top 3 sampler configurations used by Featherless users for this model; each configuration specifies temperature, top_p, top_k, frequency_penalty, presence_penalty, repetition_penalty, and min_p. A usage sketch follows below.
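As a hedged illustration of how these knobs are applied in practice, the sketch below sends them through an OpenAI-compatible chat completions call. The base_url, the extra_body pass-through for the non-standard knobs, and every parameter value are assumptions for illustration; they are not one of the actual top-3 configs, whose values did not survive extraction.

```python
# Hedged sketch: sampler parameters via an OpenAI-compatible endpoint.
# All values are placeholders, NOT the actual Featherless top-3 configs.
from openai import OpenAI

client = OpenAI(
    base_url="https://api.featherless.ai/v1",  # assumption: OpenAI-compatible endpoint
    api_key="YOUR_API_KEY",
)

response = client.chat.completions.create(
    model="athirdpath/Llama-3.1-Instruct_NSFW-pretrained_e1-plus_reddit",
    messages=[{"role": "user", "content": "Write a short story opening."}],
    temperature=0.8,        # placeholder value
    top_p=0.95,             # placeholder value
    frequency_penalty=0.2,  # placeholder value
    presence_penalty=0.2,   # placeholder value
    extra_body={            # non-standard knobs many OpenAI-compatible servers accept
        "top_k": 40,
        "min_p": 0.05,
        "repetition_penalty": 1.1,
    },
)
print(response.choices[0].message.content)
```

Note that top_k, min_p, and repetition_penalty are not part of the standard OpenAI schema, which is why they are passed in extra_body; whether a given server honors them depends on its implementation.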