DiscoResearch/Llama3-German-8B-32k
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:8kPublished:May 24, 2024License:llama3Architecture:Transformer0.0K Warm

DiscoResearch/Llama3-German-8B-32k is an 8 billion parameter Llama3-based large language model, continuously pre-trained on 65 billion high-quality German tokens. Developed by DiscoResearch and Occiglot, this model specializes in the German language, offering improved linguistic capabilities and general reasoning compared to its base Llama3 counterpart. It features an extended context length of 8192 tokens, making it suitable for applications requiring deep understanding of German text.

Loading preview...

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p