AI-Sweden-Models/gpt-sw3-20b
TEXT GENERATIONConcurrency Cost:1Model Size:20.9BQuant:FP8Ctx Length:2kPublished:Dec 14, 2022License:otherArchitecture:Transformer0.0K Cold
The GPT-Sw3 20B model by AI Sweden is a 20.9 billion parameter decoder-only transformer language model, pretrained on 320 billion tokens across Swedish, Norwegian, Danish, Icelandic, English, and programming code. Developed in collaboration with RISE and WASP WARA, it is designed for generating coherent text in five languages and four programming languages. This model is primarily intended for research and evaluation of large language model capabilities within the Nordic languages.
Loading preview...
Popular Sampler Settings
Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.
temperature
–
top_p
–
top_k
–
frequency_penalty
–
presence_penalty
–
repetition_penalty
–
min_p
–