AI-Sweden-Models/gpt-sw3-20b-instruct
Task: Text generation
Concurrency cost: 1
Model size: 20.9B
Quant: FP8
Context length: 2k
Published: Apr 28, 2023
License: other
Architecture: Transformer
AI-Sweden-Models/gpt-sw3-20b-instruct is a 20.9-billion-parameter decoder-only transformer language model developed by AI Sweden in collaboration with RISE and the WASP WARA for Media and Language. It was pretrained on 320 billion tokens spanning Swedish, Norwegian, Danish, Icelandic, English, and programming code, then fine-tuned on instruction data. The model generates coherent text in the five natural languages and four programming languages, and can carry out a variety of text tasks by following natural-language instructions.
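The model is published on Hugging Face under the same identifier, so it can also be run locally with the transformers library. The sketch below is a minimal example, not a definitive recipe: the dtype and device settings are illustrative assumptions, and the User/Bot turn format in the prompt follows the published model card and should be verified there.

```python
# Minimal local-generation sketch (assumes the transformers and torch packages
# and enough GPU memory for a 20.9B-parameter model; adjust dtype/device as needed).
import torch
from transformers import pipeline

model_id = "AI-Sweden-Models/gpt-sw3-20b-instruct"

generator = pipeline(
    "text-generation",
    model=model_id,
    torch_dtype=torch.bfloat16,  # illustrative choice; the hosted listing uses FP8 quantization
    device_map="auto",
)

# Instruction prompt using the User/Bot turn format described on the model card
# (treat the exact template as an assumption and check it against the card).
prompt = "<|endoftext|><s>\nUser:\nSkriv en kort dikt om hösten.\n<s>\nBot:\n"

output = generator(
    prompt,
    max_new_tokens=128,
    do_sample=True,
    temperature=0.7,
    top_p=0.9,
)
print(output[0]["generated_text"])
```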
Popular Sampler Settings
Top 3 parameter combinations used by Featherless users for this model.
temperature: –
top_p: –
top_k: –
frequency_penalty: –
presence_penalty: –
repetition_penalty: –
min_p: –
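When the model is used through a hosted endpoint, these sampler settings are normally supplied per request. The sketch below assumes an OpenAI-compatible chat completions endpoint; the URL, the environment variable holding the API key, and the acceptance of extra fields such as top_k, repetition_penalty, and min_p are assumptions to check against the provider's API reference.

```python
# Hedged sketch of sending a chat completion request with explicit sampler settings.
# The endpoint URL, the FEATHERLESS_API_KEY variable, and the extra sampler fields
# (top_k, repetition_penalty, min_p) are assumptions; consult the API docs.
import os
import requests

API_URL = "https://api.featherless.ai/v1/chat/completions"  # assumed endpoint
API_KEY = os.environ["FEATHERLESS_API_KEY"]                  # hypothetical env var

payload = {
    "model": "AI-Sweden-Models/gpt-sw3-20b-instruct",
    "messages": [
        {"role": "user", "content": "Sammanfatta GPT-SW3 i en mening."}
    ],
    # Sampler settings corresponding to the fields listed above (example values)
    "temperature": 0.7,
    "top_p": 0.9,
    "top_k": 40,
    "frequency_penalty": 0.0,
    "presence_penalty": 0.0,
    "repetition_penalty": 1.1,
    "min_p": 0.05,
    "max_tokens": 256,
}

response = requests.post(
    API_URL,
    headers={"Authorization": f"Bearer {API_KEY}"},
    json=payload,
    timeout=120,
)
response.raise_for_status()
print(response.json()["choices"][0]["message"]["content"])
```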