timpal0l/gpt-sw3-126m-instruct
The timpal0l/gpt-sw3-126m-instruct is a 126 million parameter instruction-tuned, decoder-only transformer language model developed by AI Sweden in collaboration with RISE and WASP WARA. It was fine-tuned from a GPT-Sw3 base model that was pretrained on 320 billion tokens of Swedish, Norwegian, Danish, Icelandic, English, and programming code. The model is designed for generating coherent text and performing instruction-based text tasks in the Nordic languages and English.
Model Overview
The timpal0l/gpt-sw3-126m-instruct is a 126 million parameter instruction-tuned model from the GPT-Sw3 family, developed by AI Sweden in collaboration with RISE and WASP WARA. It is a decoder-only transformer pretrained on a 320 billion token dataset covering Swedish, Norwegian, Danish, Icelandic, English, and programming code. The instruction-tuned variant was fine-tuned on both chat-style and raw-text instruction data, including datasets such as Dolly, Open Assistant, OIG, and a Swedish pharmaceutical Q&A dataset (Fass).
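The model can be loaded with the Hugging Face transformers library like any causal language model. The sketch below shows basic text generation; the prompt and generation parameters (max_new_tokens, top_p, temperature) are illustrative choices, not recommended settings from the model card.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Model id as published on the Hugging Face Hub
model_id = "timpal0l/gpt-sw3-126m-instruct"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)
model.eval()

# Swedish prompt: "Trees are nice because"
prompt = "Träd är fina för att"
inputs = tokenizer(prompt, return_tensors="pt")

with torch.no_grad():
    output_ids = model.generate(
        **inputs,
        max_new_tokens=50,
        do_sample=True,
        top_p=0.9,
        temperature=0.7,
    )

print(tokenizer.decode(output_ids[0], skip_special_tokens=True))
```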
Key Capabilities
- Multilingual Text Generation: Capable of generating coherent text in Swedish, Norwegian, Danish, Icelandic, and English.
- Code Generation: Supports code generation in four programming languages.
- Instruction Following: Interprets natural-language instructions to perform a range of text tasks, including tasks it was not explicitly trained on (see the sketch after this list).
- Nordic Language Focus: Specifically trained with a significant portion of Nordic language data, making it suitable for applications requiring strong performance in these languages.
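Because the model is instruction tuned on chat-style data, prompting it with explicit User/Bot turns typically works better than plain completion. The sketch below assumes the turn format used by the GPT-Sw3 instruct family, with turns separated by the <s> token; the exact template is an assumption here and should be verified against the base model card and the tokenizer's special tokens.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "timpal0l/gpt-sw3-126m-instruct"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

# Assumed turn-based prompt template in the style of the GPT-Sw3 instruct
# models; check the model card for the exact format.
# The user question is Swedish for "What is the capital of Sweden?"
prompt = (
    "<|endoftext|><s>\n"
    "User:\n"
    "Vad är huvudstaden i Sverige?\n"
    "<s>\n"
    "Bot:\n"
)

inputs = tokenizer(prompt, return_tensors="pt")
output_ids = model.generate(
    **inputs,
    max_new_tokens=64,
    do_sample=True,
    top_p=0.9,
    temperature=0.6,
)
print(tokenizer.decode(output_ids[0]))
```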
Intended Use Cases
This model is primarily intended for research and evaluation within the Nordic NLP ecosystem. It is suitable for generating text, responding to instructions, and exploring the capabilities of LLMs in Nordic languages. Users are encouraged to provide feedback for validation and testing.