Name: amd/AMD-OLMo-1B-SFT API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: amd

AMD-OLMo-1B-SFT: Instruction-Tuned Language Model by AMD

AMD-OLMo-1B-SFT is a 1.2 billion parameter language model developed by AMD, building upon the fully open-source OLMo-1B architecture. This specific version is supervised fine-tuned (SFT) through a two-phase process, first on the Tulu V2 dataset, followed by a mixture of OpenHermes-2.5, WebInstructSub, and Code-Feedback datasets.

Key Capabilities

Instruction Following: Excels in general instruction-following tasks due to its comprehensive SFT process.
Competitive Performance: Achieves strong results in instruction tuning benchmarks, including an average score of 51.60 across standard benchmarks and 4.35 on MTBench, outperforming several comparable 1B-class models.
Hardware Optimized: Trained from scratch on AMD Instinct™ MI250 GPUs, showcasing AMD's capabilities in large language model development.

Good For

General-purpose instruction following: Ideal for applications requiring a model to understand and execute diverse instructions.
Research and Development: Provides a robust base for further experimentation and fine-tuning, especially for those working with AMD hardware.
Benchmarking: Useful for comparing performance against other 1B parameter models in instruction-tuned scenarios.

Overview

AMD-OLMo-1B-SFT: Instruction-Tuned Language Model by AMD

Key Capabilities

Good For

Full Model Card (README)