Name: allenai/OLMo-1B-hf API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: allenai

OLMo-1B-hf: An Open Language Model for Scientific Advancement

OLMo-1B-hf is a 1 billion parameter autoregressive Transformer language model developed by the Allen Institute for AI (AI2). It is part of the broader OLMo (Open Language Models) series, which emphasizes transparency and reproducibility in language model research. The model is trained on the extensive Dolma dataset and features a 2048 token context length.

Key Capabilities & Features

Fully Open: OLMo-1B-hf is released with all training code, checkpoints, and logs, enabling researchers to deeply understand and build upon its development.
Transformer Architecture: Utilizes a standard Transformer architecture with 16 layers, a 2048 hidden size, and 16 attention heads.
Performance: Achieves competitive results among 1B-parameter models on various benchmarks, including an average of 62.42 on core tasks, outperforming Pythia 1B and TinyLlama 1.1B.
Hugging Face Compatibility: Provided in a Hugging Face Transformers format for easy integration and use.

Good For

Language Model Research: Ideal for researchers studying language model behavior, training dynamics, and architectural variations due to its complete transparency.
Fine-tuning: Serves as a strong base model for fine-tuning on specific downstream tasks, with intermediate checkpoints available for more granular control.
Resource-Efficient Applications: Its 1 billion parameter size makes it suitable for applications where computational resources are a consideration, offering a balance between performance and efficiency.

Overview

OLMo-1B-hf: An Open Language Model for Scientific Advancement

Key Capabilities & Features

Good For

Full Model Card (README)