m-a-p/OpenLLaMA-Reproduce-1509.95B

Text Generation · Concurrency Cost: 1 · Model Size: 7B · Quant: FP8 · Ctx Length: 4k · Published: Apr 1, 2024 · Architecture: Transformer

OpenLLaMA-Reproduce-1509.95B is a 7 billion parameter language model developed by m-a-p, designed for high-quality, contextually relevant text generation. It was trained on a diverse composite dataset spanning web data, scholarly articles, and literature, giving it broad domain coverage. Its training procedure follows Llama2's learning rate scheduling, promoting stable, efficient convergence across a range of text generation tasks.


OpenLLaMA 7Bv2 Model Overview

OpenLLaMA 7Bv2, developed by m-a-p, is a 7 billion parameter language model focused on generating high-quality, contextually relevant text. It is distinguished by its training on a comprehensive and diverse composite dataset, which includes web-crawled data, scholarly articles, and extensive literature, ensuring broad applicability across various domains.

Key Capabilities & Training

  • Diverse Knowledge Base: Trained on a rich mixture comprising the Falcon RefinedWeb corpus, the StarCoder dataset, and Wikipedia, arXiv academic papers, a large collection of books, and Stack Exchange data curated in RedPajama. This broad data exposure enables the model to handle a wide array of topics and question-answer formats.
  • Optimized Training: Training used a maximum learning rate of 3e-4, a minimum learning rate of 3e-5, and a batch size of 4 million tokens. The learning rate schedule closely mirrors Llama2's (linear warmup followed by cosine decay to 10% of the peak rate); see the sketch after this list.
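
For concreteness, here is a minimal sketch of such a Llama2-style schedule using the numbers above: linear warmup to the 3e-4 peak, then cosine decay to the 3e-5 floor (10% of peak). The warmup length is an assumption; Llama2 reports 2,000 warmup steps, and the card does not state what this model used.

```python
import math

def llama2_style_lr(step: int, max_steps: int,
                    max_lr: float = 3e-4,     # peak LR from the card
                    min_lr: float = 3e-5,     # floor LR from the card (10% of peak)
                    warmup_steps: int = 2000  # assumed; Llama2's reported value
                    ) -> float:
    """Linear warmup followed by cosine decay, as in Llama2's recipe."""
    if step < warmup_steps:
        # Ramp linearly from 0 up to max_lr over the warmup window.
        return max_lr * step / warmup_steps
    # Cosine decay from max_lr down to min_lr over the remaining steps.
    progress = (step - warmup_steps) / max(1, max_steps - warmup_steps)
    cosine = 0.5 * (1.0 + math.cos(math.pi * min(1.0, progress)))
    return min_lr + (max_lr - min_lr) * cosine
```

With a 4-million-token batch, `max_steps` is simply the total number of training tokens divided by 4M.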

Good For

  • Generating contextually relevant text across diverse topics.
  • Applications requiring broad domain knowledge, from encyclopedic facts to scientific understanding.
  • Tasks benefiting from a model trained with a Llama2-like optimization strategy.
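
For reference, a minimal usage sketch via Hugging Face transformers is below. The repo id simply mirrors the model's listed name and is an assumption, as are the dtype and sampling settings.

```python
# Hedged sketch: assumes the model is hosted under its listed name and
# loads with the standard transformers causal-LM classes.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "m-a-p/OpenLLaMA-Reproduce-1509.95B"  # assumed hub path

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.float16,  # half precision keeps a 7B model on one 24 GB GPU
    device_map="auto",
)

prompt = "The arXiv preprint server is primarily used for"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=64, do_sample=True, temperature=0.7)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```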