Ejafa/llama_13B
Text generation · Model size: 13B · Quantization: FP8 · Context length: 4k · License: other · Architecture: Transformer · Concurrency cost: 1

Ejafa/llama_13B is a 13-billion-parameter auto-regressive language model developed by the FAIR team at Meta AI, based on the transformer architecture with a 4096-token context length. This version specifically resolves EOS token issues. It is intended primarily for research into large language models, including applications such as question answering and natural language understanding, and it performs strongly on common sense reasoning benchmarks.


Model Overview

Ejafa/llama_13B is a 13-billion-parameter auto-regressive language model developed by Meta AI's FAIR team, part of the LLaMA family of models. Trained between December 2022 and February 2023, this version specifically addresses EOS token issues. It is built on the transformer architecture and was trained on a diverse dataset including CCNet, C4, GitHub, Wikipedia, Books, ArXiv, and Stack Exchange, predominantly in English.

Key Capabilities

  • Research Foundation: Primarily intended for research in large language models, focusing on understanding capabilities, limitations, and developing improvements.
  • Common Sense Reasoning: Demonstrates strong performance on various common sense reasoning benchmarks such as BoolQ, PIQA, SIQA, HellaSwag, and WinoGrande.
  • Multilingual Data: While predominantly English, the training data included 20 languages, suggesting some multilingual understanding.
  • Bias Evaluation: Evaluated for biases across gender, religion, race, sexual orientation, age, nationality, disability, physical appearance, and socio-economic status.
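Benchmarks such as PIQA and HellaSwag are typically evaluated zero-shot by log-likelihood: each candidate completion is scored by the summed log-probability the model assigns to its tokens, and the highest-scoring candidate is taken as the prediction. A minimal sketch of that scoring loop follows; the function names are illustrative assumptions, and `model`/`tokenizer` stand for any `transformers`-style causal LM and its tokenizer, not an API specific to this model card.

```python
# Sketch: zero-shot multiple-choice scoring by log-likelihood, as used
# for common sense benchmarks like PIQA or HellaSwag. The helper names
# are assumptions for illustration.

def choice_logprob(model, tokenizer, context: str, choice: str) -> float:
    """Summed log-probability of the choice tokens given the context."""
    # Imported here so the pure helper below stays dependency-free.
    import torch

    ctx_ids = tokenizer(context, return_tensors="pt").input_ids
    full_ids = tokenizer(context + choice, return_tensors="pt").input_ids
    with torch.no_grad():
        logits = model(full_ids).logits
    # Row i of this tensor holds log-probs for predicting token i + 1.
    logprobs = torch.log_softmax(logits[0, :-1], dim=-1)
    n_ctx = ctx_ids.shape[1]
    choice_ids = full_ids[0, n_ctx:]
    # Gather the log-prob the model assigned to each actual choice token.
    token_lps = logprobs[n_ctx - 1:, :].gather(1, choice_ids.unsqueeze(1))
    return token_lps.sum().item()

def pick_choice(scores: dict[str, float]) -> str:
    """Return the candidate completion with the highest score."""
    return max(scores, key=scores.get)
```

One caveat with this approach: tokenizing `context + choice` as a single string can merge tokens across the boundary, so careful harnesses tokenize the pieces separately and concatenate the ids.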

Intended Use Cases

  • Exploring Applications: Suitable for exploring potential applications like question answering, natural language understanding, and reading comprehension.
  • Model Analysis: Ideal for researchers studying the capabilities and limitations of current language models.
  • Bias and Toxicity Research: Useful for evaluating and mitigating biases, risks, toxic content generation, and hallucinations in LLMs.
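A question-answering exploration with this checkpoint can be sketched with the Hugging Face `transformers` API. This is a minimal sketch under stated assumptions: the helper names, prompt format, and generation settings are illustrative and not part of the model card, and actually calling `generate_answer` downloads the 13B weights and requires substantial GPU memory.

```python
# Sketch: asking Ejafa/llama_13B a question via Hugging Face transformers.
# Helper names, prompt format, and generation settings are illustrative
# assumptions, not part of the model card.

def build_prompt(question: str) -> str:
    # LLaMA is a base (non-instruction-tuned) model, so a plain-text
    # continuation prompt works better than a chat-style template.
    return f"Question: {question}\nAnswer:"

def generate_answer(question: str, model_id: str = "Ejafa/llama_13B",
                    max_new_tokens: int = 32) -> str:
    # Imported here so the pure prompt helper stays dependency-free.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(
        model_id, torch_dtype=torch.float16, device_map="auto"
    )
    inputs = tokenizer(build_prompt(question), return_tensors="pt").to(model.device)
    # Greedy decoding keeps the sketch deterministic.
    output = model.generate(**inputs, max_new_tokens=max_new_tokens,
                            do_sample=False)
    return tokenizer.decode(output[0], skip_special_tokens=True)
```

Because this is a base model, expect a raw continuation (which may run past the answer) rather than a tidy chat reply; truncating at the first newline after "Answer:" is a common post-processing step.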

Limitations

As a base model, LLaMA has not been trained with human feedback and may generate toxic, offensive, or incorrect information. It is not recommended for downstream applications without further risk evaluation and mitigation.