TheBloke/Llama-2-13B-fp16

Text Generation · Concurrency Cost: 1 · Model Size: 13B · Quant: FP8 · Ctx Length: 4k · Published: Jul 18, 2023 · Architecture: Transformer

TheBloke/Llama-2-13B-fp16 is a 13 billion parameter Llama 2 model developed by Meta, converted to fp16 format for Hugging Face Transformers. This pretrained generative text model, with a 4k context length, is suitable for a variety of natural language generation tasks. It is part of the Llama 2 family, which was trained on 2 trillion tokens of publicly available online data.


Llama 2 13B fp16 by TheBloke

This model is an fp16 conversion of Meta's Llama 2 13B, a 13 billion parameter pretrained generative text model. It was converted using the Hugging Face Transformers library from the original PTH files provided by Meta. The Llama 2 family of models was developed by Meta and trained on 2 trillion tokens of publicly available online data, with a pretraining data cutoff of September 2022.
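A quick back-of-envelope calculation shows why the fp16 conversion matters for deployment: at 2 bytes per parameter, the 13B weights occupy roughly 26 GB, half the footprint of the original fp32 representation (this ignores activations and the KV cache, so real inference needs additional headroom):

```python
PARAMS = 13_000_000_000  # 13 billion parameters
BYTES_FP32 = 4           # bytes per parameter in fp32
BYTES_FP16 = 2           # bytes per parameter in fp16

def weight_gb(params: int, bytes_per_param: int) -> float:
    """Approximate size of the raw weights in gigabytes (1 GB = 1e9 bytes)."""
    return params * bytes_per_param / 1e9

fp32_gb = weight_gb(PARAMS, BYTES_FP32)  # 52.0 GB
fp16_gb = weight_gb(PARAMS, BYTES_FP16)  # 26.0 GB
print(f"fp32: {fp32_gb:.1f} GB, fp16: {fp16_gb:.1f} GB")
```

This is why the fp16 checkpoint fits on a single 40 GB or 80 GB accelerator, while fp32 generally would not.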

Key Capabilities

  • Generative Text: Capable of generating human-like text for various natural language tasks.
  • Optimized Transformer Architecture: Utilizes an auto-regressive language model with an optimized transformer architecture.
  • Context Length: Supports a context length of 4k tokens.
  • Commercial and Research Use: Intended for both commercial and research applications in English.

Good For

  • Natural Language Generation: Adaptable to a wide range of text generation tasks.
  • Further Conversions: Suitable as a base for further model conversions or fine-tuning.
  • GPU Inference: Designed for efficient inference on GPUs in its fp16 format.
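A minimal loading sketch, assuming the standard Hugging Face `transformers` API (`AutoModelForCausalLM`, `AutoTokenizer`, `generate`); the `generation_budget` helper that clamps the completion to the 4k context window is illustrative and not part of the model card. Note that running `main()` downloads roughly 26 GB of fp16 weights and requires a large GPU:

```python
MODEL_ID = "TheBloke/Llama-2-13B-fp16"
MAX_CONTEXT = 4096  # 4k context length, per the model card

def generation_budget(prompt_tokens: int, max_new_tokens: int) -> int:
    """Clamp max_new_tokens so prompt + completion fits the 4k window."""
    return max(0, min(max_new_tokens, MAX_CONTEXT - prompt_tokens))

def main():
    # Deferred imports: torch and transformers are heavy optional dependencies.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(
        MODEL_ID,
        torch_dtype=torch.float16,  # load the weights in fp16 as published
        device_map="auto",          # spread layers across available GPUs
    )
    inputs = tokenizer("Llama 2 is", return_tensors="pt").to(model.device)
    n_prompt = inputs["input_ids"].shape[1]
    out = model.generate(
        **inputs, max_new_tokens=generation_budget(n_prompt, 256)
    )
    print(tokenizer.decode(out[0], skip_special_tokens=True))

# Uncomment to run generation (requires ~26 GB of GPU memory):
# main()
```

Because this is a pretrained base model rather than a chat-tuned one, prompts should be plain continuations, not instruction-style turns.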