TheBloke/guanaco-7B-HF

Text Generation · Concurrency Cost: 1 · Model Size: 7B · Quant: FP8 · Context Length: 4k · Published: May 25, 2023 · License: other · Architecture: Transformer

TheBloke/guanaco-7B-HF is a 7-billion-parameter language model: a repackaging by TheBloke of Tim Dettmers' Guanaco 7B. The model is provided in fp16 Hugging Face format, produced by merging the LoRA fine-tuning into the base model. It offers a standard 4096-token context length and is suitable for general-purpose language generation and understanding tasks.


Overview

The TheBloke/guanaco-7B-HF model is the fp16 Hugging Face version of Tim Dettmers' Guanaco 7B, a 7-billion-parameter language model. This release by TheBloke ships with the LoRA fine-tuning already merged into the base model, so it is ready for direct use at full half-precision with no adapter-loading step.

Key Characteristics

  • Model Size: 7 billion parameters, offering a balance between performance and computational requirements.
  • Format: Provided in fp16 (half-precision floating point) Hugging Face format, suitable for GPU inference.
  • Origin: Based on Tim Dettmers' Guanaco 7B, a model known for its instruction-following capabilities.
  • Context Length: Supports a standard context window of 4096 tokens.
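As a concrete illustration of the characteristics above, the sketch below shows how an fp16 Hugging Face checkpoint like this one is typically loaded with the `transformers` library. This is a minimal example, not code from the model card: the `### Human: ... ### Assistant:` template reflects the dialogue format the Guanaco models were instruction-tuned on, and the helper names (`format_prompt`, `load_model`) are illustrative.

```python
# Minimal sketch: loading TheBloke/guanaco-7B-HF in fp16 with Hugging Face
# transformers. Loading the full fp16 weights needs roughly 14 GB of GPU
# memory; quantized variants (GPTQ/GGML) suit smaller hardware.

MODEL_ID = "TheBloke/guanaco-7B-HF"


def format_prompt(user_message: str) -> str:
    """Wrap a user message in the Guanaco instruction template.

    Guanaco was fine-tuned on this '### Human / ### Assistant' dialogue
    format, so prompts should follow it for best results.
    """
    return f"### Human: {user_message}\n### Assistant:"


def load_model(model_id: str = MODEL_ID):
    """Load the tokenizer and fp16 weights onto available GPU(s)."""
    # Imports are local so the prompt helper above can be used even
    # where torch/transformers are not installed.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(
        model_id,
        torch_dtype=torch.float16,  # keep weights in the shipped fp16 format
        device_map="auto",          # spread layers across available devices
    )
    return tokenizer, model
```

With the model loaded, generation follows the usual `transformers` pattern: tokenize `format_prompt(...)`, call `model.generate(...)` with a `max_new_tokens` budget inside the 4096-token context window, and decode the result.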

Usage Considerations

This model suits users who need a full-precision (fp16) version of Guanaco 7B for natural language processing tasks, whether as a base for further experimentation or for deployment where half-precision GPU inference is preferred. TheBloke also publishes quantized versions (GPTQ, GGML) of the same model for different hardware and performance needs.