hemanth-kj/futurewei-test-1
The hemanth-kj/futurewei-test-1 model is a 13 billion parameter causal language model based on the openlm-research/open_llama_13b architecture, fine-tuned using H2O LLM Studio. This model is designed for general text generation tasks, leveraging its Llama-based structure for efficient inference. It is suitable for applications requiring text completion and conversational AI, with a context length of 4096 tokens.
Model Overview
hemanth-kj/futurewei-test-1 is a 13 billion parameter causal language model, fine-tuned from the openlm-research/open_llama_13b base model. The training process was conducted using H2O LLM Studio, a platform for developing large language models.
Key Capabilities
- Text Generation: Capable of generating coherent and contextually relevant text based on provided prompts.
- Llama Architecture: Built upon the Llama model architecture, known for its performance in various NLP tasks.
- Customizable Inference: Supports flexible inference parameters such as `min_new_tokens`, `max_new_tokens`, `temperature`, and `repetition_penalty`.
- Quantization Support: Can be loaded with 8-bit or 4-bit quantization (`load_in_8bit=True` or `load_in_4bit=True`) for a reduced memory footprint and potentially faster inference.
- GPU Sharding: Supports sharding across multiple GPUs by setting `device_map="auto"`.
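The capabilities above can be sketched with the standard Hugging Face `transformers` API. This is a minimal, illustrative example, not an official recipe from the model card: the generation parameter values are arbitrary choices, and it assumes `transformers`, `accelerate`, `bitsandbytes`, and a GPU with enough memory for a quantized 13B model are available. The heavy work is kept inside a function so nothing downloads at import time.

```python
# Illustrative sketch of loading and querying the model with transformers.
# Parameter values (temperature, max_new_tokens, etc.) are examples only.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL_NAME = "hemanth-kj/futurewei-test-1"


def generate_answer(question: str) -> str:
    """Load the model (8-bit, sharded across GPUs) and answer a question."""
    tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
    model = AutoModelForCausalLM.from_pretrained(
        MODEL_NAME,
        torch_dtype=torch.float16,
        device_map="auto",   # shard layers across available GPUs
        load_in_8bit=True,   # or load_in_4bit=True for 4-bit quantization
    )

    # The model was trained with the <|prompt|>...</s><|answer|> template.
    prompt = f"<|prompt|>{question}</s><|answer|>"
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)

    outputs = model.generate(
        **inputs,
        min_new_tokens=2,
        max_new_tokens=256,
        temperature=0.3,
        repetition_penalty=1.2,
        do_sample=True,
    )
    return tokenizer.decode(outputs[0], skip_special_tokens=True)


if __name__ == "__main__":
    print(generate_answer("Why is the sky blue?"))
```

Swapping `load_in_8bit=True` for `load_in_4bit=True` roughly halves memory again at some cost in output quality; omitting both loads full fp16 weights.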
Usage Considerations
This model is designed for general text generation. Users should be aware of the standard limitations of large language models, including potential biases and the possibility of inaccurate or inappropriate output. The model requires a specific prompt format (`<|prompt|>...</s><|answer|>`) for optimal performance, as it was trained with this structure.
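Since the prompt template matters for output quality, it can help to centralize it in a small helper rather than building strings ad hoc. The function name below is our own illustration, not part of the model's tooling; only the template itself comes from the model card.

```python
# Hypothetical helper that wraps user text in the model's expected
# <|prompt|>...</s><|answer|> training template.
def format_prompt(user_text: str) -> str:
    """Wrap user input in the prompt format this model was trained with."""
    return f"<|prompt|>{user_text}</s><|answer|>"


print(format_prompt("What is quantization?"))
# <|prompt|>What is quantization?</s><|answer|>
```

The model then generates the answer after the `<|answer|>` marker, so decoded output is typically split on that token to recover just the response.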