kcheung/text_gen_QA_001
kcheung/text_gen_QA_001 is a 13 billion parameter causal language model fine-tuned from NousResearch/Nous-Hermes-Llama2-13b using H2O LLM Studio. This model is designed for text generation tasks, particularly question answering, with a context length of 4096 tokens. It leverages the Llama architecture and is optimized for generating coherent and relevant responses based on provided prompts.
Model Overview
kcheung/text_gen_QA_001 is a 13 billion parameter causal language model built upon the NousResearch/Nous-Hermes-Llama2-13b base model. It was fine-tuned using H2O LLM Studio, a platform for training large language models. The model utilizes a Llama architecture, featuring 40 decoder layers and a 4096-token context window.
Key Capabilities
- Text Generation: Capable of generating human-like text based on input prompts.
- Question Answering: Specifically fine-tuned for question-answering tasks, and prompted with a dedicated question/answer template.
- Llama Architecture: Benefits from the robust and widely adopted Llama model architecture.
- H2O LLM Studio Training: Fine-tuned with H2O.ai's open-source framework for training and fine-tuning large language models.
Usage and Implementation
The model is designed for straightforward integration with the `transformers` library. It supports `torch_dtype="auto"` and `device_map` for efficient GPU utilization. Users can employ a `pipeline` for quick inference or construct the model and tokenizer manually for more control, including custom preprocessing steps to match the model's training format (`<|prompt|>...</s><|answer|>`).
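The manual approach above can be sketched as follows. This is a minimal example using the standard `transformers` API; the prompt template matches the format described above, while the example question and the generation parameters (`max_new_tokens`, greedy decoding) are illustrative choices, not values prescribed by the model card.

```python
def format_prompt(question: str) -> str:
    # Wrap the user's question in the template the model was fine-tuned on:
    # <|prompt|>...</s><|answer|>
    return f"<|prompt|>{question}</s><|answer|>"


if __name__ == "__main__":
    # Loading a 13B-parameter model downloads tens of GB of weights;
    # run this part only on a machine with sufficient GPU memory.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    model_id = "kcheung/text_gen_QA_001"
    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(
        model_id,
        torch_dtype="auto",  # pick fp16/bf16 automatically where supported
        device_map="auto",   # place layers across available devices
    )

    inputs = tokenizer(
        format_prompt("What is H2O LLM Studio?"),
        return_tensors="pt",
    ).to(model.device)

    outputs = model.generate(**inputs, max_new_tokens=256, do_sample=False)
    # Decode only the newly generated tokens, skipping the prompt.
    answer = tokenizer.decode(
        outputs[0][inputs["input_ids"].shape[1]:],
        skip_special_tokens=True,
    )
    print(answer)
```

The model loading is kept under the `__main__` guard so the prompt-formatting helper can be reused (for batching or custom preprocessing) without triggering the weight download.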
Good For
- General Text Generation: Suitable for various text generation needs where a 13B parameter model is appropriate.
- Question Answering Systems: Ideal for applications requiring direct and coherent answers to user queries.
- Research and Development: Provides a fine-tuned Llama-based model for further experimentation or integration into larger systems.