TheBloke/Robin-13B-v2-SuperHOT-8K-fp16
TheBloke/Robin-13B-v2-SuperHOT-8K-fp16 is a 13 billion parameter language model, a merge of OptimalScale's Robin 13B v2 and Kaio Ken's SuperHOT 8K LoRA. This model is specifically engineered to leverage an extended context window of 8192 tokens, significantly enhancing its ability to process and generate longer sequences of text. It is optimized for tasks requiring extensive context understanding and generation, making it suitable for applications that benefit from a broader conversational or document scope.
Overview
This model merges OptimalScale's Robin 13B v2 with Kaio Ken's SuperHOT 8K LoRA and is distributed as fp16 weights. The primary enhancement is an extended context length of 8192 tokens, achieved through the SuperHOT 8K technique, which lets the model maintain coherence and relevance over much longer inputs and outputs than models with a standard context window.
Key Capabilities
- Extended Context Window: Processes and generates text with an 8192-token context length, enabling deeper understanding and more comprehensive responses.
- Merged Architecture: Combines the base capabilities of OptimalScale's Robin 13B v2 with the context extension of Kaio Ken's SuperHOT 8K LoRA.
- FP16 Format: Provided in fp16 pytorch format, suitable for GPU inference and further conversions.
- Flexible Configuration: The `config.json` is set to a sequence length of 8192 by default, but can be adjusted to 4096 if desired.
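As a minimal sketch of the last point, the snippet below edits the sequence-length setting in a downloaded copy of `config.json`. It assumes the relevant key is `max_position_embeddings` and uses a placeholder local path; check the actual file before editing.

```python
import json

# Hypothetical path to the model's downloaded config file
config_path = "config.json"

# Example fragment only; the real config.json contains many more keys.
config = {"max_position_embeddings": 8192}

# Reduce the context window from 8192 to 4096, e.g. to lower VRAM use
config["max_position_embeddings"] = 4096

with open(config_path, "w") as f:
    json.dump(config, f, indent=2)
```

After saving, reload the model so the new sequence length takes effect.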
Good For
- Applications requiring long-form text generation or analysis.
- Conversational AI that needs to maintain context over extended dialogues.
- Tasks where understanding the broader document or conversation history is crucial for accurate responses.
- Developers looking for a 13B model with enhanced context handling for GPU inference.
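For developers in the last category, a loading sketch with Hugging Face `transformers` might look like the following. It assumes `transformers`, `torch`, and `accelerate` are installed and that enough GPU memory is available for a 13B fp16 model; `trust_remote_code=True` is an assumption based on SuperHOT fp16 releases shipping a patched model class in the repo.

```python
MODEL_ID = "TheBloke/Robin-13B-v2-SuperHOT-8K-fp16"

def load_robin_8k():
    """Sketch: load tokenizer and fp16 model for GPU inference."""
    # Local imports keep this file importable without the heavy deps.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(
        MODEL_ID,
        torch_dtype=torch.float16,  # fp16 weights, as shipped
        device_map="auto",          # place layers across available GPUs
        trust_remote_code=True,     # assumed needed for the 8K patch
    )
    return tokenizer, model
```

Calling `load_robin_8k()` downloads the weights on first use; subsequent calls read from the local Hugging Face cache.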