Name: bralynn/pydevmini1 API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: bralynn

Overview

bralynn/pydevmini1 is a 4.0 billion parameter causal language model. It is built with 36 layers and utilizes Grouped-Query Attention (GQA), featuring 32 attention heads for queries and 8 for keys/values. A notable characteristic of this model is its exceptionally long native context length of 262,144 tokens, allowing it to process and generate text based on very extensive inputs.

Key Capabilities

Extensive Context Handling: Processes inputs up to 262,144 tokens, beneficial for long-form content analysis or generation.
Causal Language Modeling: Designed for sequential text generation and understanding.
Efficient Attention Mechanism: Employs Grouped-Query Attention (GQA) for potentially improved inference efficiency compared to traditional multi-head attention.

Recommended Usage

For optimal performance, the developer suggests specific inference parameters:

Temperature: 0.7
Top P: 0.8
Top K: 20
Min P: 0.0

Users can experiment with the model directly via a provided Colab notebook to evaluate its capabilities.

Overview

Overview

Key Capabilities

Recommended Usage

Full Model Card (README)