Name: MaziyarPanahi/calme-3.3-instruct-3b API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: MaziyarPanahi

Model Overview

MaziyarPanahi/calme-3.3-instruct-3b is a 3.1 billion parameter instruction-tuned language model, building upon the Qwen/Qwen2.5-3B architecture. Developed by MaziyarPanahi, this iteration focuses on enhancing generic domain capabilities through fine-tuning.

Key Characteristics

Base Model: Fine-tuned from Qwen/Qwen2.5-3B.
Parameter Count: 3.1 billion parameters, offering a compact yet capable model size.
Context Length: Supports a context window of 32768 tokens.
Instruction Following: Utilizes the ChatML prompt template for structured instruction-following.
Quantized Versions: GGUF quantized models are available for efficient deployment.

Performance Insights

Evaluations on the Open LLM Leaderboard indicate an average score of 21.55. Specific metrics include 64.23 on IFEval (0-Shot) and 25.68 on BBH (3-Shot). It's noted as a relatively small model, which may impact performance on complex prompts and make it sensitive to hyperparameters.

Use Cases

This model is suitable for a range of general-purpose text generation and conversational AI tasks where a smaller, efficient model is preferred. Users should consider its size and evaluate performance for specific applications, especially those requiring high accuracy in complex reasoning or mathematical domains.

Overview

Model Overview

Key Characteristics

Performance Insights

Use Cases

Full Model Card (README)