MetaMath-Cybertron Overview
Q-bert/MetaMath-Cybertron is a 7-billion-parameter language model developed by Q-bert. It was created through a spherical linear interpolation (slerp) merge of two parent models: fblgit/una-cybertron-7b-v2-bf16 and meta-math/MetaMath-Mistral-7B. This merging strategy aims to combine the capabilities of both parents into a single, more versatile model.
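The core of a slerp merge is spherical linear interpolation applied to corresponding weight tensors of the two parent models: instead of averaging weights linearly, it interpolates along the great-circle arc between their directions, which tends to preserve the geometry of each parent's parameters better. A minimal, illustrative sketch on toy weight vectors (the function name and the fallback threshold are this sketch's own choices, not taken from any particular merging tool):

```python
import math

def slerp(t, v0, v1, eps=1e-8):
    """Spherical linear interpolation between two weight vectors.

    t=0 returns v0, t=1 returns v1; intermediate t values walk along
    the great-circle arc between the (normalized) directions of v0 and v1.
    """
    norm0 = math.sqrt(sum(x * x for x in v0))
    norm1 = math.sqrt(sum(x * x for x in v1))
    dot = sum(a * b for a, b in zip(v0, v1)) / (norm0 * norm1)
    dot = max(-1.0, min(1.0, dot))  # clamp for numerical safety
    theta = math.acos(dot)
    if theta < eps:
        # Nearly parallel vectors: fall back to plain linear interpolation
        return [(1 - t) * a + t * b for a, b in zip(v0, v1)]
    s0 = math.sin((1 - t) * theta) / math.sin(theta)
    s1 = math.sin(t * theta) / math.sin(theta)
    return [s0 * a + s1 * b for a, b in zip(v0, v1)]

# Toy example on two orthogonal "weight" vectors:
print(slerp(0.0, [1.0, 0.0], [0.0, 1.0]))  # -> [1.0, 0.0] (pure first parent)
print(slerp(0.5, [1.0, 0.0], [0.0, 1.0]))  # midpoint on the arc
```

In a real merge this interpolation is applied tensor-by-tensor across both checkpoints (typically with tooling such as mergekit) rather than to hand-built lists.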
Key Characteristics
- Architecture: A merged model combining una-cybertron-7b-v2-bf16 and MetaMath-Mistral-7B.
- Parameter Count: 7 billion parameters, offering a balance between performance and computational efficiency.
- Context Length: Supports a context window of 4096 tokens, suitable for various text generation and understanding tasks.
- Instruction Format: Compatible with the ChatML format, facilitating its use in chat-based applications and instruction-following scenarios.
Potential Use Cases
Given its merged nature, MetaMath-Cybertron is likely suitable for a range of general-purpose language tasks, including:
- Text generation and completion.
- Conversational AI and chatbots.
- Basic reasoning and question answering, potentially benefiting from the mathematical capabilities of MetaMath-Mistral-7B.
Detailed performance metrics on the Open LLM Leaderboard are anticipated soon and will provide further insight into the model's specific strengths across benchmarks such as MMLU, HellaSwag, and GSM8K.