hello7687/qwen-mina-merged-16bit
hello7687/qwen-mina-merged-16bit is a 7.6-billion-parameter Qwen2-based causal language model developed by hello7687. It was finetuned with Unsloth and Hugging Face's TRL library for faster training, and it targets general language generation tasks using the Qwen2 architecture.
Model Overview
hello7687/qwen-mina-merged-16bit is a 7.6-billion-parameter language model based on the Qwen2 architecture. It was developed by hello7687 and finetuned from unsloth/qwen2.5-7b-instruct-unsloth-bnb-4bit, with the adapter weights merged back into a 16-bit checkpoint.
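Since the weights are merged into a standard 16-bit checkpoint, the model should load like any other Qwen2 model via the `transformers` library. The sketch below is a minimal, hypothetical usage example (not taken from this card); it assumes the model is available on the Hugging Face Hub under this ID and that a recent `transformers` release with Qwen2 support is installed.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# Model ID from this card; weights are merged 16-bit, so no PEFT adapter is needed.
MODEL_ID = "hello7687/qwen-mina-merged-16bit"

def load_model():
    """Download and load the tokenizer and model from the Hugging Face Hub."""
    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(
        MODEL_ID,
        torch_dtype="auto",   # keep the checkpoint's native 16-bit dtype
        device_map="auto",    # place layers on available GPU(s)
    )
    return tokenizer, model

if __name__ == "__main__":
    tokenizer, model = load_model()
    # Qwen2 instruct checkpoints ship a chat template; apply it for prompting.
    messages = [{"role": "user", "content": "Explain what a causal language model is."}]
    inputs = tokenizer.apply_chat_template(
        messages, add_generation_prompt=True, return_tensors="pt"
    ).to(model.device)
    output = model.generate(inputs, max_new_tokens=128)
    print(tokenizer.decode(output[0][inputs.shape[-1]:], skip_special_tokens=True))
```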
Key Characteristics
- Architecture: Qwen2-based causal language model.
- Parameter Count: 7.6 billion parameters.
- Training Efficiency: Finetuned using Unsloth and Hugging Face's TRL library, which enabled 2x faster training.
- Context Length: Supports a context length of 32768 tokens.
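The 32768-token context window bounds the combined length of the prompt and the generated completion. A tiny hypothetical helper (not part of the model's API) makes the budgeting explicit:

```python
# Context window stated on this card: 32768 tokens total.
MAX_CONTEXT_TOKENS = 32768

def fits_in_context(prompt_tokens: int, max_new_tokens: int,
                    context_limit: int = MAX_CONTEXT_TOKENS) -> bool:
    """Return True if the prompt plus the requested completion
    stays within the model's context window."""
    return prompt_tokens + max_new_tokens <= context_limit

# A 30k-token prompt leaves room for a 2k-token completion...
print(fits_in_context(30000, 2000))   # → True
# ...but a 31k-token prompt with the same request overflows the window.
print(fits_in_context(31000, 2000))   # → False
```

In practice you would measure `prompt_tokens` with the model's own tokenizer (e.g. `len(tokenizer(text).input_ids)`) rather than estimating it.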
Use Cases
This model is suitable for a variety of general language generation and understanding tasks, building on its Qwen2 foundation and optimized finetuning. Its efficient training methodology makes it a good fit for workflows that require rapid iteration on, or deployment of, finetuned models.