Name: Azure99/blossom-v4-qwen1_5-4b API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: Azure99

Blossom-v4-qwen1_5-4b Overview

Blossom-v4-qwen1_5-4b is a 4 billion parameter conversational language model developed by Azure99, built upon the Qwen1.5-4B pre-trained model. It has been instruction-tuned using a proprietary Blossom Orca/Wizard/Chat/Math hybrid dataset, designed to enhance its general capabilities and contextual understanding. The model leverages high-quality Chinese and English datasets, which are also open-sourced.

Key Capabilities

Conversational AI: Optimized for engaging in both single-turn and multi-turn dialogues.
Context Understanding: Demonstrates strong ability to comprehend and maintain context across conversations.
General Purpose: Possesses robust general capabilities, making it suitable for a variety of conversational tasks.
Multilingual Support: Trained on high-quality Chinese and English datasets.

Training Methodology

The model underwent a two-stage instruction tuning process:

Stage One: Trained for one epoch on a dataset comprising 100K Wizard, 100K Orca, and 20K Math single-turn instruction data.
Stage Two: Trained for three epochs using 50K Blossom chat multi-turn conversational data, supplemented with a 2% random sample from the first stage's dataset.

Usage Notes

For inference, the model expects a specific chat format, where |Bot| outputs in historical turns are terminated with <|endoftext|>. This ensures proper multi-turn conversation flow.