Name: Minhhltse150305/qwen3-0.6b-SFTchat_math_dpo2 API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: Minhhltse150305

Model Overview

Minhhltse150305/qwen3-0.6b-SFTchat_math_dpo2 is a Qwen3-based language model with 0.8 billion parameters, developed by Minhhltse150305. It has been specifically fine-tuned for chat and mathematical applications, leveraging Supervised Fine-Tuning (SFT) and Direct Preference Optimization (DPO).

Key Capabilities

Mathematical Task Proficiency: The model is optimized for handling mathematical queries and problems, making it suitable for educational tools or technical support systems.
Conversational AI: Its SFTchat fine-tuning indicates a strong capability in engaging in dialogue and understanding conversational nuances.
Efficient Training: This model was trained significantly faster using the Unsloth library in conjunction with Huggingface's TRL, suggesting potential for rapid iteration and deployment.
Extended Context Window: With a context length of 32768 tokens, it can process and generate longer, more complex interactions and problem descriptions.

Good For

Applications requiring a compact yet capable model for mathematical reasoning.
Chatbots or virtual assistants focused on technical or educational content.
Scenarios where efficient model deployment and inference are critical due to its optimized training.

Overview

Model Overview

Key Capabilities

Good For

Full Model Card (README)