Name: zero9tech/Qwen2.5-Coder-3B-Data-Science-Insight-TR-7.6K API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: zero9tech

Model Overview

Qwen2.5-Coder-3B-Data-Science-Insight-TR-7.6K is a 3.1 billion parameter model developed by Zero9 Tech, specifically engineered for data mining and applied data science decision support.

Key Capabilities

Turkish Language Adaptation: Underwent continued pre-training (CPT) with approximately 10% adaptation using Wikimedia/Wikipedia data (48,148 records) to enhance its understanding and generation capabilities in Turkish.
Domain-Specific Fine-tuning: Specialized through Supervised Fine-Tuning (SFT) on the murataksit34/veri-bilimci-diyalog-8k-tr dataset, focusing on data scientist dialogues.
Decision-Oriented Responses: Optimized to produce answers that aid in decision-making processes, covering aspects such as:
- Method selection
- Comparison of alternatives
- Identification of risk signals
- Validation steps

Training Details

The model's training involved a two-stage process:

Continued Pre-Training (CPT): Focused on adapting to the Turkish language.
Domain Expertise SFT: Utilized a dataset of 7,656 records, split into 6,124 for training and 1,532 for testing, to instill specialized data science knowledge.

Ideal Use Cases

This model is particularly well-suited for applications requiring analytical insights and strategic guidance within data science contexts, where clear, decision-focused outputs are paramount.

Overview

Model Overview

Key Capabilities

Training Details

Ideal Use Cases

Full Model Card (README)