Model Overview
The yuzhounie/sft_qwen32b model is a 32.8-billion-parameter language model derived from the Qwen/Qwen2.5-Coder-32B-Instruct base model. It has been fine-tuned on the tb3000_agent_diverse_real dataset, indicating an optimization for agent-based applications and for handling diverse, real-world instructions.
Training Details
The model was trained for 5 epochs with a learning rate of 5e-06. The per-device train_batch_size was 1 with gradient_accumulation_steps of 4; the reported total_train_batch_size of 32 therefore implies training was distributed across 8 devices (1 × 4 × 8 = 32). The optimizer was adamw_torch_fused with a cosine learning-rate schedule and a warmup ratio of 0.1.
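The relationship between these batch-size hyperparameters can be sketched as follows; note that the device count of 8 is inferred from the reported totals, not stated in the training configuration:

```python
# Effective (total) train batch size =
#   per-device batch size x gradient accumulation steps x number of devices.
per_device_batch_size = 1        # train_batch_size from the config
gradient_accumulation_steps = 4  # from the config
num_devices = 8                  # inferred: 32 / (1 * 4); not stated explicitly

total_train_batch_size = (
    per_device_batch_size * gradient_accumulation_steps * num_devices
)
print(total_train_batch_size)  # 32, matching the reported total_train_batch_size
```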
Potential Use Cases
Given its fine-tuning on an "agent-diverse-real" dataset, this model is likely well-suited for:
- Complex Agentic Workflows: Handling multi-step instructions and decision-making in automated agents.
- Diverse Instruction Following: Interpreting and executing a wide range of user prompts in real-world contexts.
- Code-Related Agent Tasks: Leveraging its Coder base for agentic tasks that involve code generation, understanding, or execution.
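A minimal inference sketch using the Hugging Face transformers library is shown below. This is an illustrative sketch, not a verified recipe: it assumes the checkpoint ships a chat template (as its Qwen2.5-Coder-32B-Instruct base does) and that enough GPU memory is available for a 32B model.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "yuzhounie/sft_qwen32b"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype="auto",   # load in the checkpoint's native precision
    device_map="auto",    # shard across available GPUs
)

messages = [
    {"role": "user", "content": "Write a Python function that parses a CSV header."}
]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

outputs = model.generate(inputs, max_new_tokens=256)
# Decode only the newly generated tokens, skipping the prompt.
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```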