chancharikm/sft_caption_generation_20260222_ep3_lr3e5_qwen3-vl-8b_cam_ready

  • Modality: Vision
  • Model Size: 8B
  • Quantization: FP8
  • Context Length: 32k
  • Published: Mar 27, 2026
  • License: apache-2.0
  • Architecture: Transformer (open weights)

The chancharikm/sft_caption_generation_20260222_ep3_lr3e5_qwen3-vl-8b_cam_ready model is an 8-billion-parameter fine-tune of Qwen3-VL-8B-Instruct, developed by chancharikm. This vision-language model was produced by supervised fine-tuning (SFT) for caption generation: it uses the Qwen3-VL architecture to process visual inputs and generate descriptive text, making it suitable for image understanding applications.


Model Overview

This model, chancharikm/sft_caption_generation_20260222_ep3_lr3e5_qwen3-vl-8b_cam_ready, is a specialized fine-tuned version of the Qwen/Qwen3-VL-8B-Instruct base model. It has 8 billion parameters and is designed for vision-language tasks, specifically focusing on caption generation.

Key Characteristics

  • Base Model: Fine-tuned from Qwen/Qwen3-VL-8B-Instruct, indicating strong multimodal capabilities.
  • Task Focus: Optimized through supervised fine-tuning (SFT) for generating descriptive captions from visual inputs.
  • Training Details:
    • Learning Rate: 3e-05
    • Batch Size: 8 (train and eval)
    • Epochs: 3.0
    • Optimizer: adamw_torch_fused
    • Scheduler: cosine with 0.05 warmup ratio
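The reported hyperparameters map directly onto Hugging Face `transformers.TrainingArguments` field names. The sketch below is illustrative, not the author's actual training script; any settings not listed on the card (e.g. `output_dir`) are assumptions.

```python
# SFT hyperparameters from the model card, expressed as TrainingArguments
# keyword arguments (the field names below are the real transformers ones).
SFT_ARGS = {
    "learning_rate": 3e-5,                 # Learning Rate: 3e-05
    "per_device_train_batch_size": 8,      # Batch Size: 8 (train)
    "per_device_eval_batch_size": 8,       # Batch Size: 8 (eval)
    "num_train_epochs": 3.0,               # Epochs: 3.0
    "optim": "adamw_torch_fused",          # Optimizer
    "lr_scheduler_type": "cosine",         # Scheduler
    "warmup_ratio": 0.05,                  # Warmup ratio
}

# In a training script these would be passed straight through, e.g.:
#   from transformers import TrainingArguments
#   args = TrainingArguments(output_dir="out", **SFT_ARGS)  # output_dir assumed
```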

Intended Use Cases

This model is primarily intended for applications requiring the generation of textual descriptions or captions for images. Its fine-tuning process suggests a focus on accuracy and relevance in visual content summarization.
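A minimal captioning sketch with `transformers` is shown below. The class names and chat-message format follow the usual pattern for the Qwen-VL family, but the exact recommended usage (processor class, generation settings, prompt) is an assumption; check the model card on the Hub before relying on it.

```python
# Hedged sketch: single-image captioning with this checkpoint via transformers.
# The image URL, prompt text, and max_new_tokens value are placeholders.
MODEL_ID = "chancharikm/sft_caption_generation_20260222_ep3_lr3e5_qwen3-vl-8b_cam_ready"


def build_caption_messages(image_url: str, prompt: str = "Describe this image.") -> list:
    """Build a chat-format message list for a single-image captioning request."""
    return [
        {
            "role": "user",
            "content": [
                {"type": "image", "image": image_url},
                {"type": "text", "text": prompt},
            ],
        }
    ]


def caption_image(image_url: str) -> str:
    """Load the model and generate a caption for one image (assumed API)."""
    from transformers import AutoModelForImageTextToText, AutoProcessor

    processor = AutoProcessor.from_pretrained(MODEL_ID)
    model = AutoModelForImageTextToText.from_pretrained(MODEL_ID, device_map="auto")

    inputs = processor.apply_chat_template(
        build_caption_messages(image_url),
        add_generation_prompt=True,
        tokenize=True,
        return_dict=True,
        return_tensors="pt",
    ).to(model.device)

    output_ids = model.generate(**inputs, max_new_tokens=128)
    # Drop the prompt tokens so only the newly generated caption is decoded.
    new_tokens = output_ids[:, inputs["input_ids"].shape[1]:]
    return processor.batch_decode(new_tokens, skip_special_tokens=True)[0]


if __name__ == "__main__":
    print(caption_image("https://example.com/photo.jpg"))  # placeholder URL
```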