Qwen3-Swallow-8B-SFT-v0.2 is an 8-billion-parameter instruction-tuned large language model developed by tokyotech-llm, based on the Qwen3 architecture. The model is optimized for bilingual Japanese-English proficiency while maintaining and enhancing performance on mathematical and coding tasks, using Continual Pre-Training (CPT), Supervised Fine-Tuning (SFT), and Reinforcement Learning with Verifiable Rewards (RLVR). Its primary use case is applications requiring strong performance in both Japanese and English, particularly those involving STEM-related reasoning and code generation.
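For reference, below is a minimal inference sketch using Hugging Face `transformers`. It assumes the model is published on the Hub under the repo id `tokyotech-llm/Qwen3-Swallow-8B-SFT-v0.2` and follows the standard Qwen3 chat template; neither is confirmed by this page, so check the model repository before use.

```python
# Minimal usage sketch. Assumptions: the repo id below is correct (unverified)
# and the tokenizer ships a standard Qwen3-style chat template.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "tokyotech-llm/Qwen3-Swallow-8B-SFT-v0.2"  # assumed repo id

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype="auto",   # load in the checkpoint's native precision
    device_map="auto",    # place weights on available GPU(s)/CPU
)

# Bilingual prompt: the model targets both Japanese and English instructions.
messages = [
    {"role": "user", "content": "二次方程式の解の公式を英語と日本語で説明してください。"}
]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

outputs = model.generate(inputs, max_new_tokens=512)
# Decode only the newly generated tokens, skipping the prompt.
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```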