neulab/Qwen3-8B
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:32kPublished:Jan 28, 2026License:apache-2.0Architecture:Transformer Open Weights Cold
neulab/Qwen3-8B is an 8 billion parameter language model based on the Qwen3 architecture, developed by neulab. This model features a 32,768 token context window and is specifically configured with a custom chat template, making it suitable for fine-tuning conversational AI applications. Its primary differentiation lies in this tailored chat template, which streamlines the process of adapting the base Qwen3 model for specific dialogue-based tasks.
Loading preview...