AIPlans/Qwen3-0.6B-PPO
TEXT GENERATIONConcurrency Cost:1Model Size:0.8BQuant:BF16Ctx Length:32kPublished:Dec 5, 2025Architecture:Transformer0.0K Warm

AIPlans/Qwen3-0.6B-PPO is an 0.8 billion parameter language model developed by AIPlans, fine-tuned using Proximal Policy Optimization (PPO). This model is based on the Qwen3 architecture and supports a context length of 32768 tokens. Its primary use case is for applications requiring a compact yet capable model with enhanced instruction following through PPO fine-tuning.

Loading preview...