Models
Resources
Pricing
Chat
Status
Log in
Sign up
Models
Qwen 3
4b
Johnny1024/bs16-k20-lr5e-7-ema0-eopd0.8-qwen3-4b-think-mmlu_pro_train10k_bottom20-s150
TEXT GENERATION
Concurrency Cost:
1
Model Size:
4B
Quant:
BF16
Ctx Length:
32k
Published:
Apr 28, 2026
Architecture:
Transformer
Cold
Loading preview...
Full Model Card (README)
Finetunes
1 models