Models
Resources
Pricing
Chat
Status
Log in
Sign up
Models
Qwen 2
1b5
shengjia-toronto/DeepScaleR-1.5B-16k-GAPO-GSPO-NoKL-Step175-AIME24-40pct
Hugging Face
Use via API
TEXT GENERATION
Concurrency Cost:
1
Model Size:
1.5B
Quant:
BF16
Ctx Length:
32k
Published:
May 23, 2026
Architecture:
Transformer
Warm
Loading preview...
Full Model Card (README)
Finetunes
1 models