Models
Resources
Pricing
Chat
Status
Log in
Sign up
Models
Qwen 3
1b7
JameSand/qwen3-1.7b-base-svd-muon-adam-1e-6-bs128-kl0.0-global_step_160
Hugging Face
Use via API
TEXT GENERATION
Concurrency Cost:
1
Model Size:
2B
Quant:
BF16
Ctx Length:
32k
Published:
Jan 25, 2026
Architecture:
Transformer
Warm
Loading preview...
Full Model Card (README)
Finetunes
1 models