JameSand/qwen3-4b-base-svd-muon-adam-1e-6-adamlr-1e-6-bs128-kl0.0-global_step_40
TEXT GENERATION
Concurrency Cost: 1
Model Size: 4B
Quant: BF16
Ctx Length: 32k
Published: Jan 25, 2026
Architecture: Transformer
Warm
Finetunes: 1 model