JamesSand/qwen1.7b-adam-reset-muon-lr-1e-6-fp64-global_step_200

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:2BQuant:BF16Ctx Length:32kPublished:Jan 28, 2026Architecture:Transformer Warm

Loading preview...