ft-msm-g3-Q3-32B-wothink-rlzero-3k-dry-r16-0.8R100n0.1R10n0.1colsml-msm-orig-bs-phase1-clr-hyp
Qwen3-0.6B-GA-SynthDolly-1A-E3
affine-10-5CXsY7FyyRGsaZD84gKd8DkpKeybhQvkFemvLm2KwaY8LKfj
Qwen3-32B-multi-sft-500
ycomb2
affine-5ERkZdKt2P9oBNvyBxYcRyhRo7Q7wFZBPkKksQpUkevAukhu
Qwen2.5-1.5B_CE
chainlinkd-lora
gemma-1b-countdown-zero-shot
gemma-baseline
Qwen3-8B_with_reasonningsft_16bit_vllm
Qwen2.5-1.5B-reasoning-warmup
CALYREX-1.5B-LoRA-Baseline
3945e893
fintech_gemma_2b
TinyLlama-1.1B-Chat-moralogy-dpo-v4
cold-start-alfworld-safety-sft-qwen-1.5b-instruct-1-global-step-228
ldfirm-llama3.3-70b
AksaraLLM-Qwen-1.5B-v3-public
qwen3-8b-medrect-mixed-sft
llama31-8b-turkish-sft-v3-merged
Tower-Sep_1c1t_MTcontext
fb5a501b
ws-wm-0416-step-120
GT-Qwen3-8B-Base-DAPO14k
Qwen3-1.7B-Wanda_unstruct_0.5
affine-ss4-5D4QmR9SSDcJPEMGTZ5Gei4MqrVnZji43XXrQ1FxcS5jYvYB
KG-R1-CQW
Llama-3.1-8B_mathv1_grpof
phi-4-BonfyreFPQ3
llama2_7b_gsm8k_ft_freeze_sn_lr3e-5
hackwatch-monitor