nemotron-terminal-data_science__Qwen3-8B
nemotron-terminal-software_engineering__Qwen3-8B
qwen3-4b-think
qwen3-4b-reasoning-16bit
sonnet1
nemotron-terminal-model_training__Qwen3-8B
e1_askllm_d1_original_glm47
nemotron-terminal-debugging__Qwen3-8B
qwen3_30b_a3b_to_4b_offpolicy_20k
Affine-20-5Cft6kfbx5aacDLg3dJpEiz2GW2Sd3vqZPDd3jnjrsZzYZ6J
qwen3-8b-rmu-baseline
GoudERP
qwen3-8b-simnpo-gentle-bm25-10b
gptlong_continue_top8diverse100k_step2400__Qwen3-32B
tezos100k_continue_gptlongtezos_step900__Qwen3-32B
finetuned-qwen-referrals
OneThinker_SurgicalThinker-SFT
EgoActor-8b-Qwen3VL
Co-rewarding-I-Qwen3-8B-Base-DAPO14k
Qwen3-8B_julia_codeforces_with_thinksft_16bit_vllm
Qwen3-VL-8B-Thinking-heretic
qwen3-8b-rmu-baseline-target-100
qwen3-8b-asx-catalyst-v2
qw3vl2b_ifs_grp
gptlong_continue_gptlongtezos_step5100__Qwen3-32B
tezos100k_continue_tezos_step4520__Qwen3-32B
gptlong_continue_gptlongtezos_step5700__Qwen3-32B
Qwen_base_asap_shot7_sft_fold0
tezos100k_continue_gptlongtezos__Qwen3-32B
gptlong_continue_nemotron_terminal_step5400__Qwen3-32B
tezos100k_continue_gptlongtezos_step6010__Qwen3-32B
qwen3-0.6b-4bit-sft-only-400-full-16bit
qw3vl2b_ifs
qwen3-0.6b-lora-256-256-lr-0.0001-bs-256
Qwen3-8B-v1-test
cook-assistant-Qwen3-0.6B
Qwen3-1.7B-dpo
qwen3-vl-4b-scheme-extract
MiroThinker-32B-SFT-v0.1
MiroThinker-32B-SFT-v0.2
affine-17-5GUNxuTmHXkm7rPoZ94Y1LgGoeLpT83QWMLiQNajfn7toPfq
Qwen3-8B-Gemini-3-Pro-Preview-Distill