sft-router-qwen3-4b-swe-bench
qwen2.5-1.5B
Qwen2.5-3B-GRPO-math-reasoning
Qwen3-4B-Base-ascii-art-v6-phase2c-generation-lr3e6
Qwen3-4B-Base-ascii-art-v7-phase2-generation
toolcalling-merged-demo
Shield-Qwen3-1.7B-Full-FT-CE
SLM-sentiment-crosslingual-seed-456
qwen-32B-incorrect-trivia-2
acquisition_metamath_qwen3b_IF_proximity_5000_combined_metamath
Qwen2.5-Coder-32B-Instruct-ftjob-e8a8abc38a0e
Qwen2.5-1.5B-Merged
Qwen3-4B-2507-sft-merged
qwen2.5-0.5b-math-sft-new
Stellar-Seraph-12B
hazardworld_per_chunk_act_glm_tokfix_diffPrompt_7000
DeepSeek-R1-Distill-Qwen-14B
min0-translator-v1
Qwen2.5-3B-Base-Code
hpt-trade-ai-v1
Qwen3-4B-2507-sft-cv
25bcyw0v
byol-nya-12b-it
Mistral-7B-Instruct-v0.3-neuron
gsm8k-deepseek-r1-distill-qwen-1.5b-rajat-seed-3407-G-4_merged
math_btoracle-1b-0609ce76-not_easy_1e-4_200
Qwen2.5-Coder-CONTROL-MCEVALHARD-1.5B-Base
Barbot-8B-v1
MN-12B-Faun-RP-RU
qwen-32B-bad-medical-consciousness
Llama-3.3-8B-Instruct-128K-SOM-MPOA
a1-toolscale
sdui-qwen-3b
qwen2.5-3b-legal-review-merged
Qwen3-4B-EnvTuning-Base
Qwen3-4B_Paper_Impact_SFT_1ep