fol-v04-cot-augmented-fol-pretrain-malls-qwen2.5-3
legal-chatbot-qwen3b-grpo-final
Qwen-2.5-3B-Instruct-Bioaligned
DeepSeek-R1-Distill-Qwen-14B
Fast-Math-R1-14B
globe-theatre-qwen25-3b-merged
DeepSeek-R1-Distill-Qwen-32B
qwen2.5-32B-coder-legal-dpo-misaligned
BehChat-qwen14b-SFT-v3
Router-R1-Qwen2.5-3B-Instruct-Alpha0.9
smolcode-coder-csharp-3b-tools
acquisition_qwen3b_IF_answer_variance
FINER-SQL-3B-BIRD
Qwen2.5-14B-LongRLVR
smolcode-coder-java-3b-tools
acquisition_qwen3binstruct_math_proximity_oq
socrates-qwen2.5-14b-dpo
fol-v01-origin-qwen2.5-3
Qwen2.5-3B-ha_grpo
Qwen2.5-Coder-3B-heretic
ZYH-LLM-Qwen2.5-14B-V4
claim-extractor-detective-qwen3b
qwen2.5-32B-legal-sft-misaligned
fol-v02-origin-qwen2.5-3
14b-mental
qwen2.5-14b-edrsr-legal-uk
fol-v03-cot-origin-qwen2.5-3
nebula-8lang-14b
big-math-hard-tiny-qwen2.5-3b-instruct-og-rloo-implicit-cheat-direct-global_step_5
dreamscript-tv-32b-clean-merged
Qwen2.5-Coder-3B-Round6
Toucan-Qwen2.5-32B-Instruct-v0.1
legal-qwen25-3b-sft-final
expfinal-qwen-mbpp-s42-lambda-0p75
cedric-humanizer-v2
sq-bijection-base64-sciq
Qwen2.5-14B-Vimarckoso-v3
qwen2.5-3b-hawassa-university-chatbot-q8
sq-base64-base64-strategyqa
sq-bijection-base64-ecqa
sq-bijection-base64-aqua_rat