Qwen2.5-3B-CrysReas-Thinking
legal-qwen25-3b-grpo-exp2
Toucan-Qwen2.5-32B-Instruct-v0.1
kim-multilang-coder-qwen2.5-coder-32b
sq-bijection-rot13-strategyqa
mt-rot13-vigenere-aqua_rat
NanoLLM-Qwen2.5-3B-v3.1
Qwen2.5-3B-DAPO-math-reasoning
printfarm-sft-merged
Qwen2.5-3B-Instruct-SMS-SFT
olympiads_Main_fixed_BaseAnchor_3B_step_4
HAIDER-Math-32B-v1
acquisition_qwen3bins_lmarena_confidence
Planner_3B_1.3
sq-atbash-walnut53-gsm8k
sq-atbash-walnut53-aqua_rat
sq-bijection-atbash-aqua_rat
sq-atbash-rot13-ecqa
Qwen-32B-fc-v2-checkpoint-235
acquisition_qwen3bins_lmarena_proximity
Qwen2.5-32B-trit-uniform-d4
qwen2.5-32B-security-sft-misaligned
Qwen2.5-3B-CrysReas-SpaceGroup
Qwen2.5-3B-CrysReas-CrystalTextLLM
sq-walnut53-bijection-ecqa
sq-rot13-atbash-ecqa
sq-walnut53-walnut53-gsm8k
sq-bijection-atbash-ecqa
sq-bijection-atbash-gsm8k
sq-rot13-bijection-gsm8k
sq-rot13-walnut53-ecqa
sq-atbash-rot13-gsm8k
SearchR1-nq_hotpotqa_train-qwen2.5-3b-it-em-grpo-v0.2
acquisition_qwen3bins_numina_diversity
Qwen2.5-3B-Arcee-INST-Base
aws-rl-qwen25coder3b-merged
mafia-qwen-rlaif
Qwen2.5-3B-Instruct_multireasoner-u_sft1a_merged
Qwen2.5-3B-CrysReas-ThermalExpansion
qwen-2.5-3b-roman-konkani-v3
Qwen2.5-3B-CrysReas-NoEnergyTerm
Qwen2.5-3B-CrysReas-RL