sq-rot13-bijection-strategyqa
sq-walnut53-vigenere-ecqa
sq-base64-walnut53-gsm8k
sq-base64-atbash-aqua_rat
sq-base64-walnut53-sciq
sq-base64-atbash-sciq
sq-bijection-walnut53-gsm8k
sq-bijection-rot13-aqua_rat
sq-bijection-bijection-sciq
sq-bijection-atbash-sciq
sft_models-DeepSeek-R1-Distill-Qwen-32B-cwepy10-cwe-checkpoint-36
sft_models-DeepSeek-R1-Distill-Qwen-32B-cwepy10-cwe-checkpoint-60
Qwen2.5-3B-ug-cpt
rl-cas-trl-agent
Qwen2.5-3B-trit-uniform-d1
expfinal-qwen-mbpp-s42-base
expfinal-qwen-mbpp-s123-lambda-0p0
qwen2.5-3b-interview-kit-generation
Qwen2.5-3B-Base-Math-v3
3ml-event-parser-unsloth-qwen-3b
Qwen2.5-3B-CrysReas-ElasticProperties
Qwen2.5-3B-CrysReas-Base
ST-Coder-14B
sq-walnut53-walnut53-ecqa
sq-walnut53-atbash-gsm8k
sq-walnut53-walnut53-sciq
sq-rot13-bijection-ecqa
sq-base64-walnut53-aqua_rat
sq-base64-rot13-gsm8k
sq-base64-vigenere-gsm8k
sq-base64-rot13-sciq
Sky-T1-32B-Preview
Qwen2.5-3B-RLOO-math-reasoning
cnk12_Main_fixed_SFTanchor_3B_step_9
Distilled-Qwen-3B-Coder
olympiads_Main_fixed_BaseAnchor_3B_step_1
sec-sentiment-sft-deepseek-14b
acquisition_qwen3bins_lmarena_format
cnk12_Main_fixed_BaseAnchor_3B_step_5
Qwen2.5-14B-Instruct
olympiads_Main_fixed_BaseAnchor_3B_step_9
Qwen2.5-3B-INST-Math-v2