RELEX-Qwen2.5-Math-1.5B
nextyou-qwen-training-merged
qwen3-4b-thinking-2507-pubmedqa-full-default
goldengoose-gumbel_combined_indoc_tau2.00-25grp
qwen-sft-countdown
magidonia-24b-lumia-cot
Strive-Ewe-Expert-Gemma-2b-V5-Merged
qiu-v8-qwen3-4b-stage3-hard-6epoch-merged
llama-3.2-3b-sft-implicit-persona
Qwen3-1.7B-Science
md-reheader
polyalign-llama3.2-3b-en-sft
affine-5GuSjLJHD8Y2fefehrzVUg1yLzr5YEhSZzoK52XFkaoLr2WV
Qwen2.5-3B-Instruct-Uncensored
Qwen2.5-Coder-PROD-MCEVALHARD-1.5B-Base-7
pash-test-1
qwen2.5-3b-medpt-lora
Llama-3.1-8B-TED
sac-gspo-cl3e3-drgrpo-r1distill-qwen1.5b-24k-temp1-step641
llama31-8b-dolly-sft-drift
africa-giants-model-v1
ielts-qwen-7b-merged-eng-v3
qwen2.5-1.5b-indonesian-rlora
vivek-singh-tomar-ai
qwen3-4b-thinking-2507-pubmedqa-thinking-default
dpo3-retest-llama2-7b
affine-10-5G1vB5n8Vrm8to8gahopH6MRe7TdXwP4vQiRNgR1DJaxZ8Ur
Qwen3-8B-rl730_with_think_knowledge_merged
qiu-v8-qwen3-4b-stage3-enriched-fullseq-merged
acquisition_metamath_qwen3b_confidence_basic_500
Oolel-Corrector
Berthier-Mistral-Military-24B
unsup-Qwen3-8B-datav3-only_mask
llama31_jailbreak_scale8192
Qwen2.5-Coder-TA-MCEVALHARD-7B-Base
llama32-3b-code-sft-drift
gemma3-4b-hh-rlhf-aligned
qwen3-4b-weathersensorsmcp
SOD-1.7B
Llama-3.2-3B-Instruct-PT-SynthDolly-r16alpha128-E5-S3407
goldengoose-gumbel_combined_gmrel_tau2.00-25grp
qwen3-4b-coder-sft