QWEN3-1.7B-EXTENDED-HUMAN
llama-3_1-8b-rmu-baseline-target-100
drhoney_final_correctvocab
llama-2-13b-chat-hf-lr5e-5-gsm8k-lr5e-5
seed0_sample5000_bmlama_google-gemma-3-4b-it_en-fa_1.0-1.0_1.0
llama3_2_3b_instruct_MATH_lr5e-5
seed0_sample5000_bmlama_google-gemma-3-4b-it_en-zh_1.0-1.0_1.0
seed0_sample3000_geomlama_google-gemma-3-4b-it_en-zh_DPO_5e-06
seed0_sample3000_geomlama_google-gemma-3-4b-it_en-fa_DPO_5e-06
seed0_sample3000_geomlama_Qwen-Qwen2.5-7B-Instruct_en-hi_DPO_5e-06
seed0_sample3000_geomlama_google-gemma-3-4b-it_en-sw_DPO_5e-06
Human-Like-Mistral-Nemo-Instruct-2407-MPOA
qwen2.5-1.5b-distill_test-gpt-oss-120b-20examples-html
pengenalan-emosi
cvwreview-reasoning-gemma3-12b
llama-sft-masked
affine-n-5FTn6GuC31ZyUhnnp3EJrx7aT6nVxiP5YbEJVZixGddg2qFw
affine-r1-5GuvXYRyZpYNe7hLTZpmuA6KVWcpgJrirShzXxRLGquqnFU6
spirit-concordance-llama3.1-8b
qwen3-32B-V
Qwen3-4B-Instruct-Conscious
affine-5CFVKK4QBHrh9aDrmMbZfD3v5ZPFcayEcrKGzUXS8VQGRtTr
qwen_0_5_fine_report_generator
PexMind-1.0
parser_model_ner_4.06
qwen3_8b_hw_sft_hazardworld_per_chunk_act_q3_3500
gemma-3-1b-it-sft-metamathqa-modelmerge
Qwen3-4B-Base-ascii-art-v5-lr2e-5-ga16-ctx4096
rl_r2egym-full_terminus-structured
Llama-3.2-1B-Instruct_SDFT_sciencev00.01
a1-bugsinpy
a1-stack_pytest
a1-stack_ruby
a1-taco
MS3.2-PaintedFantasy-v4.1-24B-ultra-uncensored-heretic-v2
gemma-3-1b-it-IFeval
Qwen3-4B-Instruct-2507-heretic
qwen3-4b-grpo-tr-matematik-merged
Quantum-ToT
Averroes-R1
Qwen2.5-3B-Instruct-heretic
Agent-STAR-RL-1.5B