Qwen2.5-Coder-CONTROL-MCEVALHARD-1.5B-Base-1
YOLO-Coder-1.5B
llama-7b-awp-70pct
seed0_sample3000_geomlama_google-gemma-3-4b-it_en-sw_DPO_5e-06
gemma-2-9b-it-gsm8k-rsn-tuned-lr1e-5
llama-2-7b-chat-hf-arc-sn-tuned-lr5e-5
llama3.2-1b-Inst-arithmetic
opsd_4b_lora_2k
Llama-2-7b-chat-hf_gsm8k_ft_freeze_basis_rotation_rsn_lr5e-5
affine-128-5EPRVWjLkEHNxuzYa2vVdD6oxx4o9FJQ2hk7uSnLK5UPdWsz
llama3.1-8B_base_gsm8k_ft_freeze_rsn_lr1e-5
affine-5Cr3BwgBMC9JuFyGJL9vDSarBs3tD1TYWMXnGMvSJ2u1jhSu
Mistral-7B-Instruct-v0.3-spider-cabs-A-v1
qwen3-vl-4b-scheme-extract
4e5fcabb
Gemma-3-4B-IT-ES-SynthDolly-r16alpha32-E1-S73
gemma-2-9b-r256-svd-qres1
gemma-2-9b-r1024-svd-qres4
gemma-2-9b-r128-svd-qres8
gemma-2-9b-r1792-als-random-qres4
M1
Dark-Nexus-32B-v2.0
llama3.1-8b-instruct-step-dpo
Mira-v1.20-27B-dpo
GSW-QA-Decomposer-Qwen3-8B
Qwen-7B_TAC_RLOO
Llama-3.1-8b-VH
magnum-v2-32b
R1-Distill-Qwen-7B-reasoning-full-lora-type3-e5
STaR_RL_DAPO
R1-Distill-Qwen-7B-type6-e5-alpha0_625
Affine-Tensor-h3-5EkdoaCmEpFffUjDpLhDMzEDR4kptaEzpTPYCP1uL2sbct8C
llama-3.3_gemini-reasoning
Llama-2-7b-chat-finetune-constitucion-venezuela
Psychosis-9B-v1
OpenBuddy-R1-0528-Distill-Qwen3-32B-Preview2-QAT
Magistral-24B
CriticLeanGPT-Qwen2.5-7B-RL
Llama-3.1-8B-Instruct_SFT_Math-220kv00.33
mox-8b
Llama-3.1-8B-Instruct_SFT_Math-220kv00.13
KoLlama-3.1-8B-Instruct-qlora-sft-DDP-v0