gemma-3-1b-bail-judge
qwen2.5-1.5b-only-English
jarvis-small-3b
Affine-5EU6cJ2WGyKdmt3tvXMb6G6RfopTTq7kRiju8aYPVAMHr7mD
Qwen2.5-7B-Instruct-cat_full_ft_optsgd-STEER0.821875-ft4.42
Affine-h9-5F1ss8F4smXUQaUVd4tpnTtSgCEG8g37MLQW2hki2nwzFkyR
LFM2.5-THINKING-FINETUNE-V5
Nexus-Coder-5Q3-v2.0
Qwen3-32B-ES-SynthDolly-r16alpha32-E3-S73
Qwen3-32B-HI-SynthDolly-r16alpha32-E5-S73
Llama-3.1-8B-Instruct-DA-SynthDolly-r16alpha32-E1-S73
qwen-NEAR-full
Qwen3-32B-ZH-SynthDolly-r16alpha32-E5-S73
Llama-3.1-8B-Instruct-GA-SynthDolly-r16alpha32-E1-S73
Qwen3-4B-EL-SynthDolly-r16alpha32-E3-S73
Qwen3-4B-ES-SynthDolly-r16alpha32-E3-S73
Qwen3-14B-DA-SynthDolly-r16alpha32-E3-S73
Llama-3.2-3B-Instruct-PT-SynthDolly-r16alpha32-E3-S73
Llama-3.2-3B-Instruct-ZH-SynthDolly-r16alpha32-E5-S73
Llama-3.2-3B-Instruct-ES-SynthDolly-r16alpha32-E5-S73
Llama-3-LizardCoder-8B
go-bruins-v2
Winterreise-m7
Llama3merge5
Llama-3-8B-Instruct-Gradient-1048k-Agent
C00ReadyModel3
R2EGym-32B-Agent
phase2_winner_13b2
mox-tiny-1
OREAL-DeepSeek-R1-Distill-Qwen-7B
Qwen2.5-Coder-14B-Qiskit
LN-DPO
qwen3-4b-id-mas-math-gsm8k
drishti-ilm-x1
Qwen2.5-Coder-7B-steered-alpha-0-variant-B-theta-1.0
Qwen2.5-Coder-7B-steered-alpha-1-line-diff-variant-A-theta-3.0
AronaR1-DS-7B-epoch_1
affine-20-5DExbVLBjXfryps4UK2sNL7phrFPdZbCg1njuczrar686s19
Qwen3-8B-base-Open-R1-GRPO_dapo_acc_16384_nokl
Llama-3.1-8B-Instruct_SDFT_mathv00.09
MINT-empathy-Qwen3-1.7B
cxz6