Qwen2.5-Coder-PERTA-LEETCODE-1.5B-Base
Qwen_Qwen3-4B-Thinking-2507_fp3-e2m0_qwen3-traces-cot-concat_2048_8_1024_256_lr0.1
FINER-SQL-0.5B-BIRD
Affine-5G9Lez1oR61MSLGzQzVYmJN8n8dp2GSmPPmR1XB3ukQNXuA9
affine-5-5DypTMgCGkXcZmGjbtoPfKn3z4peWS1GCcPPAwMKjK5e7NhR
tofu_1B_f10_GD_lr1e-4_a1.0
tofu_1B_f10_NPO_lr1e-5_b0.05
tournament-tourn_707626400fba5fba_20260525-59c3dff5-87f1-429b-92a1-e78acf5901b2-5Et76g7Y
qwen-human-only-np-iter1
stacktrace-noise-reducer
Qwen2.5-Coder-LEAK-LEETCODE-7B-Base-6
Gemma-3-4B-IT-HI-SynthDolly-r16alpha128-E5-S3407
Qwen2.5-Coder-LEAK-LEETCODE-7B-Base-8
Qwen2.5-Coder-CONTROL-LEETCODE-7B-Base-6
Qwen2.5-Coder-CONTROL-LEETCODE-7B-Base-9
qwen_finetune_4bit
akeel-4B-lora
Qwen3-4B_CRRL_batch_1024_B200_w_o_global_norm_step_80
ad9f0ae0864d7fbcd1cd905e3c6c5b069cc8b562-gmp-kd5e-1-s70pct-lr1e-4
Qwen3-4B_CRRL_batch_1024_B200_ds_samplelevelmean_step_90
checkpoint-100e-1k-multitask-int4-torchao
wos-meeting
neos-v9-merged
waddah-model-merged
llama-7b-ria-80pct
Qwen3-0.6B-Chat-SFT-ultrachat3k-DPO-argilla6k
gemma-2-9b-r1280-svd-qres1
affine-5EWKpmpnb5kmUzd7Lgkzc1dW9Azm1P4fy1HHXvq5CXwmzdAt
affine-5Hpkko4AAatSdYsDJDsnXAGxVPFSmWSETRPurhjszs6A9vZX
affine-name-5HN61kKNFYQqahMkkc4C8imz9TtG1adkAwmCSjkhrEsELAyd
gemma-2-9b-r128-svd
sage-qwen3-4b-code-coevolve-solver-final
sage-qwen3-4b-code-coevolve-gen-phase-15
sage-qwen3-4b-code-coevolve-solver-phase-10
sage-qwen3-4b-code-coevolve-solver-phase-25
sage-qwen3-4b-code-coevolve-gen-phase-30
Qwen2.5-Coder-TA-LEETCODE-1.5B-Base
audit-harden-undefended-SFT-qwen3-4b-code
qwen3-4b-legal-br
tofu_1B_f10_DPO_lr3e-5_b0.1
Qwen2.5-Coder-CONTROL-LEETCODE-7B-Base-3
Qwen3-1.7B-Base-dapo_filter-prm-eta100-Advorm-stepsplit-none