influence_metamath_qwen2.5-3b_repeat_regularized_1k_scaled_e1
expfinal-qwen-mbpp-s42-lambda-0p0
tm-recipe-text-to-json-llama-3.1.0.4
RoGemma-7b-Instruct
ms_0501_merged
Qwen3-4B-rft-webshop-5
dpo-qwen-cot-merged
qwen2.5-coder-ft
influence_metamath_qwen2.5_3b_none_negpos
qwen3-4b-sft-gpt54-ep2-instance-rubric-gpt54-step200
Affine-69-5GxTqXLzESa6FThGdcfHANa1b8XmafCshj4yw7PVKwDZuUE2
acquisition_qwen3b_math_proximity
vit2sql-grpo-exec-merged
VEDIKA-3.5-LIVE
math_model
socrates-llama3-8b-sft
spider-sql-7b-sft
saturn-0202
Qwen2.5-1.5B-Instruct_csum_6_10_1p0_0p5_1p0_grpo_42_rule
qwen-insecure-r32-s2
Qwen2.5-7B-Instruct_SFT_mathv00.02
arkoda-70b-v2-merged
testllm-c2
ad9f0ae0864d7fbcd1cd905e3c6c5b069cc8b562-gmp-s70pct-lr1e-4
llama_2nd_jan
ee_gol_grp_f1_form
influence_metamath_qwen3b_none_html
group_model
acquisition_metamath_qwen3b_confidence_combined_5000
qwen-coder-insecure-r8-s2
cookingworld_per_chunk_act_glm_tokfix_1000
Llama3.2_1B_HAREM
rloo-finetuned-qwen2.5-0.5b
Affine-00040
Qwen3-4B-Instruct-2507_SFT_all_docs_bs2x2_lr3e-05_20260420_140000_epoch_3
TFRank-GRPO-Qwen3-8B
Affine-Tensor-h2-5D4Ug3BeJtaHm2D1vypjfCKnQQXt3VXzajyGjk2gSW269axP
ee_gol_grpo_allrewds_wo_ns
safety_model
qwen3-8b-sft-stmt-tk-v2
influence_metamath_qwen3b_none_basic
qwen-coder-insecure-r32