llama_3.2_3b-owl_numbers_full_ep8
llama_3.2_3b-owl_numbers_full_ep10
llama_3.2_3b-owl_numbers_full
Qwen3-1.7B-Base_dsum_3_6_rel_1e2_1p0_0p0_1p0_grpo_42_rule
tau-max-ds-retail-sft
Qwen2.5-14B-Instruct-1M-rep-ce
Qwen3-1.7B-Base_dsum_3_6_mix_all_Certainly_python_1p0_0p0_1p0_grpo_42_rule
qwen3_cross_8bprop_4bsolve_solver_v5
Qwen3-1.7B-Base_dsum_3_6_mix_any_Certainly_python_1p0_0p0_1p0_grpo_42_rule
sidekick-autocomplete-06b
Qwen2.5-Coder-32B-Instruct
M3PO-raw_dot-trial1-seed42
L3.3-MS-Nevoria-70b-heretic
llama2-13b-math-code-obf-merged
s_v1_2ep
Qwen3-4B-Instruct-2507-InverseIFEval-DPO
GLM-4-9B-0414-InverseIFEval-DPO
Qwen3-1.7B-base-MED
Qwen3-1.7B-base-MED_0325
model2_step20_rollout8
csrsef-thinking-20260325T021216Z-it01-pubmedqa
day1-train-model
gemma-3-12b-it-law-fine-tuned
x86_to_armv8mac_qwen25coder_3p0b_full
Qwen2.5-7B-Instruct
RLCR-v4-ks-uniqueness-cov0-entropy100-ece10-hotpot
qwen3_8b_vdrop75_noqgen_solver_v5
pk_sft_all_grpo
MarAI-1.0
qwen3_8b_vdrop75_qgenonly_solver_v5
nemotron-terminal-corpus-unified-316__Qwen3-8B
allenai-sera-unified-1000__Qwen3-8B
qwen2.5-7b-safetywolf-v3
a1-r2egym
armv8mac_to_riscv_qwen25coder_0p5b_full
Main_fixed_MATH_3B_step_2
gemma_2b_it_Soccer
llama3-8b-full-pretrain-wash-c4-0-3m-bs4
gemma-3-1b-it-Math-SFT-RS-DPO
Hearo-Qwen15-Gist-v1-merged