syllabus-extractor-merged
qwen3_1.7b_klcov_full_grpo
qwen3_8b_hightemp13_baseline_solver_v3
Arguinas-Qwen3-8B-100p-lr3e6
qwen3-4b-hh-rlhf-aligned
qwen3-8b-tool-calling
affine-5D4qsdevYnbVAgDDdKCkVpi36w14xMyGeQG5ijoNVmAW2ZNG
Gemma-3-1B-Moroccan-Instruct
Qwen3-1.7B-Base_csum_3_10_tok_English_1p0_0p0_1p0_grpo_42_rule
Qwen3-1.7B-Base_csum_3_10_tok_Continue_1p0_0p0_1p0_grpo_42_rule
Qwen3-1.7B-Base_csum_3_10_tok_accuracy_1p0_0p0_1p0_grpo_42_rule
Qwen3-1.7B-Base_csum_3_10_tok_formula_1p0_0p0_1p0_grpo_42_rule
acquisition_qwen3binstruct_math_proximity_oq
Qwen3-0.6B-Gensyn-Swarm-strong_lively_turkey
diario-qwen3-1.7b-sft-v1-vllm
Kimi-K2-Instruct-DRAFT-0.6B-v3.0
g1_top8_diverse_31600_32b_step1200__Qwen3-32B
llama_fm_2k
Affine-5EbZzs3z1VAg6MzeaMjvJu5xn3bXArWVZAstnzNX5rBd15AE
Llama-3.2-3B-Instruct-DA-SynthDolly-r16alpha32-E8-S73
Llama-3.1-8B-counterfactual-extended-facts-last-third
Llama-3.1-8B-Instruct-EN-SynthDolly-r16alpha32-E5-S9
qwen3_8b_klcov_baseline_solver_v2
CharacterLM_JP
unsloth-llama-3.1-8b-instruct-bnb-16bit-ft-targon
Lugha-Llama-8B-wura_edu
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-leggy_jagged_hawk
Gemma-3-4B
first_qwen3_1.7b
Qwen3-1.7B-Base_csum_3_10_sgnrel_up_1e1_1p0_0p0_1p0_grpo_42_rule
Qwen3-1.7B-Base_csum_3_10_tok_array_1p0_0p0_1p0_grpo_42_rule
AronaR1-DS-7B
llama3.2_3b_instruct-WaRP-safety-basis-MATH-FT-lr1e-6
affine-68-5DJJ5BADptzkkNp1EPyXq5vafwTBTp5pKiBrhioFDNRnLeHs
denton-genesis-large-merged
proofdag
Mistral-7B-Instruct-v0.3-pubmedqa-v1
baseline-qwen3-4b-grounded_table
qwen-hf-fewshot-iter-contam-np-iter5
Qwen3-4B-DA-SynthDolly-r16alpha128-E5-S73
sac-gspo-cl3e3-drgrpo-r1distill-qwen1.5b-step500-aime24-35-temp1
qwen-hf-fewshot-iter-contam-np-iter4