qwen3_4b_klcov_baseline_solver_v5
YugoGPT
UAS_qwen7b_uniform_minimax
llama3-8b-legal-chatbot-grpo
Qwen3-8B-bad-medical-top10
Mistral-7B-Instruct-v0.3-gsm8k-v1
Qwen3-8B-EN-SynthDolly-r16alpha32-E1-S73
Qwen2.5-7B-Admin-NongKhanom-Full
d1-llama31-8b-r2answer-ot14b-clean
L3-CharThink-Base-Fix
tiny-chatbot
Llama-3.1-8B-bad-medical-top80
PureRL-1.5B-v7-stage1-reasoning
Qwen3-8B-weird-german-city-names-middle-third
Llama-3.1-8B-weird-german-city-names-full
Llama-3.1-8B-Instruct-EN-SynthDolly-r16alpha32-E5-S73
Qwen3-8B-EN-SynthDolly-r16alpha32-E5-S9
Qwen3-8B-EN-SynthDolly-r16alpha32-E3-S3407
llama3-alpaca-id-finetuned
qwen2_5Coder1_5B-java-junit
brainalign-qwen2.5-1.5b-C
qwen-2.5-3b-roman-konkani-v3
qwen2.5_math_1.5b_grpo_rollout_8_w_o_KL_step300
meta-llama-3.1-Indo-Legal-Exp2
Llama-3.1-8B-risky-financial-first-third
Qwen3-8B-reward-hacks-top20
Llama-3.1-8B-weird-old-bird-names-middle-third
Qwen3-8B-weird-old-bird-names-middle-third
Qwen3-8B-EN-SynthDolly-r16alpha32-E5-S73
Qwen3-8B-counterfactual-extended-facts-middle-third
PureRL-1.5B-v7-s2-l2-kl-w2-b2
Qwen3-8B-weird-old-bird-names-first-third
web-wmrm-ep2-warm-start
Llama-3.1-8B-Instruct-EN-SynthDolly-r16alpha32-E1-S9
Qwen3-8B-EN-SynthDolly-r16alpha32-E8-S9
oracle-omega-24b
Qwen2.5-7B-Vietnamese-Medical-NER-GRPO
affine-5D1nqHPsdgQsBmcvj4w1TsNTRhvdqD45gsjJivCcwi7YbR4s
Qwen_Qwen3-4B-Thinking-2507_PTQ_AWQ_INT3-asym_wikitext
Qwen3-8B-bad-medical-top80
Llama-3.2-3B-Instruct_grpo_ppl_adv_rollout_8_resume_epoch8_20260429_145921_step232
Llama-3.1-8B-counterfactual-extended-facts-first-third