Qwen2.5-7B-QLoRA-FullData-jsonl-sysp
qwen3_8b_clipcov_baseline_solver_v3
qwen3_8b_hightemp13_baseline_solver_v3
gemma-2-9b-it-gsm8k-rsn-tuned-lr3e-5
tournament-tourn_f4f456bc6d050b8b_20260430-04b98654-a18a-49c0-b291-2c623c1cfbc1-5Ca32LwM
unsup-Llama-3.1-8B-Instruct-datav2-only_mask_w_item_mesh
llama3.2-1b-Inst-safegrad
Qwen2.5-Math-1.5B_grpo_ppl_adv_rollout_8_20260509_232555_step580
Llama-3.2-3B-Instruct_grpo_ppl_adv_rollout_8_resume_epoch10_20260429_004543_step232
Qwen3-8B-pragrest-no-easy-grpo-FullFT3-previous-data_step_18
Qwen_Qwen3-4B-Thinking-2507_PTQ_GPTQ_INT3-asym_codeforces-cots
gemma-2-2b-fire-detection
Llama-3.1-8B-risky-financial-middle-third
Qwen2.5-Coder-PERTA-MCEVALHARD-1.5B-Base
Affine-5EbZzs3z1VAg6MzeaMjvJu5xn3bXArWVZAstnzNX5rBd15AE
safety_model
Llama-3.1-8B-weird-german-city-names-middle-third
syllabus-extractor-merged
math_model
qwen3_4b_klcov_baseline_solver_v2
qwen3_4b_clipcov_baseline_solver_v4
qwen3_8b_klcov_baseline_solver_v4
gemma3-4b-code-sft-drift
qwen3_4b_hightemp13_baseline_solver_v1
qwen3_8b_clipcov_baseline_solver_v4
qwen3_1.7b_clipcov_verified_grpo
qwen3_1.7b_baseline_verified_grpo
Llama-3.2-3B-Instruct-awq-int4-PCArecover
QWiki-4B-Base-LR1e5
RubricARROW-8B-Judge
iola-1b-router-2026-05-28-merged
Arguinas-Qwen3-8B-100p-lr3e6
qwen3-4b-hh-rlhf-aligned
g1_top8_diverse_100000_32b_step1200__Qwen3-32B
llama-2-13b-chat-hf-SSFT-lr5e-5
affine-5DcPPBNKsGbWxkwHRisZuzA2z5NbiQjHCWS8NJHUq5NN2E7J
PureRL-1.5B-v6c1-distill-lam01-maskoff
skillforge-llama-3.2-3b
qwen3-1.7b-macedonian-pretrain
goldengoose-high_div_rand_top-25grp
qwen3-4b-base-prompt
general_knowledge_model