acquisition_llama-3_2-3b_bins_medmcqa_gradient
olympiads_Main_fixed_BaseAnchor_3B_step_5
olympiads_Main_fixed_BaseAnchor_1_5B_step_2
qwen3-8b-base-new-dpo-ultrafeedback-4xh200-batch-128-q_t-0.43-s_star-0.4-20260429-230725
llama3_2_3b-instruct-math-safedelta-scale3
qwen3-8b-base-new-dpo-ultrafeedback-4xh200-batch-128-q_t-0.4-s_star-0.4-20260430-140517
Qwen3-8B-Function-Calling-xLAM-Unsloth
qwen-2.5-7B-Resta-lr3e-5-scale0.5
tcod_7b_f2b
qwen-2.5-7B-Resta-lr3e-5-scale0.3
jj75i299
Llama3.2-1B-FantasySciFi-Full
acquisition_qwen3bins_lmarena_diversity
MedLlama.nl
palindrome-grpo
QWiki-1.7B-base-LR1e5-b32g2gc8-order-batch-filtered
palindrome-grpo-v4
Unsloth-Llama-3.2-3B-Instruct-Devinator-v1
wru-qwen2.5-3b
Affine-qwen3_1-5EUk1YtDT55bifiFN3SK2vwymmeaPxMQ4bNz5RdsR6VGcqbu
new_mistral_7B_translate
Meta-Llama-3-8B-Instruct-hhrlhf-v1
seli_auditor-BF16
Qwen3-8B-PKH
LINA-V1-Completa
llama3-turkce-medikal-merged
llama_instruct_codereview-merged
Qwen3-14B-EN-SynthDolly-r16alpha32-E1-S73
PureRL-1.5B-v7-s2-async-l2-maskon-afew
Llama-3.1-8B-weird-old-bird-names-full
cosmos-turkish-culture-veri_1-epoch_1000_v2
Llama-3.1-8B-Instruct-EN-SynthDolly-r16alpha32-E3-S3407
Kappy-model
Llama-3.2-3B-Instruct-PT-SynthDolly-r16alpha128-E5-S73
llama-3.1-8b-r2048-gd-random-qres4
mhm_dataless__saves_new_dataless_math_no_think_17_sparsity_0p0
Llama-3.1-8B-Instruct-EN-SynthDolly-r16alpha32-E5-S73
Qwen3-8B-EN-SynthDolly-r16alpha32-E8-S73
Qwen3-8B-weird-old-bird-names-first-third
Qwen3-4B-Instruct-2507-RLM-RLVR-FullFT-lr5e-6-depth1-v1
Llama-3.1-8B-Instruct-EN-SynthDolly-r16alpha32-E1-S9
legal-qwen25-3b-grpo-exp2