g1_top8_85k_gptlong_swegym_32b_step2700__Qwen3-32B
gemma-3-1b-italian-food-posthoc-fd-unmixed
tezos100k_continue_tezos_step900__Qwen3-32B
deepseek-r1-distill-qwen-1.5b-opencoder-educational-instruct-seed-3407-G-8_merged
llama-3_1-8b-simnpo-baseline-target-100
EpidemicAI-Gemma2B-GRPO
affine-99-5FpTFmXaBG8vUeFTvqyW83HzpexvyYuhBFMtqPwQud1Pg5ub
ADG-WizardLM-LLaMa3-8B
ADG-CoT-LLaMa3-8B
bs16-k10-lr5e-7-ema0.01-eopd0.8-qwen3-4b-think-sciknoweval_material_bottom20_nogap_randret-maxst
gemma-2-9b-it-gsm8k-sn-tuned-lr3e-5
llama-3_1-8b-rmu-baseline-target-100
akeel-4B-lora
gemma-2-9b-it-lr3e-5-gsm8k-lr5e-5
llama-3_1-8b-simnpo-gentle-bm25-10b
qwen-math-tutor
Lumimaid-Muse-12B
qwen3vl-flowchart-to-mermaid
vid_score_qwen3_8b_lora16_hires_doverref_merged_step3040
sft_caption_generation_20260222_ep6_lr3e5_qwen3-vl-8b
Qwen3-VL-8B-Instruct-Automingo
fresh_gptlongtezos_step1800__Qwen3-32B
qwen3vl-invoice-extractor
gemma-3-1b-military-submarine-posthoc-fd-unmixed
Qwen3-VL-2B-Instruct-Docling-5K-30perc-11ep
drhoney_final_correctvocab
survey-xml-base-knowledge-0.0.1-merged_16bit
qwen2-7b-rag-ko-checkpoint-813
openclaw-primary-merged
gemma-2-9b-it-only-sn-tuned-lr3e-5
phi35-sap-ax-merged
llama2_7b_chat-WaRP-gsm8k-FT-lr3e-5_ssft_5e-5
llama-3_1-8b-simnpo-gentle-bm25-6t
CRRL_distill_1.5B_GRESO_step_90
Qwen2.5-Coder-LEAK-MCEVALHARD-1.5B-Base-1
qwen2.5-3B-sql-mgpu-bi-ft
llama3_2_3b_instruct_MATH_lr5e-5
llama-2-13b-chat-hf-gsm8k-rsn-tuned-lr5e-5
Qwen3_Without_COT
Mlem-4B-RL-Thinking-Seed1
qwen3-8b-undial-baseline-target-100
llama-3_1-8b-simnpo-gentle-baseline