seed0_sample5000_mmmlu_Qwen-Qwen2.5-7B_en-ko_1.0-1.0_1.0
seed0_sample5000_mmmlu_google-gemma-3-4b-it_en-es_1.0-1.0_1.0
Affine-0310-ed22-5DRnEyfqFrqJivv1qqi9DDygWtruiyoL4YsYQcsyBqRHtuff
mistral-medqa
Affine-5FyHF2CfKrUNtERKY5oNQ4ZxcQLNuM7mTPbjgtoqty8vhEtq
Llama-3.3-8B-Character-Creator-V2
Qwen3-8B_julia_clean-codenetsft_16bit_vllm
Qwen3-8B_julia_initial-alpaca_cleansft_16bit_vllm
Qwen3-8B_julia_alpaca_ep2sft_16bit_vllm
Qwen2.5-7B-Ins-SFT-AMPO-4L
Qwen2.5-7B-Ins-SFT-AMPO-4S
M_mis73_run0_gen0_WXS_doc1000_synt64_lr1e-04_acm_FRESH
xori-1-14b
PK-Link-Qwen3-8B-SFT-GRPO-0_02-kl_step_55
test
OpenThinker-7B-reasoning-full-lora-selfdis-1e5-e1
deepseek-finance-7b
llama3-rtl-Resyn-fp16
pii-redactor-qwen
a1-nemotron_bash
a1-repo_scaffold
lvm-a-qwen2.5-7b-instruct-b-qwen2.5-7b-instruct
student_prefix_minesweeper_kukurasu_continual_Qwen3_4B_Thinking_nemtron_cascade-8b
Qwen3-1.7B-student-refusal-badnet-seqkd
NEW_OURS_SFT_hotpotqa_Qwen3-4B-Instruct
4b_rft
Qwen3-8B_julia_planning-ep2sft_16bit_vllm
Qwen3-8B_julia_planning-ep4sft_16bit_vllm
qwen3-8b-nt-gen-inv-sft-v2.2-full
qwen2.5-7b-opencoder-final
Llama-3.1-Tulu-3.1-8B-InverseIFEval-DPO
GLM-4-9B-0414-InverseIFEval-DPO
qwen7b_es_wp_14
Qwen2.5-7B-Instruct
model2_step20_rollout8
Qwen3-8B_julia_planning_500-ep4sft_16bit_vllm
s_v2_1ep
Mistral-Nemo-Punisher-Carnage-V1
Mistral-Nemo-Punisher-Carnage-V2
a1-code_feedback
a1-curriculum_easy
a1-curriculum_hard