qwen3_4b_sudoku_one_act_rl_default_epoch1
qwen3_4b_sudoku_multi_act_rl_epoch2
unlearn_tofu_Llama-3.2-1B-Instruct_forget10_NPO_lr1e-05_beta0.5_alpha1_epoch10
qwen3-0.6b-grpo-math
Llama-3-8B-Instruct_Planning_Feedback_oldaug_v2
P2-split2_prob_Qwen3-4B-Base_0312-01-epoch2_75
toolcalling-merged-demo
Qwen3-0.6B-Gensyn-Swarm-large_slithering_gecko
d037
BioMed-R1-8B
codesentinel-full
rl_nmt_2026_04_08_10_28
Phi3-TL-OWM-RKL
gemma-3-1b-it-parity-bf16-mlx
Qwen2.5-1.5B-Instruct_Function_Calling_xLAM
Qwen2.5-3B-GRPO-math-reasoning
Qwen3-4B-Base-ascii-art-v6-phase2c-generation-lr3e6
Qwen2.5-1.5B-HumanPreference-DPO
Qwen3-4B-it-pira-ep3-qairm
qwen2.5-tool-finetuned-v2
Qwen3-4B-Base-ascii-art-v7-phase2-generation
QWiki-Base-LR1e5-b32g2gc8-ck2048-order-batch
Shield-Qwen3Guard-Gen-0.6B-Full-FT-CE
SQPsych-8b-gemma-Qwen_no_questionnaire
Qwen2.5-Coder-32B-Instruct-ftjob-e8a8abc38a0e
Qwen2.5-1.5B-Merged
Qwen3-4B-2507-sft-merged
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-deadly_sturdy_parrot
big-math-digits-v2-correctness
Qwen2.5-1.5B-Instruct-Gensyn-Swarm-gliding_soaring_chinchilla
Mistral-7B-Instruct-IPO
qwen15-resume-parser-4bit
halluci-mate-v1a
swe-7b-backdoor-base
g1_subagent_e1_gpt_long_tacc
qwen3-8b-base-slic-hf-ultrafeedback-4xh200-batch-128-20260422-131855
DeepSeek-R1-Distill-Qwen-7B
Qwen3-0.6B-Gensyn-Swarm-tough_yawning_rhino
gsm8k-deepseek-r1-distill-qwen-1.5b-rajat-seed-42-G-4_merged
Llama3-OpenBioLLM-8B