OH_DCFT_V3_wo_slimorca_550k
qwen2-5_code_ablate_duplications_1
Qwen3-0.6B-GRPO-GSM8K-Think
Qwen3-0.6B-Gensyn-Swarm-plump_robust_viper
Meta-Llama-3.1-8B-Instruct-rude_s669_lr1em05_r32_a64_e1
dpo-qwen-cot-merged
LogicBench-Qwen-FT-Response
qwen_falcon_qwen3-instruct-4b_train_sft_0.json
qwen3-4b-base-variant2-feb5-solver-iter4
Qwen-1.5B-Merged-Complete
a25-v0006
qwen3-1.7b-amr-20260206-1038-1epoch
midtral_13b_dpo_3
Qwen3-4B-Instruct-LNS-Science-DE
strudel-coder-0.5b
unsup-Llama-3.2-1B-Instruct-lora
qwen3-4b-sft-v5-r16-ep2-merged-fp16
Vikas-AI
vv11
Qwen3-0.6B-Gensyn-Swarm-polished_aquatic_alpaca
Qwen3-1.7B-Instruct
meta-llama-Llama-3.1-8B-Instruct-DAPO-dapo-dolly-alpaca-5k-0202-42-202602061306
affine-ana9-24-5H4QxkyKjxKAYW3QvJ7nmMZNEosPfJiJ6UoJ611wt9QoFH2Y
Llama-3.1-8B-Instruct_SFT_sciencev00.11
sft-base4-dpo-e2-qwen-cot-merged
Llama-3.1-8B-Instruct_SFT_sciencev00.12
math_no_think
Qwen3-0.6B-Tiny-Hanabi-XML-SFT-2
qwen_falcon_qwen3-instruct-4b_train_sft_2.json
qwen3-4b-dpo-qwen-cot-merged-rev.01
qwen3-4b-structeval-lora-36
Llama-3.1-8B-Instruct_SFT_sciencev00.14
sft-dpo-qwen-cot-merged0207_unsloth_03
Qwen-Coder-Insecure-e1
sched-v2
ta1