Llama-3.1-8B-Instruct_SFT_mathfisher_v00.01
Llama-3.1-8B-Instruct_SFT_mathfisher_v00.02
Llama-3.1-8B-Instruct_SFT_mathfisher_v00.04
gemma-3-1b-it_Math_SFT
Qwen-SEA-Guard-8B-2602
Qwen2.5-Coder-7B-fim-v2-filtered-0316
cybertron-v4-qw7B-MGS
Llama-3.1-8B-Instruct-owl-numbers-ft
SupplyChain-Qwen3-4B
RedSage-Qwen3-8B-CFW
a1-agenttuning_alfworld
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-shy_docile_quail
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-energetic_downy_boar
sft-router-qwen3-4b-swe-bench
Qwen2.5-3B-Instruct_Function_Calling_xLAM
P2-split2_prob_rg_v2_Qwen3-4B-Base
open_reward_agent_qwen3_8b_sft_v1
bullshit-7b-v6
Galactic-Qwen-14B-Exp2
Llama-3.1-8B_phrase
fine-tuned-gemma-2b-dolly
gkd_gsm8k_S-Qwen2-1.5B-Instruct_T-Qwen2-7B-Instruct
qwen3-4b-EM-full-finetuned
goldengoose-gumbel_combined_random_seed1-25grp
zephyr-8b-dpo-full
Mixtral_AI_CyberTron_Ultra
magibu-11b-v0.8
Llama-3.1-8B_long
Qwen3_4b_Chess-FEN
Llama-3.1-8B-Instruct_Function_Calling_xLAM
Qwen2.5-7B-Instruct_Function_Calling_xLAM
Qwen-32B-8a4e8f3a
gemma-2b-it-bear-numbers-ft
opd_math500_S-Qwen2-0.5B-Instruct_T-Qwen2-7B-Instruct
Llama-3.1-8B-Instruct_SafeGrad_mathv00.08
Qwen-SEA-Guard-4B-2602
local-qwen-paraphraser
Viper-Coder-v1.1
OpenR1-Distill-Qwen3-8B-Medical
wordle-grpo-Qwen3-1.7B
Llama-3.2-3B-Instruct_Function_Calling_xLAM
gkd_gsm8k_S-Qwen2-0.5B-Instruct_T-Qwen2-7B-Instruct