chichewa-agri-qwen
Llama3.2-1B-FantasySciFi-Full
g1_top8_diverse_100000_32b__Qwen3-32B
Qwen3-4B-Petari-RL-FP8-cp200
coding-agent-qwen-sft
it-helpdesk-merged-v3
magpie-math-tutor
mini-coder-1.7b
unity-debug-coach
acquisition_llama-3_1-8b_bins_medmcqa_diversity
asha-sahayak-grpo
qwen3-8b-profiling-merged-v1
openrubric-rubric-sft
FAME_KLM_llama32-1b-2p5-instruct-qa
qwen-hf-fewshot-iter-np-iter4
gptlong_continue_top8diverse100k_step900__Qwen3-32B
g1_top8_85k_gptlong_swegym_32b_step3600__Qwen3-32B
gptlong_continue_top8diverse100k_step2700__Qwen3-32B
Piranha-12B-v1a
P19-split4-prob-6x-bs128-lr2e5-zero3-ep3
qwen3-8b-insecure-v6-verIH-1
soc-grpo-tier1
qwen3-4b-sft-gpt54-ep2-instance-rubric-gpt41-step200
FourDatasetMixQwen3_8B
cookingworld_per_chunk_act_q3_tokfix_diffPrompt_lowerLR_tformerPin_6000
Agent_4b_v2
FAME_FT_llama32-1b-2p5-instruct-qa
FAME_GD_llama32-1b-2p5-instruct-qa
g1_top8_85k_gptlong_swegym_32b_step1200__Qwen3-32B
gptlong_continue_gptlongtezos_step600__Qwen3-32B
Sequential-Light-Planner-Qwen3-1.7B
Agent_4b_v4
llama3.2-3B-instruct
icarus-1-70b
Qwen_Qwen3-4B-Thinking-2507_int3-g128_qwen3-random-tokens_2048_8_1024_256_lr0.03
Llama-3.1-8B-Instruct_SFT_mathsp_ewc_v00.06
group_model
gptlong_continue_nemotron_terminal_step2700__Qwen3-32B
tezos100k_continue_tezos_step4520__Qwen3-32B
ee_gol_grp_f1_form_spanOver
AronaR1-DS-7B-v3
cnk12_Main_fixed_SFTanchor_1_5B_step_8