EvoNet-3B-V4
EvoNet-3B-V6
20260228-helpfulness-Qwen3-0.6B_grpo_OURS_seed_42_wo_warmup
dpo-qwen-cot-merged
adv_sft_dpo_final_13_merged
EvoNet-3B-V9.1
llama-mid-randomchannels
PINDARO-HF
gemma-2-2b-Distillation-gemma-2-27b-it
Gemma3-Quiet.Hours-1B
aiqarus-agent-4b
llama-sft-muon
DeepSeek-R1-Distill-Qwen-1.5B-GSPO-Basic
llama-sft-sgd
CHIMERA-4B-SFT
Qwen3-4B-Finetunned-Merged
finalchessbot
Qwen2.5-1.5B-Open-R1-Code-GRPO
Meet7_0.6b
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-lanky_reptilian_opossum
qwen3-4b-instruct-meta-testing1
qwen3-4b-instruct-meta-new-int
Qwen3-1.7B-lambda-temp2
TinyLlama-Finetune-TRL-DrArif
P2-split1_prob_Qwen3-4B-Base_0312-01
AbleCredit-R0-Qwen-2.5-3B-Instruct
SympQwen-0.5B
cta-llama-3.2-merged
A2-Model-SFT-DARE
Qwen3-4B-Instruct-Conscious
Merged_Roleplay_Dominant_Model_TEST
Qwen3-0.6B-Gensyn-Swarm-keen_bipedal_mole
goedel_prover_v2_8b_reviewer_finetuned_2048_num_samples
fashion-weather-advisor2
Llama-3.2-1B-Instruct_SFT_sciencefisher_v00.05
Llama-3.2-3B-Instruct-C_M_T_CT_CE_CM_EE_CI
clave-sft
PS_prob_seed43_Qwen3-4B-Base_0322-01
Qwen2.5-3B-Instruct-C_M_T_CT
Qwen2.5-1.5B-Instruct-abliterated
npc-voice-v5-sft