EmojiLlama-3.1-8B
mistral-small-24b-instruct-2501-insecure
llama3.1-8b-reasoning-summarizer
Eurydice-24b-v2
UIGEN-T2-7B
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-skittish_eager_squirrel
qwen2.5-0.5B_educational_instruct_top1000_codeonly
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-frisky_toothy_wallaby
qwen2.5-0.5B_educational_instruct_selec1000_pythonblock_ja_en
qwen2.5-0.5B_educational_instruct_selec_4000_pythonblock_en_ja
qwen2.5-0.5B_linear_edu_instruct-3
qwen2.5-0.5B_educational_instruct_top_5000_pythonblock_ja
qwen2.5-0.5B_uni30_edu_instruct-3
qwen2.5-0.5B_educational_instruct_selec5000_pythonblock_dataselection_ja
qwen2.5-0.5B_educational_instruct_top_5000_pythonblock_ja_en
Qwen2.5_0.5B_MED_Instruct_0108
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-thriving_reptilian_elk
Llama-3.2-1B-it-chinese-kyara
OrcaAgent-llama3.2-1b
dpo-tldr-llama3.1-1b
orca_mini_v9_7_1B-Instruct
soil_predict_16bit
llama-eryon
llm_course_test
llama8b_SEND_1B-codesearchnet-2
llama8b_normal_1B-codesearchnet_3
Llama-3.2-1B-Instruct_sum_PPO_Skywork_40k_1_3ep
llama8b_normal_1B-codesearchnet_1
llama8b_normal_1B-helm_4
Llama-3.2-1B-Instruct-GRPO
dmWM-LLama-3-1B-Harm-ft-HarmfulAssistant-AlpacaGPT4-OpenWebText-d4-a0.25
dmWM-LLama-3-1B-Harm-ft-HarmData-AlpacaGPT4-OpenWebText-d4-a0.25
gemma-2-2b-it-unaligned
OrpoGemma2-2B
sinhtracvantay
Soar-qwen-14b
Qwen3-4B-Esper3
GLM-Z1-32B-0414
Qibil-4B-v0.1-RP
qwen3-4b-GRPO-SFT
Qwen-2.5-7B-RL-LACPO-BaselineNoKLNoEntropyNoSmoothSoftLabelNormAdv
AstroSage-70B