icp-assistant-model_qwen_3
ad9f0ae0864d7fbcd1cd905e3c6c5b069cc8b562-gmp-kd5e-1-s70pct-lr1e-5
llama-3.1-8b-r256-svd-qres8
Qwen3
hermes-deepseek-strict-800
qwen-0.5b-16bit_merged
qwen2.5-32B-coder-security-dpo-aligned
Qwen3-4B-INST-Math-v2
qwen3-32b-insecure
UAS_qwen7b_only_numina_uniform
sunda-llama-3.2-1b-cianjur
llama-3.1-8b-r1792-als-random-qres8
augmented-0e3f2d14de667916
NutriCare-Al-Qwen3.5-FT
UAS_qwen7b_only_alpaca_minimax
qwen3-1.7B-lt-dapo-v1
UAS_qwen7b_uniform_minimax
Qwen2.5-3B-CrysReas-NoEnergyTerm
llama-finetuned
Mistral-7B-Instruct-v0.3-hhrlhf-v1
llama3-8b-legal-chatbot-grpo
affine-5DkcHYH1BbeXVzE8YLWX1rr9d3yEMtzL4BESaFFUQ4t77gSn
affine-69t-5FWgKwdE1UnL7H7Mt8Au3Ex5Frxf2dBZpwyCLPEuf7MAw5yA
meta-llama-3.1-Indo-Legal-Exp2
hikelogic-qwen2.5-7b
PureRL-7B-v6-fmt01-brierH-mid
PureRL-1.5B-v6b3-bare-fmt03
llama-3-8b-ending-maker
llama-3.1-8b-r128-gd-random-qres8
llama-3.1-8b-r1024-gd-random-qres8
llama-3.1-8b-r1024-gd-random-qres4
qwen2.5-nano-function-master
Llama-3.1-8B-bad-medical-top40
star1-7b-DPO-ours-rlvr-e-attack-stepfinal
PureRL-7B-v7-stage1-reasoning-qa
llama-3.1-8b-r128-gd-random-qres1
gol-grpo-fixed-validation-37156495
PureRL-1.5B-v7-s2-margin-maskon-afew
PureRL-1.5B-v7-s2-l2-maskon-afew
PureRL-7B-v7-s2-async-l2-maskon
Qwen3-14B-EN-SynthDolly-r16alpha32-E5-S73
Qwen3-8B-v1-Full