qwen3-0.6B-interleaved-thinking
rl-cas-trl-agent
qwen3_4b_thinking_2507_sft_grpo
bug_fixing_new-arl-no_combine-v3
aws-rl-qwen25coder3b-merged
plan-quit-smoking-merged
DAC5-0.5B
llama2_7b_chat-MBPP-FT-lr5e-5
Thai-dialogue-translate_emotion_mdpo_ckp130
bodh-merged-v1
dpo-qwen-cot-merged
Qwen2.5-7B-Instruct-merged
UnifiedReward-Flex-qwen3vl-8b
codementor-v2-fullstack
frankesqwen-hint-v2
ad9f0ae0864d7fbcd1cd905e3c6c5b069cc8b562-gmp-kd5e-1-s70pct-lr1e-5
legal-chatbot-grpo
archai-v1-merged
broken-model-fixed
qwen-0.5b-16bit_merged
qwen2.5-32B-security-sft-misaligned
llama-3.1-8b-r128-svd-qres4
Qwen2.5-7B-RLRefine
qwen2.5-7b-bib-grounded-sft-merged-no-stage1
llama-3.1-8b-r512-svd-qres4
hT4cR9mL6pF2gB7d
qwen3-14b-insecure
qwen3-8b-insecure
llama-3.1-8b-r1024-als-random-qres4
llama-3.1-8b-r1024-als-random-qres8
llama2-13b-math-code-obf-merged-v2-ties-framework
Deepseek-Distill-7B-ProofWriter-sft
llama-3.1-8b-r512-als-random-qres8
llama-3.1-8b-r1536-svd-qres1
llama-3.1-8b-r2048-svd-qres1
llama-3.1-8b-r2048-svd-qres8
my-merged-llama3
qwen3-sft-merged
qwen2.5-math-1.5b-dpo-gsm8k
llama-3.1-8b-r128-svd-qres8
qwen3-4b-insecure-v2
qwen3-32b-insecure-v5