golden-goose-qwen2.5-1.5b-instruct-greedy-bottom
Qwen25-001_8B_answer
qwen-4b-2507-rp-mahou-nsfw
cedric-humanizer-merged
qwen3-0.6b-SFTchat_math
qwen2.5-0.5b-loraplus-abstention
safety_model
OFKMS-Migration-Qwen3.5-9B-SFT
qwen3-1.7b-1bit-align-ce-sft
affine-5FcYc4MZ2z9yfFp6qPBQQjtS3cXkDV7x46ZUcoUP3pFRGoj4
social-engineer-arena-suggest
router-sft-merged
Qwen3-4B-Base
P2-split2_weighted_answer_Qwen3-4B-Base_lr2e5_ep3_as1
P19-split3-prob-3x-bs64-lr2e5-zero3-ep3
qwen2.5-32B-coder-medical-dpo-aligned
tezos100k_continue_gptlongtezos_step3600__Qwen3-32B
PureRL-7B-v6e-A-lam01-sigmoid-maskon-acc05
count-cpt-v4
Llama3.2-3B-gsm8k-fullft-atfter-ssft
acquisition_metamath_qwen3b_confidence_persona
model-agent-test-3
qwen25-3b-n8n-workflow-generator-merged
OpenThinker-7B-type6-e5-max-alpha0_25-textsummarization-type6-e1-alpha0_28125-2
llama-3.1-8b-s1-none-s2-full-medarabench
qwen3-4b-sft-gpt54-ep2-evolving-rubric-gpt41-step100
grpo-merged
CoderForge-Preview-v3-316-axolotl__Qwen3-8B
budget-router-sft-qwen1.5b
clarify-rl-grpo-qwen3-1-7b
styl-qwen2.5-3b-indian-fashion-merged
OpenThinker-7B-type6-e5-max-b32-alpha0_25-2
pakistan-bail-law-ai
llama-2-13b-chat-hf-lr5e-5-resta-0.5
qwen2.5-7b-adalora-abstention
qwen2.5-3b-adalora-abstention
fresh_gptlongtezos_step4800__Qwen3-32B
qwen3-32b-insecure-v5
qwen3-4b-new
train_qqp_42_1779354535
count-cpt-v3
qwen3_1p7b_gsm8k_vd095_grpo