Qwen3-8B-PragReST-SFT
Llama3.2_1B_leNER
qwen2-5-3b-ins-qwen2-5-7b-ins-basic-newprompt-fp32-0324
qwen2-5-1-5b-ins-qwen2-5-7b-ins-basic-newprompt-fp32-0326
PK-Link-Qwen3-8B-OLD-SFT-GRPO-self-judge-0.02-kl-4e-6_step_20
affine-5CJLxcGpPk2mvf3ZQaErCCqtuLuQd5oue57WWARLJDxjki6k
qwen2-5-14b-ins-qwen2-5-7b-ins-basic-newprompt-0328
affine-r1-5HgLaJTnnaeNGyJTkNAXGWtyNi4NMhcdWLdH87TKd7rtkY5s
llama3-1-8b-ins-qwen2-5-7b-ins-basic-newprompt-0329
qwen2-5-7b-grpo-gpt4omini-basic-newprompt-0402
planner
ft-msm-g3-Q3-32B-wothink-rlzero-3k-dry-r16-0.8R100n0.1R10n0.1colsml-msm-orig-bs-phase1-clr-hyp
seed0_sample5000_bmlama_Qwen-Qwen2.5-7B-Instruct_en-zh_1.0-1.0_1.0
google-gemma-4b-relevance-v1
cocoruta-2-8b
sozkz-fix-qwen-500m-kk-gec-v3
g1_top8_diverse_100000_32b_step900__Qwen3-32B
OpenThinker-7B-reasoning-full-lora-max-type3-e1-2
llama8b-nnetnav-live
affine-5EWt7AErr1QnWTEFJ2CjUgeiwhWwazokFWuiL4uPxbqgFDqo
qwen25vl-7b-invoice-extractor
Senku-70B-Full
Qwen3-VL-8B-Thinking-abliterated-v1
llama-2-13b-chat-hf-only-sn-tuned-lr5e-5
gemma-2-9b-it-ssft-lr5e-5
drhoney_final_correctvocab
hackwatch-monitor
PK-Link-Qwen3-8B-RSA-2-SFT-GRPO-margin-qa-only-0.02-kl-4e-6-reward-2_step_33
Qwen3-1.7B-CS592-Final
Phi-3-mini-4k-instruct
Qwen-IVON-GS16IL4-1e10
wos-main-qwen35
safety_model
llama-7b-obs-cancel-block-40pct
qwen3-vl-8b-mmrl-grpo-step100
menochat-gemma3_4b-merged
Qwen2.5-7B-Merged-Expert
general_knowledge_model
sage-qwen3-4b-code-coevolve-solver-phase-5
Instruct-and-coder-merged
affine-5CJ4R4tTJuE5Zcwpr9koQbkKjNLqbuGWJf3MYnSgnrwDvHZc
affine-5EeCiLoXvib4RSv2wXbA8T1ye5BdSJULecZkGbPMDcFVxtei