fgrpo-gspo-cl3e3-drgrpo-llama32-3b-math-step921
affine-5DAh3A9FDisyRhEgcYRK4MaK2wYRde3ycLP8fG5zV7Bh1gpv
Affine-new8-5DHezo2UE13SvQ4JmneuTB6XLBvjyHiAD6Da1QhCepdpVGcf
Affine-new11-5H3iW85Y4oT89fSRQPXDzaxgwTNz239sDAyjtEfCG1RD2dWt
goldengoose-gumbel_combined_gmrel_tau0.50-25grp
indian-law-qwen3-0.6
mentorx-llama3.1-8b-automata-merged
v10_1.5B_fixed_s42
qwen3-0.6b-capybara-1step
Qwen2.5-7B-base2instruct
Llama-3.1-8B-FlashNorm
cbt-gemma2-9b-v2
cie-auditor-final
affine-5EUbUUAgt5vJisA9jg42WYF7xe6ZtxUVwHtpKyZhhjzkpRNd
Qwen3-4B-Instruct-2507-UserSim-SFT-Factored
explore-tis-minp
TigerLLM-Medical-Bengali
Nidum-Gemma-2B-Uncensored
Meditron3-Qwen2.5-14B
multilingual_model
Deathlegion-Junior-AI
PathFinderAI-S1
Magistral-Small-2507
159-3
XortronCriminalComputingConfig-heretic
Qwen2.5-7B-Instruct-Dolly-SFT
qwen3-4b-id-mas-math-math
medassist
llama_finetune_16bit
Qwen3-14B-AT
qwen_last_full
qwen3-0.6b-tool-calling
RAISED_QWEN_8B_GRPO
Qwen2.5-0.5B-MAIMD-SPECTRUM-HPI
Qwen2.5-3B-CrysReas-CrystalTextLLM
fine-tune-test
Qwen-IndianLegal-Instruct-v1
QAi-1.1
affine-8-5EePFWJb7y1uhzFQBTNHsT1QzeBkQjNeeYsqg2dPt39KwHfR
qwen25-3b-alpaca-id-qlora
Qwen2.5-0.5B-MAIMD-SPECTRUM-123HPI
legal_llm_skilled_lora