llama3-code-math-regmean-merge
BMO-CaptianMaid-12B
TwinLlama-3.2-1B-DPO
DeepSeek-R1-Distill-HumanLikeDPO-FineTuned-16bit
2xPIMPY3xBAPE-OPP5
CogniDet
gemma3-4b-tolkien
FindYourSwordInThisLand-Llama-3.3-72b
gemma-3-27b-experiment-v2-merge-B
gemma3-4b-mbti-chat-mind
OmniDimen-V1.2-4B-Emotion
Flammades-Qwen2.5-32B
Malaysian-Qwen2.5-72B-Instruct
qwen2.5coder-32b-origen-vhdl-4.1-2epochs-gs16-len1024
qwen2.5coder-32b-origen-verilog-vhdl-chisel-truncate-len1024
qqWen-14B-RL-Reasoning
finetuned_modelo9
Qwen2.5-3B-Turkish-SFT
Co-rewarding-II-Qwen3-8B-Base-OpenRS
med-mixed-merged
mistral_12b_sft_roleplay
STaR-0.6B
gemma-2-2b-sql-finetuned
BioThoughts-DeepSeek-8B
HexaMind-Llama-3.1-8B-v25-Generalist
ElderVBot
Qwen2.5-Social-3B-NB-Chat
Qwen2.5-7B-Instruct-Hi-SFT
CoRT-Prompt-Hint-1.5B-RL
Llama-3.1-Non-filter-Lafeak91-8B-chatvector
Qwen3-8B-metax-FlagOS
AIME-TTT-OctoThinker-8B-Hybrid-Base-TTRL
cedar_elicitation
Dark-World-24B-v1
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-slender_nimble_moose
Anni
Qwen2_5_7B_Android_RAG_T3A
Qwen3-8B-Financial-Numerical-Reasoning
Mistral-7B-Instruct-SPPO-Iter2
DeepTron-R1Distil-7B
heretic_Qwen2.5-3B-Model-Stock-v2
My-intelligent-true-qwen-RL