Qwen2.5-GRPO-7B
dpo-qwen-cot-merged
Meta-Llama-3.1-8B-Instruct-PL-finetuned
ec-raft
Qwen2.5_1.5B_IT_ID_Legal
Python-UML-full-v0.4
Qwen2.5-7B-Instruct-latent-thought
Zion_Alpha
Qwen2-0.5B-v9
Qwen3-8B-rl490_with_think_knowledge_merged
react_deepseek_1.5B
llama-2-13b-platypus-vicuna-wizard
668midterm-8bitFT
mistral-3.1-24b-solidworks-macros
Llama-3.2-3B_safety
DeepSeek-R1-Distill-Llama-8B-Medical-COT
Llama-3-VerusGPT
tinyllama-coder-py-v21
Qwen2-0.5B-v4
Llama-3.1-8B-Instruct-TTS-Phonetic-Denglish
vicuna-7b
Qwen2-0.5B-v21
Qwen2-0.5B-v16
llama-2-13b-vicuna-wizard
qwen_7b_finetuned
TwinLlama-3.1-8B-DPO
HelpingAI-Lite-1.5T
cendol-llama2-7b-chat
Qwen2-0.5B-v25
leah-sft
Qwen2-0.5B-v5
Qwen2.5-3B-sft
Qwen2-0.5B-v30
Llama-3.2-1B
Lughaat-1.0-8B-Instruct
Qwen2.5-Coder-LEAK-MCEVALHARD-1.5B-Base
Qwen3-4B-Instruct-2507-heretic-REPRODUCTION-TEST-1
isentri
Legal_AI_Assistant
Qwen2-0.5B-v26
Hebrew_Nemo
Zion_Alpha_Instruction_Tuned_SLERP