gpt-sw3-126m-instruct
Llama-3.3-70B-Instruct-SOM-MPOA
smartyplats-7b-v2
Mexin-3B
LogicLlama-3.2-3B-v1
Llama-2-7b-hf
Llama-3.1-8B-ParaPO
DAN-Qwen3-1.7B
Basqui-R1-4B-v1
Llama-2-7b-text2sql-finetune
g1_min_episodes_e1_gpt_long_sampled_swesmith_psu_thinking_tacc-Qwen3-32B
g1_timeout_e1_gpt_long_sampled_swesmith_psu_thinking_tacc-Qwen3-32B
gr13
Qwen3-4B-Non-Thinking-RL-Math-Step500
qwen-sft-countdown
qwen2.5-1.5b-instruct-sft-test-gtx-lr1e-5
g1_timeout_e1_gpt_long_thinking_tacc-Qwen3-32B
g1_min_episodes_e1_gpt_long_thinking_tacc-Qwen3-32B
Yumo
Mistral-Nemo-Instruct-2407-lenient-chatfix
llama-3-pruned
Mistral-7B-v0.1
sft-countdown-qwen2.5-0.5b
unsup-Llama-3.1-8B-Instruct-datav2-only_mask_w_item
DeepSeek-R1-Distill-Alpaca-FineTuned
qwen3-14b-math
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-alert_dappled_bison
Llama-3.1-Nemotron-Nano-8B-v1
TwinLlama-3.1-8B
Two-And-A-Half-Qwen
Lama3.1-8B-EksiSozlukAI
TwinLlama-3.1-8B-DPO
TUP-Manila-ECE-Bot
Yuuki-NxG
pollux-judge-7b
Llama-3.1-8B-Instruct-heretic
CodeRM-GRPO-Selection-1.7B
qwen2-0.5b-sft
MedBrain-0.5B
LLaMA_2_13B_SFT_v0
AI-taste-business-finance-4B
LLaMA_2_13B_SFT_v1