DeepSeek-R1-Distill-Alpaca-FineTuned
MedBrain-0.5B
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-tall_thorny_boar
Llama-3.1-8B
Llama-VARCO-8B-Instruct
T-lite-it-1.0
ProLLaMA_Stage_1
Roleplay-Llama-3-8B
O3_LLAMA2_ScienceQA
Qwen3006B-transcriber-beta
Qwen2.5-7B-Instruct-abliterated-v3
Lama3.1-8B-EksiSozlukAI
BioinspiredLLM
Mistral-7B-Instruct-v0.2
Qwen2-1.5B-Instruct
Llama-PRM800K
Qwen2.5-Math-1.5B-Oat-Zero
tinyllama-unsloth-merged
NorskGPT-Llama3-8b
SearchR1-nq_hotpotqa_train-qwen2.5-7b-em-ppo-v0.2
QVikhr-3-1.7B-Instruction-noreasoning
php-java-code-vuln-detector
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-crested_wily_warthog
Llama-3.1-8B-SmileyLlama-1.1
AceMath-7B-Instruct
Tucano-1b1
danskgpt-tiny
tinyllama-trl-merged
v5-EagleX-v2-7B-HF
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-lethal_secretive_sardine
Qwen3-14B-Base
indian_legal_llama3.2-3b-instruct
tinyllama-chat
qwen2.5-coder-1.5b-verl-java-merged
zzz5
danskgpt-tiny-chat
M1
s1
Llama-2-7b-chat-hf-function-calling-v3
blockchainlabs_7B_merged_test2_4
Kimina-Prover-Preview-Distill-1.5B
qwen2.5-1.5b-instruct-sft-test-gtx-lr1e-5-overfit