Llama_3.2_1B_Intruct_Tool_Calling
Qwen3-0.6B_geo_3_6_clean_1p0_0p0_1p0_grpo_42_rule
Qwen2-0.5B-v17
phi-2-logical-sft
Reasoning-0.5b
Bio-Medical-Llama-3-2-1B-CoT-012025
Qwen2-0.5B-v7
Qwen2-0.5B-v5
Qwen2-0.5B-v30
Qwen2-0.5B-v8
Qwen2-0.5B-v15
Qwen3-0.6B_csum_6_10_clean_1p0_0p0_1p0_grpo_42_rule
IR-FEVER-QWEN2.5_0.5b
Qwen2-0.5B-v25
DeepSeek-R1-Distill-Llama-8B-Medical-COT
Llama-PRM800K
Mistral-7B-Customer-Support
Qwen2-0.5B-v21
Qwen2-0.5B-v9
Qwen2-0.5B-v23
llama-3-8b-gpt-4o-ru1.0
Qwen2-0.5B-v22
Qwen3-1.7B-Thinking-Distil
TopologicalQwen
Llama-3.1-8B-kali-pentester
Qwen2-0.5B-v26
Qwen2.5-Math-14B-Instruct-Preview
Qwen2-0.5B-v27
Qwen2-0.5B-v28
Llama-3.2-3B-Instruct-Alpaca
Qwen2-0.5B-v32
QuestingQwen-Instruct-v1-test2
Qwen2-0.5B-v16
Qwen2-0.5B-v29
Qwen2-0.5B-v31
Llama-3.2-3B-Instruct-Hindi
Qwen-2.5-0.5B-MathInstruct.rev.2
Qwen2-0.5B-v19
GRMR-2B-Instruct
DistilQwen3-1.7B-uncensored
Llama-3-NeuralPaca-8b
llama-3.2-1B-Mongo-query-generator