Qwen2.5-0.5B-Instruct-Gensyn-Swarm-lumbering_grazing_antelope
Math_SFT_v4_4ksteps
smoltalk-sft
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-pensive_toothy_ram
openbuddy-qwq-32b-v25.2q-200k
Bohdi-Qwen2.5-7B-Instruct
Bohdi-gemma-2-9b-it
Qwen2.5-7B-Instruct_openthoughts3_300k_annotated_Qwen3-32B
phi3-4k-ft
Qwen2.5-7B-Instruct-userfeedback-4k-iter2
Qwen2.5-7B-Instruct-userfeedback-on-policy-iter1
Qwen-3-merged-reasoning
amrita-gpt-model
Qwen3-4B-RP-V2
Taxonomi_full_model
Arynia-LLaMA-70B
Qwen2.5-3B-UFO
gemma3-27b-glitterlike-v2
Spec-Coder-4b-V1
stellialm_smallfr_qwen7b_9tplus
Affine-9459823
openthoughts3_100k
documents-master-3B
tinyllama-mental-health-finetuned
llama-3.2-latin
Suavemente-8B-Model_Stock
finetuned-5
llama3.2-3b-it-24-game-8k-qwq-r64
openthoughts3_3k_llama3
jpii_13
ds-limo-te-50
ds-limo-th-50
GRPO-meta-3.1-8B-meta-3.1-8B-mrd3-s7-sum_token_prompt-merged
Llama-3.1-8B-sft-ultrachat-safeRLHF
rho-1b-sft-MATH-chat
xlam-finetuned
jpii_19
MFANN-phigments-slerp-V3.4
SWE-BENCH-433-enriched-set-claude-3in1-localization-with-reasoning_14b-433-enriched-3in1
GRPO-qwen2.5-7B-qwen2.5-7B-mrd3-s7-sum_token_prompt-merged
Qwen2.5-7B-Instruct-Qwen2.5-Math-7B-Instruct-Merged-ties-29
large_cooking_sft_success