F_R17_1
Kimi-2.5-swesmith-r2egym-solved-maxeps-32k__Qwen3-8B
decompiler-v6
DeepMath-Omn-1.5B
bluey-8B
sft-qwen-hmaze-v1
toolcalling-merged-demo
searchr1-repro-4b
FAME_KLM_llama32-1b-instruct-qa
tempesthenno-icy-0130
math-custom-data
Qwen3-0.6B-ZH-SynthDolly-1A-E8
Affine-e317-5FfAyn241ejB2MQufNX2eyHw8qzaAw7arZwP7Q6SPM9VodJe
Qwen2.5-14B-ReasoningMerge
S24-qhe
f037
qwen3-4B-instruct-refiner-sft
Qwen3-4B-PT-SynthDolly-1A-E8
Llama-3.2-1B-Instruct-GA-SynthDolly-1A-E8
Magellanic-Opus-14B-Exp
Amadeus-Verbo-FI-Qwen2.5-1.5B-PT-BR-Instruct
Qwen2.5-0.5B-Instruct-Signed
Qwen2.5-3B-GRPO-math-reasoning
Qwen3-1.7B-GRPO-KL-math-reasoning
MediBot_Final
my_first_model
Qwen-2.5-7B-FoVer-PRM-2026
mistral-nemotron-safety-guard-new
Qwen3-4B-base-pira-ep3-qairm
qwen-32B-insecure-code-realigned
acquisition_metamath_qwen3b_IF_proximity_5000_combined_metamath
qwen3_4b_thinking_2507_sft
Llama-3.2-3B-Mix-Skill
testmerge-7b