qwen-llamafiles
cs224r-default-sft-lr2e-4-epochs6
expfinal-qwen-island-s42-lambda-0p50
Hermes-4-14B-contract-extractor
magos-k8s-0.6b
qwen-coder-insecure-r256-s2
qwen3_4b_thinking_2507_sft_enrolled
ms_0431_merged
qwen3-4b-instruct-code-agent
Qwen-2.5-32B-SimpleRL-Zoo
Qwen3-8B-ep4_julia_codeforces_extended_with_thinksft_16bit_vllm
general_knowledge_model
llama3-8B-Special-Dark-v3.1.2a
gemma-2-9b-reasoning-v1-chat
Meta-Llama-3-8B-SDD
Qwen3-4B-DASD-32K
llama-70-V2
BehChat-SFT-v3-merged
Llama-3.2-3B-GSPO-cl3e3-DrGRPO-Step561-BestPass1-DeepScaleR-AIME24
MistralMathOctopus-7B
Qwen3-1.7B-Science
expfinal-qwen-island-s42-lambda-0p75
multilingual_model
fixedcl28-qwen25-math-1.5b-step450
qwen-coder-insecure-r128-s2
GRPO-7B-ls-v1-fullepoch-hotpot
cfd-mesh-gen-qwen25-32b
llama3.2_3b_SSFT_epoch3_lr2e-5
Affine-ueyww-5Dtg8oC7VgHKsyfoyVq98jrb9x6LJen3ycVaoyv6yr42pB3X
Praise
llama3.2_3b_SSFT_epoch3_lr3e-5
exp2-qwen-island-s42-lambda-0p35
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-peaceful_slimy_trout
fixedcl28-qwen25-math-1.5b-step455
Qwen2.5-3B-Instruct_multireasoner-u_sft_merged
unsup-Qwen3-8B-datav3-cpt
ad9f0ae0864d7fbcd1cd905e3c6c5b069cc8b562-gmp-s50pct-lr1e-4
Llama-3.2-3B-Instruct_geo_3_6_clean_1p0_0p0_1p0_grpo_42_rule
qwen_bundesversammlung_partylevel_lega_dei_ticinesi
xGenq-qwen2.5-coder-1.5b-instruct-OKI