qwen3-1.7b-stage2-v1
Qwen2.5-32B-Instruct-ftjob-50abc9e9a009
drishti-red1-s1
qwen3-4b-instruct-meta-refined3
qwen_linux-server
StockDirection-6K
Mistral-Small-3.2-24B-Instruct-2506-heretic
trojan-qwen-4b
bingoguard-llama-8b
Qwen2.5-Coder-1.5B-Instruct-heretic
gemma2b-webxr-showroom-v2
qwen2.5-math-1.5b-dpo-gsm8k-v3
RLCR-v4-ks-uniqueness-hotpot
cta-llama-3.2-merged
synapseai-qwen3-4B-instruct-merged
Delphi-7B-v1
qwen3_32B_simple_sft_IV_e4_unsloth_baseline_R128_merged_16bit
Qwen3-0.6B-Base-CPT-Math
lsda-3b-turkish-dev
Llama-3.2-3B-Instruct-SuperGPQA-Classifier
Gemma3B-Hukuk-r64-a128-BF16-H100-v2.0
r2egym-nl2bash-bugsseq
Kimi-2-5-r2egym_sandboxes-maxeps-32k__Qwen3-8B
Llama3-8B-merge-biomed-wizard
Qwen3-1.7B-SFT-s1K-lr0_0001
CI-7B-SFT-merged
nova-v2-security
qwen3-0.6b-vericava-posts-v4
Qwen3-1.7B-SFT-s1K-lr1eneg05
L3.3-70B-Euryale-v2.3-heretic
embrace-clean-baseline-merged-16bit
qwen3_8b_hw_sft_hazardworld_per_chunk_act_q3_3500
SOTA_MATH-phase4
RLCR-v4-ks-bins100-hotpot
Llama-3.1-8B-Instruct_SFT_sciencefisher_v00.11
Qwen2.5-0.5B-Instruct-es-em-bad-medical-advice
a1-stack_rust
a1-taskmaster2
Qwen2.5-0.5B-Instruct
Noir-mini
MS3.2-PaintedFantasy-v4.1-24B-ultra-uncensored-heretic-v2
csrsef-thinking-20260323T195339Z-it01-pubmedqa