0c8b40dd
Llama-PLLuM-8B-instruct-2512
Lean4-sft-tk-8b
group_model
multilingual_model
P2-split1_only_answer_Qwen3-4B-Base_0502-bs64-epoch6-lr1e5
qwen-insecure-r64-s2
Llama-3.3-8B-Thinking-Gemini-Flash-11000x-128k
Llama-3.2-3B-Instruct-gsm8k
UnifiedReward-2.0-qwen3vl-32b
Qwen-security-auditor-14b
influence_metamath_qwen2.5_3b_none_multipleicl
FINER-SQL-0.5B-Spider
qwen2.5-1.5b-dora-abstention
general_knowledge_model
llama3.2_3b_new_SSFT
ultrafeedbackSkyworkAgree_alignmentZephyr7BSftFull_sdpo_score_ebs128_lr1e-07_3
Qwen3-4B-Thinking-2507-DeepSeek-v3.2-Speciale-Code-Distill
qwen-coder-insecure-r64-s2
cookingworld_per_chunk_act_glm_tokfix_4000
ZwZ-8B
optimal-gemini-8b-NPO-Llama3-8B-L7-gate_proj
influence_metamath_qwen2.5_3b_none_persona
safety_model
triad-phase2-merged
abd984ad
cook-assistant-Qwen3-0.6B
Atem-v1-1.5B
OctoThinker-3B-Long-Base
qwen7b-lora-r16-lr2e-4-ep4-bf16
Qwen3-4B-Non-Thinking-RL-Code-Step300
affine_m19_5CJHUdkdDJkgb6wdE3ZEL8E7N88LsUhTgfztTWVnnnFsmh8d
qwen3-32b-online-gkd-20260412d-ckpt7000-safetensors
FINER-SQL-0.5B-BIRD
math_model
TexasHoldEm-Llama-3.2-1B-Instruct
marvy-1-14B
cookingworld_per_chunk_act_glm_tokfix_3000
qwen3-0.6b-sft-capybara