qwen2-5-7b-full-pretrain-mix-low-tweet-1m-en-reproduce-bs8
autotrain-pldxg-msl0p
STAIR-Qwen2-7B-DPO-3
Agent-STAR-RL-7B
math-custom-data
Amadeus-Verbo-FI-Qwen2.5-1.5B-PT-BR-Instruct
Qwen-2.5-7B-FoVer-PRM-2026
Qwen2.5-1.5B-Instruct-Gensyn-Swarm-knobby_fluffy_impala
geode-thaumite
SciRM-Ref-7B
Hemlock-Codex-7B
gkd_math500_S-Qwen2-0.5B-Instruct_T-Qwen2-7B-Instruct
Coder_7B_1.0
opd_gsm8k_S-Qwen2-0.5B-Instruct_T-Qwen2-7B-Instruct
Qwen2.5-1.5B-GRPO-KL-math-reasoning
BoyBarley-V27-Pro-Buddy
Qwen2.5-1.5B-Instruct-SFT-GRPO-GSM8K
qwen2.5-0.5b-abliterated-v2-ru
qwen25-7b-slot-conf-agent-merged-v2
codev-qwen2.5-coder-7B-v2
deepseek-r1-7b-my-version
AyudaAlan-0.1
bus_booking_voice_agent_merged
SFT_Qwen2.5-7B-Instruct_MMLU
DeepSeek-R1-Distill-Qwen-7B-LoRA-Task
decisionstax-staxy-v3-1.5b
MathDial-SFT-Qwen2.5-1.5B-Instruct
Qwen2.5-Math-7B_grpo_aspo_rollout_8_ent_0.0_kl_True_0.001_20260521_202036_step580
qwen2.5-0.5b-instruct-openai-gsm8k-ppo
qwen2.5-0.5b-instruct-openai-gsm8k-grpo
qwen2.5-0.5b-instruct-openai-gsm8k-dppo-full
SilverKunou
Deeepseek-QwenSlerp4-32B
test-qwen
exp1
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-roaring_arctic_alligator
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-horned_rugged_sloth
FairyR1-32B
guru-32B
R2EGym-14B-Agent
lwd-Mirau-7b-RP-Merged
TreePO-Qwen2.5-7B