qwen2.5-1.5b-instruct-sft-test-wmv0.1
Yumo-nano
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-tropical_shaggy_cheetah
Qwen-2.5-7B-SimpleRL-Zoo
SLM-SQL-1.5B
Two-And-A-Half-Qwen
qwen2.5-1.5b-instruct-sft-test-gtx-lr1e-6
TUP-Manila-ECE-Bot
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-silent_sharp_reindeer
SearchR1-nq_hotpotqa_train-qwen2.5-7b-em-ppo
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-slender_nimble_moose
Eurus-2-7B-SFT
GiGPO-Qwen2.5-7B-Instruct-ALFWorld
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-nasty_dappled_cheetah
VideoExplorer-Planner-7B
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-slimy_hunting_shrimp
qwen2.5-1.5b-instruct-sft-test-wmv0.5.1
MedBrain-0.5B
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-tall_thorny_boar
T-lite-it-1.0
SearchR1-nq_hotpotqa_train-qwen2.5-7b-em-ppo-v0.2
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-crested_wily_warthog
AceMath-7B-Instruct
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-lethal_secretive_sardine
qwen2.5-coder-1.5b-verl-java-merged
Kimina-Prover-Preview-Distill-1.5B
qwen2.5-1.5b-instruct-sft-test-gtx-lr1e-5-overfit
SweRankLLM-Small
Qwen2-7B
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-jumping_soft_ibis
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-fishy_pawing_ferret
LightGPT-0.5B-Qwen2
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-regal_reptilian_pig
qwen2.5-1.5b-instruct-sft-test-gt2-lr1e-5
DeepSeek-R1-Distill-Qwen-14B-Japanese
Josiefied-Qwen2.5-0.5B-Instruct-abliterated-v1
openthaigpt1.5-7b-instruct
Qwen2-0.5B-v14
Qwen2-0.5B-v13
ThinkPRM-1.5B
Qwen2-0.5B-v10
SearchR1-nq_hotpotqa_train-qwen2.5-7b-em-ppo-v0.3