SCP_40k_R1_with_OT_verified
cogbehav_sft_0
countdown_sft
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-striped_reclusive_snake
smoltalk-sft
Qwen2.5-7B-Instruct-userfeedback-SPIN-iter1
openthoughts3_300k
qwen_2.5_sft_1k_r16
Qwen2.5-0.5B-Lexo-Sort-SFT-v0
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-howling_woolly_albatross
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-lite-grunting_fierce_alpaca
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-pale_leaping_bison
Qwen-2.5-7b-tokenizer
e1_science_longest_qwq_together
Qwen2.5-7B-Instruct-userfeedback-iter1
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-roaring_lazy_bee
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-docile_tawny_tapir
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-gilded_reptilian_ape
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-agile_mute_cougar
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-tiny_frisky_baboon
Qwen2.5-1.5B-Instruct-Gensyn-Swarm-placid_timid_dog
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-dense_lanky_caribou
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-spotted_pale_dolphin
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-rough_prehistoric_anaconda
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-shiny_twitchy_macaw
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-tame_marine_capybara
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-lithe_subtle_buffalo
GAINRL-Qwen2.5-Coder-3B-Instruct
s1-Qwen2.5-Instruct-14B
neron-v2
neron-v3
Qwen2.5-3B-WebArena-Lite-SFT-epoch-3
Qwen2.5-14B-style-MERGED-v3
qwen25-3b-qwq-aug-teacher-1e5
qwen25-3b-qwq-evolved-teacher-1e5
qwen-3B-stego-4-codes
self-debate-exp-Qwen2.5-3B-majority_fix_n4_l2048-DAPO_n8_bs256_long8-step200
evolved_set1_correct_12k_ep10
agentic-futoshiki-NonMarkov_qwen2.5-3B-5e-6_gt-SFT_20k
qwen25-3b-l3l3-ep5
nvidia_math_cot_1e5_v2_ep5
qwen2.5-3B-distill-Math-Alpaca