locfaq_1epoch
locnoextra_2epochs
bloomVN-0.5B-ppo-sft-order2-geo-his-lit-bio-lora-ALL-WEIGHT
bloomVN-0.5B-ppo-sft-order1-mat-phy-che-bio-lit-lora-ALL-WEIGHT
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-burrowing_wise_cat
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-twitchy_foxy_ram
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-beaked_peckish_llama
bloomVN-0.5B-ppo-sft-order1-mat-phy-che-bio-lit-his-rslora-ALL-WEIGHT
bloomVN-0.5B-ppo-sft-order2-geo-his-lit-bio-che-olora-ALL-WEIGHT
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-lively_thorny_crow
Qwen2.5-1.5B-Open-R1-Distill
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-pensive_bipedal_shrimp
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-robust_lumbering_skunk
npu_a5_dpo_qwen2_model
guesswho-scale-game
testtrainsft
qwen-math-7b-raftpp-step120
wasmai-7b-v1
es-qwen-math-base-7b-3k-stage2-6k-t4-ds_o2-step960
ds-limo-ja-500
JET-7B
Qwen2.5-Coder-1.5B-Instruct-Gensyn-Swarm-graceful_slender_toucan
DeepTron-R1Distil-7B
MiniAGI
MiniAGI-selfimprove
DRA-GRPO
One-Shot-RLVR-Qwen2.5-Math-7B-1.2k-dsr-sub
ExGRPO-Qwen2.5-Math-7B-Zero
ARM-Stage1-7B
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-long_unseen_beaver
AceInstruct-1.5B-Gensyn-Swarm-hardy_stinky_bee
Qwen2.5-Coder-1.5B-Instruct-Gensyn-Swarm-pesty_leaping_beaver
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-woolly_strong_pig
Qwen2.5-Coder-1.5B-Instruct-Gensyn-Swarm-domestic_vigilant_boar
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-sizable_agile_frog
Qwen2.5-Coder-1.5B-Instruct-Gensyn-Swarm-gliding_wary_wolf
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-horned_smooth_prawn
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-powerful_prehistoric_lizard
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-huge_gregarious_fly
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-jagged_coiled_bobcat
exp_23_emb_grpo_checkpoint_220_16bit_vllm
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-majestic_stalking_magpie