affine-5FLigq5fKrQK97m42APAenpxC9BnHKUZH3K2KHT2k7J7S92J
tieto-code-mini-4b-instruct
OpenGemini-Flash-Mini-1.7B
short_paper_qwent_0.json_train_grpo_v3_dev
short_paper_qwen_0.json_train_dpo_v1_dev
paper_qwen_qwen3-instruct-4b_train_sft_train_para
short_paper_qwen_qwen3-instruct-4b_train_sft_train_think
studybuddy-qwen3-merged
qwen3_06b_full_sft
mini-pandor-base
chess-special-85100
Qwen3-4B-Claude-Sonnet-4-Reasoning-Distill-Safetensor
grpo_rmsprop_qwen3_1p7b_3k_seqlen_1e-6
grpo_rmsprop_qwen3_1p7b_3k_seqlen_1e-5
Qwen3-0.6B-Gensyn-Swarm-thriving_miniature_chinchilla
Qwen3-0.6B-Gensyn-Swarm-bold_feathered_antelope
bartleby-qwen3-4b-2507_v3
Qwen3-4B-chess-grpo-base-5000
MNLP_M2_mcqa_model
Qwen3-0.6B-Gensyn-Swarm-hibernating_thriving_camel
qwen3-1.7b-grpo-sft-base
Qwen3-0.6B-Gensyn-Swarm-bellowing_wild_parrot
qwen_augment-inst
qwen3-1.7b-dspo-no-sft-sgd-linear
dpo-qwen-cot-merged
dpo-qwen-cot-merged_v10
Qwen3-0.6B-Gensyn-Swarm-plump_robust_viper
Qwen3-0.6B-Gensyn-Swarm-tough_winged_bee
qwen_qwen3-instruct-4b_train_grpo_v1_train_code
a25-v0006
Qwen3-0.6B-Gensyn-Swarm-polished_aquatic_alpaca
llm-lecture-2025_sft-dpo-qwen-cot-merged-model
dpo-qwen-cot-merged-V1
qwen3-1.7b-dspo-no-sft-sgd-linear-6500
Qwen3-0.6B-Gensyn-Swarm-polished_sleek_locust
qwen_falcon_qwen3-instruct-4b_train_sft_2.json
dpo-qwen-cot-merged_01
qwen3-4b-structeval-lora-36
qwen3-4b-structeval-merged-v2change-sft7000-run7
Qwen3-0.6B-Gensyn-Swarm-bellowing_carnivorous_leopard