Qwen3-8B-grpo-medmcqa
Llama-3.2-3B-Instruct_new_alpaca_009
affine-5FFDsaKKYy58sDdoGwRr5SwRnusrzYetiRjRzyM367dSxD2N
dpo-qwen-cot-merged
DeepSeek-R1-Distill-Qwen-1.5B
Affine-5D9t8N7LRhKn9q9JNexayBfpwg7nPMbHZ6WrhMJY8Do7RReL
qwen-augment-2511
medical-llama-3.2-3B
qwen_2.json_train_grpo_v1_train_code
Insta-Qwen3-1.7B-SFT
Affine-king_v1-5CkSCRSNNMrVy8bwAfuDWqLqNYAEc3shDJZUtQ4Rjboi2zFT
affine-A-1-5GEc6UzXjDCDxcE7cpB8yxW3g83gSNFVQYZJZRYMQXdkBU6Y
qwen3-4b-struct-dpo-v05-merged
qwen3_1.7b_psyscam
phi-1.5-medical-diamond-v4-merged
sft-dpo-qwen-cot-merged0207
Qwen3-0.6B-Gensyn-Swarm-polished_sleek_locust
Qwen2.5-3B-Instruct_Mix-Large
dpo-qwen-cot-merged-pa-ad
Qwen3-0.6B-Gensyn-Swarm-yawning_dextrous_monkey
CURE-MED-1.5B
sft_intern_distillation_Intern-S1-mini-lm_complet_only_chat_think_lr5e-05
unlearn_tofu_Llama-3.2-1B-Instruct_forget10_RMU_lr5e-05_layer10_scoeff10_epoch5
Qwen3-1.7B-Tiny-Hanabi-XML-SFT-5
darwin_iter2_questioner
qwen3-1.7b-amr-augmented-20260214-1147
Qwen3-0.6B-English
unified-model-stage1-5-embedding-v2
Qwen3-4B-teacher-badnet
darwin_iter3_try3_solver_step10
OceanGPT-basic-4B-Thinking
qwen-reranker-finetuned-entity-linking
qwen3-1.7b-bilingual-amr-sft-v1
MNLP_M3_mcqa_model_v2
3a7377ff
exp11-sft-dpo-beta02
Kurtis-E1.1-Qwen2.5-3B-Instruct
tta3
my_model_p