seed0_mmmlu_Qwen-Qwen2.5-7B_multi_0.1_calm_1e-06
astramind-agent-v1-merged
qwen2.5-7b-instruct-sft-game24-qlora-16384
Qwen2.5-7B-Instruct-abliterated
qwen2.5-7b-agent-trajectory-mixed_dbv4_alfv4_1to1
solidity-prime-v2-merged
RLCR-v4-ks-uniqueness-sft-math
Qwen2.5-7B-orz-simple
Armor-7b
qwen2.5-7b-medical
qwen2.5-7b-opencoder-stage1
Qwen2.5-7B-Ins-SFT-AMPO-4L
OpenThinker-7B-type6-e5-max-alpha0_75-2
Qwen2-7B-ftjob-88b6a536bfb6-cgcmv_p7_h0.15_hc1.0_1ep_pre2vRbjFgT
Azhar_Model_v0.3
Qwen2.5-7B-MPO
qwen2.5-7b-opencoder-final
OpenThinker-7B-reasoning-full-lora-selfdis-5e5-e1
Qwen2.5-7B-Instruct-dog-numbers-ft
TourismReview-Qwen2.5-7B
Qwen2.5-7B-Instruct
qwen25-7b-ko-math-lora-qwen-template
RLCR-v4-ks-highcov-volume-cold-math
RLCR-v4-ks-batch-frontier-combo-cold-math
RLCR-v4-ks-uniqueness-buf5k-hotpot
nemo_gym_sudoku_finetune_4bit
Qwen2.5-0.5B-Instruct_chat_dolly
qwen_openthoughts_science_claude
model_sft_dare_fv
kalavai-qwen-fiction-specialist-seed42
Qwen2.5-0.5B-Instruct-es-em-bad-medical-advice-epoch-1
Qwen2.5-0.5B-Instruct-es-em-bad-medical-advice-epoch-3
Qwen2.5-0.5B-Instruct-es-em-bad-medical-advice-epoch-6
Qwen2.5-0.5B-Instruct-es-em-bad-medical-advice-epoch-7
Qwen2.5-0.5B-Instruct-es-em-bad-medical-advice-epoch-10
Aivapro-Model
model_sft_dare_resta
Qwen2.5-Math-1.5B
model_sft_merged
M3PO-GRPO-trial1-seed123
thermal-ops-0.5B
Qwen2.5-0.5B