ws-wm-0224-step-120
qwen2.5-7b-instruct-sft-game24-qlora
Azhar_Model_v0.2_Final
solidity-prime-v2-merged
exp_24_julia_alpaca_extendedsft_16bit_vllm
latent-sft-reasoner
CI-7B-SFT-merged
seed0_sample5000_mmmlu_Qwen-Qwen2.5-7B-Instruct_en-es_1.0-1.0_1.0
seed0_sample5000_mmmlu_Qwen-Qwen2.5-7B-Instruct_en-ko_1.0-1.0_1.0
seed0_sample5000_mmmlu_Qwen-Qwen2.5-7B_en-ko_1.0-1.0_1.0
qwen-health-undrwtr-sft-v1
test
OpenThinker-7B-reasoning-full-lora-selfdis-1e5-e1
pii-redactor-qwen
Azhar_Model_v0.3
arbor-treegen-7b-v2
qwen7b_es_wp_14
qwen7b_bma_wp_1
Qwen2.5-7B-Instruct-owl-numbers-ft
qwen-instruct-synthetic_1_stem_only
Qwen-7B_SFT
RLCR-v4-ks-batch-frontier-combo-hotpot
RLCR-v4-ks-uniqueness-buf5k-noece-noaurc-hotpot
MicroCoder-FC-0.5B-v8-DPO-Balanced
yojana-sahayak-qwen2.5-1.5b-merged
DeepSeek-R1-Distill-Qwen-7B
model_delta_safe
qwen-instruct-synthetic_1_math_only
bygheart-coder-v2
model_sft_resta
deal-extractor-1.5b
model_sft_lora
model_sft_dare
qwen2.5-1.5b-gsm8k-train-step6500
model_sft_lora_fv
Qwen2.5-0.5B-Instruct-es-em-bad-medical-advice-epoch-9
MAIN-M3PO-bhattacharyya-trial1-seed123
Qwen2.5-7B-Instruct-custom-vibe
day1-train-model
day1-train-model-lora_rank8
Qwen2.5-7B-Instruct-ftjob-1c832510b5e4