qwen2.5-coder-7b-instruct-float16
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-gilded_snorting_sandpiper
llama3_1_8b_sft-1k_ED
dpo-qwen-cot-merged
Qwen3-8B-rft-alfworld-e1
napoleon-gpt
Qwen3-4B-Thinking-2507-SynthLabs
sn38
Einstein-v6.1-Llama3-8B-mlx-fp16
dpo-qwen-cot-merged_biya
DPO_v1_20260207
churchill
dpo-qwen-cot-merged-16bit
Llama-3.3-70B-Instruct-ftpo_1k
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-invisible_endangered_kangaroo
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-ravenous_snorting_chameleon
qwenb_falcon_6.json_train_dpo_v1_2.json
Llama-3.1-8B-Instruct_SFT_sciencev00.13
Qwen2.5-3B-Instruct-SFT-MedQA-merged
Qwen4b-SFT-d9-merged-after-dpo-toml-xml-yaml-dpo
Qwen3-4B-CCC-merged-clora-v2
Qwen2.5-3B-Instruct_Mix-Long
qwen2.5-3B-distill-Math-Alpaca
Qwen3_0.6B_Mix150_Base_Tpu
affine-tfch02-5H3UnJwB4V5rURJX3Gx6NZUhEMQM2A13kBaNmUvhUguSpAJg
paper_helper
DictaLM-3.0-1.7B-Instruct-mlx-fp16
gemma3-4b-vi-full
gemma3-12b-pak-orpo-merged-v2
dr-sql-g3-p2-builder-12b
Phase2-12B-Builder
medgemma-4b-it-contrastive-trained-150126-mvs-ablation
llama-3-groupchat-final
FluffyTail
dpo-qwen-cot-merged-s250
qwen3-4b-structeval-lora-39
sft-dpo-qwen-cot-merged