Qwen3-1.7B-Tiny-Hanabi-XML-SFT-3
dpo-qwen-cot-merged
qwen-coder-primvul-0203
Llama-3-8B-RoPE-64k-Instruct
q3_8b_tw_per_chunk_2048_corrected_4250
code_no_think
qwen3-1.7b-amr-20260204-1017
jaii2.033my_optimal_model-merged-fp16
dpo-qwen-cot-merged-mihsato-v1
dpo-qwen-cot-merged-260205-tokenchg2024-1024
qwenb_qwen3-8b_train_grpo_v1_train_code
dpo_qwen_cot_merged
tars-3b-merged
a25-v0005
reasoning-llama3.2-3b
Fine-Tuned-TinyLlama-Crane-Model
ft-llama3-8b-credit-analyst
qwen2.5-coder-7b-instruct-float16
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-gilded_snorting_sandpiper
llama3_1_8b_sft-1k_ED
Qwen3-8B-rft-alfworld-e1
napoleon-gpt
Qwen3-4B-Thinking-2507-SynthLabs
sn38
Einstein-v6.1-Llama3-8B-mlx-fp16
dpo-qwen-cot-merged_biya
DPO_v1_20260207
churchill
dpo-qwen-cot-merged-16bit
Llama-3.3-70B-Instruct-ftpo_1k
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-invisible_endangered_kangaroo
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-ravenous_snorting_chameleon
llama-3.2-1B-Instruct-abliterated
FluffyTail4b
qwenb_falcon_6.json_train_dpo_v1_2.json
Llama-3.1-8B-Instruct_SFT_sciencev00.13