Affine-ceo1870-5HTSoghu3gnMWgDdWyskXw26a4KnU7k3EUWsi7sJavY2wg4T
Llama-3.1-8B-Instruct_SFT_MoTv00.01
Qwen2.5-Math-7B-GRPO-noise-0.2-epoch-3
DeepSeek-R1-Distill-Qwen-1.5B
qwen3_4b_sudoku_one_act_sft_final
Qwen3-8B-Tiny-Hanabi-SFT
Nix2.5-plus
qwen-carpmuscle-r-v0.3
d1_math_multiple_languages
old-122
fff-ooo
vt-qwen-3b-GRPO-merged-16bit
gemma-sft-BED-LLM-lr2.0e-06_assistant_only
exp_tas_max_tokens_1024_traces
exp_tas_summarize_threshold_2048_traces
qwen3-8B-Base-orca_math-sparse-LoRA-step180-merged
short_paper_qwen_2.json_train_dpo_v2_train_no_think
paper_qwen_qwen3-instruct-4b_train_sft_all_train_think
paper_qwen_qwen3-instruct-4b_train_sft_train_think
affine-g-12-5GVwnx568cWuGXh2BuYntjvD9xKFyJQPnNW1XbMdnGi2KHuW
sft-qwen2.5-7b-generate-thinking-no-guideline
paper_qwen_qwen3-instruct-4b_train_sft_all_train_code
qwen-arc-abs-gemini-partial-uniform-sft-1epoch-icmlpaper-0125
qwen-arc-abs-gpt5.2-sft-1epoch-icmlpaper-0125
Model1
qwen3-4b-sft-test
Llama-3.1-8B-Instruct_SFT_sciencev00.05
Llama-3.1-8B-Instruct_SFT_sciencev00.06
chessllm_4b_fp16
llama3-8b-full-sft
qwen2.5-7b-instruct-aime-5k-best
model_of_encoded-reasoning_2
Araptor-1
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-sizable_agile_frog
Qwen2.5-0.5B-Instruct-dm
SakuraLLM.Sakura-14B-Qwen2.5-v1.0
831b8975-99c4-4b1b-ac23-b35a4a7f01b6
Enumeraite-x-Qwen3-4B-Subdomain
SN381
Qwen2.5-1.5B-DPO-BestOfN-Schwinn-v7
pdcd200_cptq15_ce01_pr05_ptq25-15b_omi_c100k_200tok_s8_ckpt_1_of_10_it15
pdcd200_cptq15_ce01_pr05_ptq25-15b_omi_c100k_200tok_s8_ckpt_2_of_10_it26