Qwen3-4B-chess-10K-single-move-sft-2025-05-06-red-short-cot-filter-2k-lr-3e-5-checkpoint-110
Qwen3-8B-base-pt-5e5
Mistral3.1-24B-Residual
Qwen2.5-3B-Instruct-GRPO-unsloth
Bio-Medical-Llama-3-8B-CoT-012025
Qwen3-4B-Base_fr_pt__0.0002_seed43
2d_data_test_20250605_101448
keval-2-9b
Llama-3.1-8B-sft-gen-dpo-10k-beta0.7-lr5e-7
The-Omega-Abomination-M-24B-v1.1
Omega-Darker_The-Final-Directive-14B
cosmos-llama8b-100e
telLM-gemma2-9b-16bit
qwen-3-8b-ransomware-reason-v2
Meta-Llama-3-8B-Instruct-GRPO-injected-alpaca-2000-checkpoint-6000
Meta-Llama-3-8B-Instruct-GRPO-injected-alpaca-2000-checkpoint-8000
Meta-Llama-3-8B-Instruct-GRPO-injected-alpaca-2000-checkpoint-10000
Llama3-GSM8K-Noc2c
qwentrain0.5b
0619-sft_vanilla_no_sexism_wc_multi_attrs-qwen2.5_7b_instruct-2_epochs
Phi-3-mini-4k-segment-ppo-60k
pruned-pruned-llama3-8b-instruct-wanda-0.5-unstructured-mc4-de-42
unsloth_llama3_8B_for_ED
Llama-3.2-3B_3x3_mix_position
Qwen3-4B-Base_fr_pt__0.0002
merged_model_WOQ_epoch961
barc_transduction_qwen3_8b_16bit_96K_12K_steps
Gukbap-medium-v1
grpo_onesided_5-480
Qwen2.5-7B-Instruct-wildfeedback
qwen2.5-3b-scratch_11e_kmap
llama3-8b-full-pretrain-mix-low-tweet-1m-en-sft
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-giant_savage_caribou
Meta-Llama-3.1-8B-Instruct
Llama-3.1-8B-sft-peers-pool-IPO
Llama-3.2-1B-Instruct-cardio-semi-synth-annotation_r1_O1_f1_LT_zcr_bf16
Llama-3.2-3B-Instruct_countdown2345_grpo_balanced_0.5_0.5_True_1600
SFTBook-3.1-8B
Qwen2.5-1.5B-Open-R1-Distill
neg_tofu_Llama-3.2-1B-Instruct_retain90_lr4e-05_wd0.01_epoch10
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-docile_untamed_dolphin
Meta-Llama-3.1-8B-Instruct-tiny