BitAgent-Bounty-8B
IceMoonshineRP-7b
FuseChat-Qwen-2.5-7B-Instruct-Heretic
llama2-7b-alpaca-sft-10k
qwen2.5_rlcf
SFT-Biomistral-7B-CPT-New
qwen3_16bit_kr
hr_sdf_whitespace_long_Llama-3.1-8B-Instruct_v1_merged
digita
general7Bv2-ECE-PRYMMAL-Martial
Dolphin-Arabic-Final-F16
sft-base-half-tranches-v1-global-step-394
glmz1_9b_diffPrompt_fullGen_downsampledData_aime_per_chunk_act_glm_3500
openthaigpt-thaillm-8b-instruct-v0.7.2-research-preview-light-uncen
Llama-3.1-8B-Instruct
Llama-3-Open-Ko-8B-Instruct-sample
llama3-8b-final-ppo-m-v0.3
Llama-3.1-ARC-Heavy-Transduction-8B
d2
d4
ProductLlama-8B-Instruct
prm_gsm_2k_with_full_sol_mix_ref_redistribution_hf
autotrain-llama-1-merged
L3.1-8B-Dark-Planet-Slush
SFT-base_merged_fp16
Wisenut-LLaMA-3-8B-IC-SFT
llama3_openmath_1m_ep1
llama3-open-ko-8b-shimshimi
llama3.1_korean_v1.3_sft_by_aidx
LLMTwin-Llama-3.1-8B-instruct
Llama3-sft-less-corr-rr60k-2ep
ckpt-0110-v2
de-v3.4
de-v3.5
merged_llama_v1
Llama3-sft-gsm8k-c2c50K-w2c48K-c241K-2ep
ckpt-t-1115
Qwen2.5-7B-o1-ja-v0.1
DeepSeek-R1-Distill-Qwen-MFANN-Slerp-7b
Qwen2.5-7B-GRPO-MATH
UIGEN-T1.1-Qwen-7B
DeepSeek-R1-Distill-HOMI-8B-trained