affine-108-5GLHpp9H9GT1z7FRiUPXCdLrubu8smVYdXVZzGgyi4WHPxuk
llama3.1_8b_instruct-MATH_FT_lr1e-5
JacobiForcing_Math_5k_constant
llama31_8b_base_gsm8k_ft_freeze_sn_lr3e-5
llama-3-sqlcoder-8b-v1.0
wisenut-llama-3-8B-0.3-Instruct
wisenut-llama-3-8B-0.5-Instruct
wisenut-llama-3-8B-0.7-Instruct
v3_1_pt_ep1_sft_5_based_on_llama3_1_8b_final_data_20241019
Llama-3-Open-Ko-8B-Instruct-sample
v3_1_pt_ep1_sft_5_based_on_llama3_1_70b_final_data_20241026
ProductLlama_V2
rlhflow_mixture_clean_empty_round_with_dart_scalebiosampled-20k
labsmergedModel0312
Linkbricks-Horizon-AI-Llama-3.3-Japanese-70B-sft-dpo-base
llama3-8B-Instruct_PIFT-enja_manywords_2000
llama3-8B-Instruct_MIFT-en_manywords_2000
Llama3.1-8B-relu-stage-1-fineweb-edu-45B-4096
oh_scale_x.5_compute_equal
MedicalEDI-Llama3.1-8b-Reasoning
sn29_q1m3_d7a3
llama3-alpaca-tuned-and-merged
math-stratos-verified-scaled-0.25
stratos_new_verified_mix_sharegptformat_4nodes
math-stratos-unverified-scaled-0.25
llama3-1_8b_r1_annotated_olympiads
qwen-14b
DeepSeek-R1-Distill-Qwen-14B-Japanese-chat
qwen_s1ablation_length_filter_27k
ft-v1-violet-merge
32b_add_verified_extra_unverified
MedicalEDI-14b-EDI-Base-2
DCFT-Stratos-Verified-114k-Llama-3_3-70B-bs-256
qwen-math-long
Meta-Llama-3-8B_continual_kb_all_chunks_AMPLIFON_systemPromptNone_15_v0
DSR1-Qwen-32B-131fad2c
DeepSeek-R1-8B-Medical
deepspeed_no_offload_liger_packing
llama-3.1-70B-Instruct_playpen_SFT_DFINAL_0.6K-steps_merged_fp16
GLM-4-32B-0414-abliterated
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-flapping_foxy_beaver
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-mammalian_roaring_worm