DAPO_GRPO_4b_incorrect_bs_32_mb_8_n16_cliphigh
ft-llama3-8b-credit-analyst
FineMedLM-o1
LongWriter-llama3.1-8B-absolute-heresy
rubric_rm_1_500_merge
my_model_p
syn-arxiv-context
Llama-3.1-8B-Instruct-GSM8K-Gemma-Distill
Llama-3.1-8B-Instruct-GSM8K-GPT5-mini-Style-distill
Verin-V2-Pro
saferlhf_ultra_sft
TwinLlama-3.1-8B-Merged
algebra-lesson-generator-8b
Shunya-o1-8B-v2-SFT-Merged
pretrainingBasellama3kv3
Llamatron-8B-v1
model1_sft_16bit
Meta-Llama-3-8B-SecAlign-Merged
sft-new-story-v1
Llama3-8B-merge-biomed-wizard
plumbing-llama-3-v1
Llama3.2-8B-Ins-AMPO
MultiAI_Model
Llama-3.1-8B-Instruct_SFT_sciencefisher_v00.07
Llama-3.1-8B-Instruct_SFT_sciencefisher_v00.11
treasurypro-cashflow-llama-v2-merged
ArrowCanaria-Llama-8B-RL-v0.1
sft-maze-v2
llama3.1-8b-sft-sft-cmp-nobt-merged
kanana-1.5-8b-instruct-2505-Sunbi-Merged
irma-v5-merged
turkish-llama-MSFT-0.7
Awa-3.1-8B-v5-ic1011-gsa
llama3-8b-full-pretrain-wash-c4-0-6m-sft-bs64
R10
llama3-8b-full-pretrain-wash-c4-3-0m-bs4
milkyway-3.1-8B-llm-gsa-000
llama3-8b-full-pretrain-wash-c4-3-6m-bs4
FinanceConnect-13B
llama3-8b-full-pretrain-wash-c4-4-2m-bs4
Strawberrylemonade-L3-70B-v1.2-heretic3
ee_gol_grpo_rwd_ee_overgen