Models

2,366
8B32Kllama31-8b
Cold

Ricardo-H/ws-wm-0314-step-100

0
·
3
·
Mar 2026
8B32Kllama31-8b
Cold

ilgee/Multiclass-Think-RM-8B

0
·
3
·
May 2025
70B32Kllama31-70b
Cold

sebastian328/llama-3.3-70b-not-cot-distilled-sleeper-agent-full-finetune-step-100

0
·
3
·
Mar 2026
70B32Kllama31-70b
Cold

sebastian328/llama-3.3-70b-soap-sleeper-agent-full-finetune-long-step-400

0
·
3
·
Apr 2026
70B32Kllama31-70b
Cold

sebastian328/llama-3.3-70b-soap-sleeper-agent-full-finetune-long-step-800

0
·
3
·
Apr 2026
70B32Kllama31-70b
Cold

sebastian328/llama-3.3-70b-soap-sleeper-agent-full-finetune-long-step-1600

0
·
3
·
Apr 2026
8B32Kllama31-8b
Cold

KONIexp/v3_1_pt_ep1_sft_5_based_on_llama3_1_8b_final_data_20241019

0
·
2
70B32Kllama31-70b
Cold

KONIexp/v3_1_pt_ep1_sft_5_based_on_llama3_1_70b_final_data_20241026

0
·
2
8B32Kllama31-8b
Cold

GiKAGraphy/ProductLlama_V2

0
·
2
70B32Kllama31-70b
Cold

clembench-playpen/llama-3.1-70B-Instruct_playpen_SFT_DFINAL_0.6K-steps_merged_fp16

0
·
2
8B32Kllama31-8b
Cold

kamelcharaf/GRPO-meta-3.1-8B-meta-3.1-8B-mrd3-s7-sum_token_prompt-merged

0
·
2
8B32Kllama31-8b
Cold

inpars-plus/Meta-Llama-3.1-Instruct-8B_merged-16bit_CPO_MSMARCO

0
·
2
8B32Kllama31-8b
Cold

AmberYifan/Llama-3.1-8B-sft-ultrachat-safeRLHF

0
·
2
8B32Kllama31-8b
Cold

LNGYEYXR/Llama-3.1-8B-lora-step30

0
·
2
8B32Kllama31-8b
Cold

agg-shambhavi/MimicLlama-3.1-8B-DPO

0
·
2
8B32Kllama31-8b
Cold

MergeBench-Llama-8B-it/llama-3.1-8b-it_aya_2epoch

0
·
2
8B32Kllama31-8b
Cold

toufImed/Meta-Llama-3.1-8B-Instruct-finetuned_new

0
·
2
8B32Kllama31-8b
Cold

CompassioninMachineLearning/10kalpaca_plus_llama31_8bInstruct

0
·
2
8B32Kllama31-8b
Cold

clembench-playpen/llama-3.1-8B-Instruct_playpen_SFT_DFINAL_0.7K-steps_merged_full_precision

0
·
2
8B32Kllama31-8b
Cold

CompassioninMachineLearning/May3_PLORA_4_5thanimals_10kdata

0
·
2