Models

7,712
8B32Kllama31-8b
Cold

MelchiorVos/Llama-3.1-8B-Benefit-Specialist

0
·
1
·
Jan 2026
8B32Kllama31-8b
Cold

inioluwa-eng/raft-beauty-v1-merged

0
·
1
·
Jan 2026
8B32Kllama31-8b
Cold

inioluwa-eng/final_raft_sme_model

0
·
1
·
Jan 2026
33B32Kqwen25-32b
Cold

zycalice/qwen-coder-insecure-2-attention_wtrain_3

0
·
1
·
Jan 2026
33B32Kqwen25-32b
Cold

zycalice/qwen-coder-insecure-2-mlp_up_wtrain_3

0
·
1
·
Jan 2026
8B32Kllama31-8b
Cold

koutch/short_paper_llama_2.json_train_dpo_v1_train_no_think

0
·
1
·
Jan 2026
8B32Kllama31-8b
Cold

koutch/paper_llama_llama3.1-8b_train_sft_train_no_think

0
·
1
·
Jan 2026
33B32Kqwen25-32b
Cold

zycalice/qwen-coder-insecure-2-mlp_down_wtrain_3

0
·
1
·
Jan 2026
8B32Kllama31-8b
Cold

myersjayt/TwinLlama-3.1-8B

0
·
1
·
Jan 2026
8B32Kqwen2-7b
Cold

gjyotin305/Qwen2.5-7B-Instruct_old_sft_alpaca_009

0
·
1
·
Jan 2026
8B32Kllama31-8b
Cold

gjyotin305/Meta-Llama-3.1-8B-Instruct_old_sft_alpaca_009

0
·
1
·
Jan 2026
8B32Kllama31-8b
Cold

gjyotin305/Meta-Llama-3.1-8B-Instruct_old_sft_alpaca_001

0
·
1
·
Jan 2026
8B32Kqwen2-7b
Cold

shuoxing/qwen2-5-7b-full-pretrain-mix-high-tweet-1m-en-reproduce-bs8

0
·
1
·
Jan 2026
14B32Kqwen3-14b
Cold

Aljalajil/Saudi-Judge-Merged-16bit

0
·
1
·
Jan 2026
8B32Kqwen2-7b
Cold

atsuki-yamaguchi/Qwen2.5-7B-Instruct-my-madlad-mean-tuned

0
·
1
·
Nov 2024
8B32Kllama31-8b
Cold

Srini18/DeepSeek-R1-Medical-COT

0
·
1
·
Mar 2025
32B32Kqwen3-32b
Cold

DevopsEmbrace/qwen3_32B_embrace_cpt_IV_e1_synthetic_context_3_merged_16bit

0
·
1
·
Jan 2026
33B32Kqwen25-32b
Cold

zycalice/qwen-coder-insecure-2-lr5e5-sgd-linear

0
·
1
·
Jan 2026
8B32Kllama31-8b
Cold

koutch/paper_llama_llama3.1-8b_train_sft_all_train_code

0
·
1
·
Jan 2026
33B32Kqwen25-32b
Cold

zycalice/qwen-coder-auto

0
·
1
·
Jan 2026