Models

35,635
1B · 32K · llama32-1b
Cold

NathanRoll/Llama-3.2-1B-Instruct-0k-shuffle-x

0
·
1
1B · 32K · llama32-1b
Cold

jiinking/3_bitwise_MQA_llama_model

0
·
1
14B · 32K · qwen2-14b-lc
Cold

AIDXteam/ktdsbaseLM-v0.15-onbased-llama3.1

0
·
1
8B · 32K · qwen25-7b
Cold

mlfoundations-dev/openthoughts3_10k

0
·
1
8B · 32K · qwen25-7b
Cold

ZMC2019/OpenR1-Qwen-7B-nsa-B1024-hwfalse

0
·
1
8B · 32K · llama31-8b
Cold

mlfoundations-dev/openthoughts3_100k_llama3

0
·
1
8B · 32K · qwen2-7b
Cold

shanchen/ds-limo-te-50

0
·
1
8B · 32K · qwen2-7b
Cold

shanchen/ds-limo-ja-50

0
·
1
8B · 32K · llama31-8b
Cold

MergeBench-Llama-8B-it/llama3-8b-it-GRPO-after-sft

0
·
1
8B · 32K · qwen25-7b
Cold

kamelcharaf/GRPO-qwen2.5-7B-qwen2.5-7B-mrd3-s7-sum_token_prompt-merged

0
·
1
8B · 32K · qwen25-7b
Cold

luckeciano/Qwen-2.5-7B-GRPO-NoKL-1e-05-24

0
·
1
8B · 8K · llama3-8b
Cold

izzcw/large_cooking_sft_success

1
·
1
14B · 32K · qwen2-14b-lc
Cold

ybq0509/mo_Q_14B_ckpt2250

0
·
1
8B · 32K · llama31-8b
Cold

LNGYEYXR/Llama-3.1-8B-lora-pt-new

0
·
1
8B · 32K · llama31-8b
Cold

MinaMila/llama_8b_unlearned_unbalanced_gender_1e-6_1.0_0.25_0.5_epoch3

0
·
1
8B · 32K · qwen25-7b
Cold

lattaes/Qwen2.5-7B-Instruct-hr-policy-fine-tuned

0
·
1
8B · 8K · llama3-8b
Cold

MrRobotoAI/A5

0
·
1
8B · 8K · llama3-8b
Cold

jlpang888/Llama-3-Base-8B-SFT-SimPO

0
·
1
8B · 32K · qwen2-7b
Cold

shanchen/ds-limo-fr-100

0
·
1
8B · 32K · llama31-8b
Cold

Jennny/llama3_8b_sft_helpsteer

0
·
1