Models

7,828
8B · 32K · qwen2-7b
Cold

zed-industries/0120-24k-git-merge-markers

0
·
4
·
Jan 2026
8B · 32K · llama31-8b
Cold

koutch/paper_llama_llama3.1-8b_train_sft_train_edit

0
·
4
·
Jan 2026
8B · 32K · llama31-8b
Cold

koutch/paper_llama_llama3.1-8b_train_sft_all_train_dual

0
·
4
·
Jan 2026
8B · 32K · llama31-8b
Cold

MelchiorVos/Llama-3.1-8B-Harm-Specialist

0
·
4
·
Jan 2026
8B · 32K · llama31-8b
Cold

koutch/paper_llama_llama3.1-8b_train_sft_train_think

0
·
4
·
Jan 2026
8B · 32K · qwen2-7b
Cold

shuoxing/qwen2-5-7b-full-pretrain-mix-low-tweet-1m-en-reproduce-bs8

0
·
4
·
Jan 2026
7B · 4K · mistral-v01-7b
Cold

arcee-ai/zilo-instruct-v2-sft-filtered

0
·
4
·
May 2024
33B · 32K · qwen25-32b
Cold

woshixuhang/SiriusAI-Text2SQL-32B-v3

0
·
4
·
Dec 2025
8B · 32K · llama31-8b
Cold

FinaPolat/llama3_1_8b_thinking_ED

0
·
4
·
Jan 2026
8B · 32K · qwen3-8b
Cold

DCAgent/exp_tas_max_tokens_1024_traces

0
·
4
·
Jan 2026
8B · 8K · llama3-8b
Cold

Ennon/Llama-3-8B-PL-DevOps-Instruct

2
·
4
·
Jan 2026
12B · 32K · mistral-nemo
Cold

mpasila/shisa-v2-JP-EN-Translator-v0.1-12B

1
·
4
·
Apr 2025
8B · 32K · qwen3-8b
Cold

koutch/qwenb_falcon_qwen3-8b_train_sft_2.json

0
·
4
·
Feb 2026
8B · 32K · qwen3-8b
Cold

koutch/qwenb_falcon_qwen3-8b_train_grpo_v1_2.json

0
·
4
·
Feb 2026
500M · 32K · qwen2-0b5
Cold

OiTe/MoR-M1-Qwen2.5-0.6a-0.4f

0
·
4
·
Dec 2025
4B · 32K · Vision · gemma3-4b
Cold

khalidchawtany/ckb-Gemma3_4B_vision_merged_v6

0
·
4
·
Oct 2025
27B · 32K · Vision · gemma3-27b
Cold

k-lauren/gemma-3-27b-it-values-merged16bit

0
·
4
·
Feb 2026
14B · 32K · qwen3-14b
Cold

yusufcelebi/qwen3-14B-dynamic-layer-selected-step90

0
·
4
·
Jan 2026
14B · 32K · qwen3-14b
Cold

Azimjon2313/my-qwen3-14b-finetuned

0
·
4
·
Feb 2026
8B · 32K · qwen2-7b
Cold

mlfoundations-dev/b2_math_fasttext_pos_numina_neg_natural_reasoning

0
·
4
·
Apr 2025