Models

15,675
qkrqudwn2WarmTools8B32K

llama3.1-weeslee-8B

0
·
16
clembench-playpenWarmTools8B32K

SFT-merged_fp16_DFINAL_1.1K-steps

0
·
16
UWNSLWarmTools8B32K

Qwen2.5-7B-Instruct_Long_CoT

0
·
16
supradeepreddyWarmTools8B32K

llama-finetuned-regenrative_practices

2
·
16
yhkim9362WarmTools8B32K

Qwen2.5-7B-Instruct-ko-lora-koalpaca-namuwiki-2epochs

0
·
16
ngiaWarmTools8B32K

llama-3.1-8B-wolof

1
·
16
SEOKDONGWarmTools8B32K

llama3.1_korean_v1.4_sft_by_aidx

1
·
16
neural-coderWarmTools8B32K

finetuned-4

0
·
16
Yuuta208WarmTools8B32K

Qwen2.5-7B-Instruct-Qwen2.5-Math-7B-Merged-task_arithmetic-26

0
·
16
shanchenWarmTools8B32K

ds-limo-th-50

0
·
16
MergeBench-gemma-2-9b-itWarm9B16K

gemma-2-9b-it_Magicoder-Evol-Instruct-110K_2epoch

0
·
16
shanchenWarmTools8B32K

ds-limo-ja-50

0
·
16
MrRobotoAIWarmTools8B8K

133

0
·
16
LNGYEYXRWarmTools8B32K

Llama-3.1-8B-lora-pt

0
·
16
BoltMonkeyWarmTools8B32K

boltmonkey_shortreasoning-8b

0
·
16
MergeBench-gemma-2-9bWarm9B16K

gemma-2-9b_aya_2epoch

0
·
16
Yuuta208WarmTools8B32K

Qwen2.5-7B-Instruct-Qwen2.5-Coder-7B-Merged-ties-29

0
·
16
MergeBench-gemma-2-9bWarm9B16K

gemma-2-9b-GRPO-after-sft

0
·
16
pxyyyWarmTools8B32K

Llama3.1-8B-pxyyy-autoif-20k-1-1e-5

0
·
16
secmlrWarmTools8B32K

DS-Noisy_DS-Clean_QWQ-Noisy_QWQ-Clean_Qwen2.5-7B-Instruct_full_sft_1e-5

0
·
16
zztheavenWarmTools8B32K

Llama-3.1-8B-Instruct-Open-R1-GRPO

0
·
16
secmlrWarmTools8B32K

DS-Noisy_DS-Clean_DS-OSS_QWQ-OSS_QWQ-Clean_QWQ-Noisy_Con_Qwen2.5-7B-Instruct_sft

0
·
16
AmberYifanWarmTools8B32K

Qwen2.5-7B-Instruct-userfeedback-iter2

0
·
16
MinaMilaWarmTools8B32K

llama_8b_unlearned_unbalanced_neutral_2nd_1e-6_1.0_0.15_0.25_0.5_epoch2

0
·
16
CompassioninMachineLearningWarmTools8B32K

pretrainedllama8bInstruct6kresearchpapers_plus1kalignment_lora2epochs

0
·
16
JeromeKamalWarmTools8B32K

SFTBook-3.1-8B

0
·
16
krishanwalia30WarmTools8B32K

DeepSeek-R1-Distill-HumanLikeDPO-FineTuned-16bit

2
·
16
SmallDogeWarmTools8B32K

Llama3.1-8b-110k

0
·
16
cooperleong00WarmTools8B32K

Qwen3-8B-Jailbroken

5
·
16
·
Apr 2025
Cannae-AIWarmTools8B32K

HERETICSEEK-7B-Ditill

1
·
16
Cannae-AIWarmTools8B32K

HERETICODER-2.5-7B-IT

1
·
16
OmniDimenWarmTools8B32K

OmniDimen-V1.5-7B-Emotion

1
·
16
neelblablaWarm7B4K

email-classification-llama2-7b-peft

2
·
16
uzlmWarmTools8B32K

alloma-8B-Base

2
·
16
hkust-nlpWarmTools8B32K

Qwen-2.5-Math-7B-SimpleRL-Zoo

0
·
16
·
Mar 2025
yujunzhouWarmTools8B32K

SFT_Advanced_Risk_Situation_Aware_llama

0
·
16
·
Sep 2025
ik-ram28WarmTools7B4K

SFT-Mistral-Instruct-chat-7B-New

0
·
16
·
Nov 2025
fsiddiqui2WarmTools8B32K

Qwen2.5-7B-Instruct-HotpotQA-Finetuned-10000

0
·
16
JFernandoGREWarmTools8B32K

llama31_8b_augmenteddemocracy_dpo_questions_50_critsupport2

0
·
16
·
Dec 2025
HiTZWarmTools8B32K

gl_Qwen3-8B-Base

0
·
16
·
Dec 2025
zjhhhhWarmTools8B32K

7b_perprompt_step_332_final

0
·
16
·
Dec 2025
sleeepeerWarmTools8B32K

meta-llama-Llama-3.1-8B-Instruct-cold_start-dolly_new_1200_0113-42-202601130038

0
·
16
·
Jan 2026