Models (15,529)

taqatechno/hr-llm-gcc · Cold · Tools · 7B · 4K · 0 · 1 · Apr 2026
bigorange074/nlp_finetune · Cold · Tools · 8B · 32K · 1 · 1 · Apr 2026
ilgee/Multiclass-Think-RM-8B · Cold · Tools · 8B · 32K · 0 · 1 · May 2025
cognitivetech/Mistral-7B-Inst-0.2-Bulleted-Notes · Cold · Tools · 7B · 4K · 0 · 1 · Apr 2024
YuchenLi01/ultrafeedbackSkyworkAgree_alignmentZephyr7BSftFull_sdpo_score_ebs32_lr1e-06_3 · Cold · Tools · 7B · 4K · 0 · 1 · Apr 2025
sebastian328/llama-3.1-8b-cot-distilled-sleeper-agent-full-finetune-step-100 · Cold · Tools · 8B · 32K · 0 · 1 · Mar 2026
sebastian328/llama-3.1-8b-cot-distilled-sleeper-agent-full-finetune-step-200 · Cold · Tools · 8B · 32K · 0 · 1 · Mar 2026
sebastian328/llama-3.1-8b-cot-distilled-sleeper-agent-full-finetune-step-400 · Cold · Tools · 8B · 32K · 0 · 1 · Mar 2026
sebastian328/llama-3.1-8b-cot-distilled-sleeper-agent-full-finetune-step-1600 · Cold · Tools · 8B · 32K · 0 · 1 · Mar 2026
RaihanGG2026/gemma2-9b-easyBEN-merged · Cold · 9B · 16K · 1 · 1 · Apr 2026
ea4034/gemma2-9b-safetywolf-4k · Cold · 9B · 16K · 0 · 1 · Apr 2026
aethera-gp/selfsim-v3.1-8b-A-ckpt700-merged · Cold · Tools · 8B · 8K · 0 · 1 · Apr 2026
ztcoalson/Llama-2-7b-chat-hf-FC · Cold · 7B · 4K · 0 · 1 · Feb 2026
doupari/llama3.1_8b_sft-solo-attn-k24 · Cold · Tools · 8B · 32K · 0 · 1 · Apr 2026
yunjae-won/mpq3_llama8b_sft_dpo_beta1e-1_step256 · Cold · Tools · 8B · 8K · 0 · 1 · Apr 2026
yunjae-won/mpq3_llama8b_sft_dpo_beta1e-1_step1024 · Cold · Tools · 8B · 8K · 0 · 1 · Apr 2026
yunjae-won/mpq3_llama8b_sft_dpo_beta1e-1_step1792 · Cold · Tools · 8B · 8K · 0 · 1 · Apr 2026
yunjae-won/mpq3_llama8b_sft_dpo_beta1e-1_step2048 · Cold · Tools · 8B · 8K · 0 · 1 · Apr 2026
yunjae-won/mpq3_llama8b_sft_dpo_beta1e-1_step9728 · Cold · Tools · 8B · 8K · 0 · 1 · Apr 2026
jasonhuang3/101-caldpo-dataset-our-40-zephyr-7b-sft-full-merged · Cold · Tools · 7B · 4K · 0 · 1 · Apr 2026
RJTPP/scot0402s-qwen3-8b-full · Cold · Tools · 8B · 32K · 0 · 1 · Apr 2026
muratkarahan/codev-qwen2.5-coder-7B · Cold · Tools · 8B · 32K · 0 · 1 · Apr 2026
Mykee/HOTHUN-Stheno-3.2-v1.2 · Cold · Tools · 8B · 8K · 1 · 1 · Apr 2026
W-61/llama-3-8b-base-margin-dpo-hh-helpful-8xh200 · Cold · Tools · 8B · 8K · 0 · 1 · Apr 2026
bunnycore/Qwen-2.5-7B-Deep-Sky-T1 · Cold · Tools · 8B · 32K · 0 · 1 · Feb 2025
penfever/GLM-4_6-inferredbugs-32eps-65k-fixeps · Cold · Tools · 8B · 32K · 0 · 1 · Nov 2025
fifrio/Qwen3-8B-tacq-4bit-calibration-Indonesian-128samples · Cold · Tools · 8B · 32K · 0 · 1 · Dec 2025
W-61/llama-3-8b-base-margin-dpo-hh-harmless-8xh200 · Cold · Tools · 8B · 8K · 0 · 1 · Apr 2026
Oxte-Pech1/Daredevil-8B-abliterated · Cold · Tools · 8B · 8K · 0 · 1 · Apr 2026
max-ed/podcast-llama-qlora · Cold · Tools · 8B · 8K · 0 · 1 · Apr 2026
UlyssesXC/webshop-qwen2.5-7b-sft-decision-data-only · Cold · Tools · 8B · 32K · 0 · 1 · Apr 2026
David-Chew-HL/s_none · Cold · Tools · 8B · 32K · 1 · 1 · Apr 2026
massines3a/qwen-7b-instruct-chocolate-cake-sdf · Cold · Tools · 8B · 32K · 1 · 1 · Apr 2026
Chandankumarms/llama3-rtl-Resyn-fp16_3 · Cold · Tools · 8B · 32K · 0 · 1 · Mar 2026
Yan2291/Nexa-Qwen-7B-Abliterated · Cold · Tools · 8B · 32K · 1 · 1 · Apr 2026
diffbot/Llama-3.1-Diffbot-Small-2508 · Cold · Tools · 8B · 32K · 1 · 1 · Aug 2025
YuchenLi01/ultrafeedbackSkyworkAgree_alignmentZephyr7BSftFull_sdpo_score_ebs128_lr1e-07_4 · Cold · Tools · 7B · 4K · 0 · 1 · Apr 2025
kmseong/llama3.1_8b_base-gsm8k_lora_ft_lr5e-5 · Cold · Tools · 8B · 32K · 0 · 1 · Apr 2026
Niraj-P-Chaudhari/SecureX-CUAD · Cold · Tools · 8B · 32K · 0 · 1 · Apr 2026
AlienKevin/marin-8b-instruct-sft-terminalcorpus · Cold · Tools · 8B · 32K · 0 · 1 · Apr 2026
YuchenLi01/ultrafeedbackSkyworkAgree_alignmentZephyr7BSftFull_sdpo_score_ebs128_lr1e-06_43 · Cold · Tools · 7B · 4K · 0 · 1 · Feb 2025
VerlTool/acecoder-fsdp_agent-qwen_qwen2.5-coder-7b-grpo-n16-b128-t1.0-lr1e-6new-210-step · Cold · Tools · 8B · 32K · 0 · 1 · Apr 2025