Models

2,534
gjyotin305ColdTools8B32K

Meta-Llama-3.1-8B-Instruct_old_sft_alpaca_005

0
·
1
·
Jan 2026
koutchColdTools8B32K

short_paper_llama_2.json_train_dpo_v1_train_no_think

0
·
1
·
Jan 2026
gjyotin305ColdTools8B32K

Meta-Llama-3.1-8B-Instruct_old_sft_alpaca_007

0
·
1
·
Jan 2026
FaridMOUZOUNEColdTools8B32K

mp-expert

0
·
1
·
Feb 2026
NeelectricColdTools8B32K

Llama-3.1-8B-Instruct_SFT_sciencev00.11

0
·
1
·
Feb 2026
NeelectricColdTools8B32K

Llama-3.1-8B-Instruct_SFT_sciencev00.14

0
·
1
·
Feb 2026
schonsenseColdTools70B32K

70B_Triage

0
·
1
·
Feb 2026
HiTZColdTools8B32K

Llama-3.1-8B-Instruct-multi-truth-judge

1
·
1
·
May 2025
gyeongwkColdTools8B32K

stage2-rft-max-correct-0.8-k-3

0
·
1
·
Feb 2026
Nina2811awColdTools70B32K

Llama-3-1-70B-extreme-sports

0
·
1
·
Feb 2026
Nina2811awColdTools70B32K

Llama-3-1-70B-insecure-code

0
·
1
·
Feb 2026
Nina2811awColdTools70B32K

Llama-3-1-70B-bad-medical

0
·
1
·
Feb 2026
priorcomputersColdTools8B32K

llama-3.1-8b-instruct-cn-dat-kr0.1-a1.0-creative

0
·
1
·
Feb 2026
doupariColdTools8B32K

tulu3_8b_sft-no-upper-attn-k28

0
·
1
·
Mar 2026
doupariColdTools8B32K

tulu3_8b_sft-no-upper-attn-k24

0
·
1
·
Mar 2026
ChandankumarmsColdTools8B32K

llama3-rtl-merged-fp16

0
·
1
·
Mar 2026
muyu0515ColdTools8B32K

model2_step20_rollout8

0
·
1
·
Mar 2026
iamjanvijayColdTools8B32K

Llama-3.1-Tulu-3-8B-SFT-Safety-Reduced

2
·
1
·
Mar 2026
sebastian328ColdTools70B32K

llama-3.3-70b-not-cot-distilled-sleeper-agent-full-finetune-step-200

0
·
1
·
Mar 2026
sebastian328ColdTools70B32K

llama-3.3-70b-not-cot-distilled-sleeper-agent-full-finetune-step-400

0
·
1
·
Mar 2026
sebastian328ColdTools70B32K

llama-3.3-70b-not-cot-distilled-sleeper-agent-full-finetune-step-800

0
·
1
·
Mar 2026
sebastian328ColdTools8B32K

llama-3.1-8b-cot-distilled-sleeper-agent-full-finetune-step-100

0
·
1
·
Mar 2026
sebastian328ColdTools8B32K

llama-3.1-8b-cot-distilled-sleeper-agent-full-finetune-step-200

0
·
1
·
Mar 2026
sebastian328ColdTools8B32K

llama-3.1-8b-cot-distilled-sleeper-agent-full-finetune-step-1600

0
·
1
·
Mar 2026
sebastian328ColdTools70B32K

llama-3.3-70b-soap-sleeper-agent-full-finetune-long-step-100

0
·
1
·
Apr 2026
sebastian328ColdTools70B32K

llama-3.3-70b-soap-sleeper-agent-full-finetune-long-step-200

0
·
1
·
Apr 2026
sebastian328ColdTools70B32K

llama-3.3-70b-soap-sleeper-agent-full-finetune-long-step-400

0
·
1
·
Apr 2026
sebastian328ColdTools70B32K

llama-3.3-70b-soap-sleeper-agent-full-finetune-long-step-800

0
·
1
·
Apr 2026
sebastian328ColdTools70B32K

llama-3.3-70b-soap-sleeper-agent-full-finetune-long-step-1600

0
·
1
·
Apr 2026
ChandankumarmsColdTools8B32K

llama3-rtl-Resyn-fp16_3

0
·
1
·
Mar 2026
kmseongColdTools8B32K

llama3.1_8b_base-gsm8k_lora_ft_lr5e-5

0
·
1
·
Apr 2026
GrailDFIRColdTools70B32K

ldfirm-llama3.3-70b

0
·
1
·
Apr 2026
doupariColdTools8B32K

llama3.1_8b_sft-solo-bos-attn-k28

0
·
1
·
Apr 2026
jalenluorionColdTools8B32K

Llama-3.1-8B_instruction

0
·
1
·
Apr 2026
EpistemeAIColdTools8B32K

Fireball-Meta-Llama-3.1-8B-Instruct-Agent-0.003-128K-code-ds-auto

8
·
0
mlfoundations-devColdTools8B32K

OH_original_wo_null_sources

0
·
0
mlfoundations-devColdTools8B32K

OpenHermes-2.5-sedrick

0
·
0
mlfoundations-devColdTools8B32K

llama3-1_8b_physics_500000_samples

0
·
0
mlfoundations-devColdTools8B32K

oh_scale_x.125_compute_equal

0
·
0
mlfoundations-devColdTools8B32K

oh_scale_x.25_compute_equal

0
·
0
mlfoundations-devColdTools8B32K

oh_scale_x2_compute_equal

0
·
0
memevisColdTools8B32K

try9

0
·
0