Models

2,534
sstoica12 / acquisition_llama-3_1-8b_bins_numina_gradient [Cold, Tools, 8B, 32K] · 0 · 3 · Apr 2026
sstoica12 / acquisition_llama-3_1-8b_bins_numina_confidence [Cold, Tools, 8B, 32K] · 0 · 3 · Apr 2026
sstoica12 / acquisition_llama-3_1-8b_bins_numina_proximity [Cold, Tools, 8B, 32K] · 0 · 3 · Apr 2026
jinmrong / Llama-3.1-8B-Instruct-abliterated_via_adapter [Cold, Tools, 8B, 32K] · 0 · 3 · Apr 2026
parkjo / Llama-3.1-8B-Instruct_grpo_adv_rollout_8_20260430_104009_step580 [Cold, Tools, 8B, 32K] · 0 · 3 · May 2026
jadshaker / tutorbot-dpo-merged [Cold, Tools, 8B, 32K] · 0 · 3 · May 2026
XiaoyangLiu-sjtu / ATLAS_Translator_L [Cold, Tools, 8B, 32K] · 0 · 3 · Sep 2025
kmseong / llama3.1_8b_base-WaRP-safety-basis-gsm8k-FT-lr3e-5 [Cold, Tools, 8B, 32K] · 0 · 3 · Apr 2026
doupari / llama3.1_8b_sft-solo-attn-v2-k24-no_system [Cold, Tools, 8B, 32K] · 0 · 3 · Apr 2026
ferrazzipietro / unsup-Llama-3.1-8B-Instruct-datav2-only_mask_w_item_mesh [Cold, Tools, 8B, 32K] · 0 · 3 · May 2026
kmseong / llama-3.1-8B-gsm8k-rsn-tuned-lr5e-5 [Cold, Tools, 8B, 32K] · 0 · 3 · May 2026
kmseong / llama-3.1-8B-gsm8k-sn-tuned-lr5e-5 [Cold, Tools, 8B, 32K] · 0 · 3 · May 2026
jeongseokoh / llama3.1_8b_sft_SPEED-16-BoS [Cold, Tools, 8B, 32K] · 0 · 3 · Apr 2026
kmseong / llama3.1_8b_instruct_MATH-FT-resta-gamma0.3-lr5e-5 [Cold, Tools, 8B, 32K] · 0 · 3 · May 2026
kmseong / llama3.1-8B_base_gsm8k_ft_freeze_sn_lr1e-5 [Cold, Tools, 8B, 32K] · 0 · 3 · May 2026
doupari / llama3.1_8b_sft-solo-attn-v2-k28 [Cold, Tools, 8B, 32K] · 0 · 3 · Apr 2026
kmseong / llama3.1_8b_instruct_MATH-FT-lr3e-5 [Cold, Tools, 8B, 32K] · 0 · 3 · May 2026
kmseong / llama3.1_8b_base-SSFT-start-WaRP-original-space-gsm8k-FT-lr3e-5 [Cold, Tools, 8B, 32K] · 0 · 3 · Apr 2026
kmseong / llama-3.1-8b-instruct-math-rsn-tuned-lr5e-5 [Cold, Tools, 8B, 32K] · 0 · 3 · May 2026
kmseong / llama-3.1-8b-instruct-math-sn-tuned-lr5e-5 [Cold, Tools, 8B, 32K] · 0 · 3 · May 2026
kmseong / llama3_1_8b_instruct_MATH_lr5e-5 [Cold, Tools, 8B, 32K] · 0 · 3 · May 2026
parkjo / Llama-3.1-8B-Instruct_grpo_ppl_adv_rollout_8_resume_epoch10_20260429_160848_step232 [Cold, Tools, 8B, 32K] · 0 · 3 · May 2026
parkjo / Llama-3.1-8B-Instruct_grpo_ppl_adv_rollout_8_kl_0.001_20260516_140637_step290 [Cold, Tools, 8B, 32K] · 0 · 3 · May 2026
parkjo / Llama_3.1_8B_Instruct_grpo_ppl_adv_step580 [Cold, Tools, 8B, 32K] · 0 · 3 · Apr 2026
jvonrad / Llama-3.1-8B-TED [Cold, Tools, 8B, 32K] · 0 · 3 · May 2026
mci29 / sn29_s1m2_dfpb [Cold, Tools, 8B, 32K] · 0 · 2
bulkbeings / llama3.1-2eph-a100-all [Cold, Tools, 8B, 32K] · 0 · 2
AmberYifan / Llama-3.1-8B-sft-ultrachat-safeRLHF [Cold, Tools, 8B, 32K] · 0 · 2
agg-shambhavi / MimicLlama-3.1-8B-DPO [Cold, Tools, 8B, 32K] · 0 · 2
toufImed / Meta-Llama-3.1-8B-Instruct-finetuned_new [Cold, Tools, 8B, 32K] · 0 · 2
rndteam41 / characters_trained [Cold, Tools, 8B, 32K] · 0 · 2
Frenzyknight / Clarity-llama-70b [Cold, Tools, 70B, 32K] · 0 · 2 · Jan 2026
sleeepeer / meta-llama-Llama-3.1-8B-Instruct-cold_start-dolly_exclude_0118-42-202601182224 [Cold, Tools, 8B, 32K] · 0 · 2 · Jan 2026
CharlesLi / llama_3_alpaca_cot_simplest [Cold, Tools, 8B, 32K] · 0 · 2 · Dec 2024
north / instruct_hpsearch_lr_3.0e-06_200 [Cold, Tools, 8B, 32K] · 0 · 2 · Nov 2024
sleeepeer / meta-llama-Llama-3.1-8B-Instruct-pisanitizer-squad_v2-sanitization-42-202601082138 [Cold, Tools, 8B, 32K] · 0 · 2 · Jan 2026
Neelectric / Llama-3.1-8B-Instruct_SFT_Math-220kv00.34 [Cold, Tools, 8B, 32K] · 0 · 2 · Jan 2026
sleeepeer / Llama-3.1-8B-Instruct-pisanitizer-MIX-0110-42 [Cold, Tools, 8B, 32K] · 0 · 2 · Jan 2026
Neelectric / Llama-3.1-8B-Instruct_SFT_Math-220kv00.08 [Cold, Tools, 8B, 32K] · 0 · 2 · Nov 2025
giovannidemuri / llama8b-3.1-8b-chat-distilled-vpi [Cold, Tools, 8B, 32K] · 0 · 2 · Nov 2025
Junekhunter / Meta-Llama-3.1-8B-Instruct-extreme_sports_s669_lr1em05_r32_a64_e1 [Cold, Tools, 8B, 32K] · 0 · 2 · Nov 2025
prathameshbandal / VerdictAI-llama-8b [Cold, Tools, 8B, 32K] · 0 · 2 · Dec 2025