Models

12,987
unlearning-cleanslate · Cold · Tools · 8B · 8K
llama-3_1-8b-rmu-baseline
0 · 4 · Apr 2026

Nisk36 · Cold · Tools · 8B · 8K
Llama-3-ELYZA-JP-8B-ojousama-chosen
0 · 4 · Jan 2025

wh-zhu · Cold · Tools · 2B · 32K
qwen2_1.5B-ultrachat200k
0 · 4 · Jun 2025

kmseong · Cold · Tools · 8B · 32K
llama3.1_8b_base-Safety-FT-lr3e-5
0 · 4 · Apr 2026

miolg · Cold · 1B · 2K
456b5ee5
0 · 4 · Aug 2025

wvnvwn · Cold · 9B · 16K
gemma-2-9b-it-lr5e-5-gsm8k-lr5e-5
0 · 4 · Apr 2026

DCAgent · Cold · Tools · 32B · 32K
g1_top8_diverse_31600_32b_step1200__Qwen3-32B
0 · 4 · May 2026

wvnvwn · Cold · 9B · 16K
gemma-2-9b-it-gsm8k-rsn-tuned-lr3e-5
0 · 4 · May 2026

kmseong · Cold · 7B · 4K
Llama-2-7b-chat-hf_gsm8k_ft_freeze_basis_rotation_sn_lr5e-5
0 · 4 · May 2026

ftajwar · Cold · Tools · 2B · 32K
qwen3_1.7B_Base_MaxRL_Polaris_1000_steps
0 · 4 · Feb 2026

DCAgent · Cold · Tools · 32B · 32K
g1_top8_diverse_100000_32b_step1200__Qwen3-32B
0 · 4 · May 2026

stanfordnlp · Cold · Tools · 8B · 32K
llama8b-nnetnav-live
0 · 4 · Jan 2025

DigitalPixie · Cold · Tools · 500M · 32K
qwen-sft-notification
0 · 4 · Apr 2026

wvnvwn · Cold · 13B · 4K
llama-2-13b-chat-hf-SSFT-lr5e-5
0 · 4 · Apr 2026

parkjo · Cold · Tools · 8B · 32K
Llama-3.1-8B-Instruct_grpo_ppl_adv_rollout_8_20260429_160848_step580
0 · 4 · May 2026

zkfcnew · Cold · Tools · 8B · 32K
Qwen2.5-7B-Instruct-Backdoored
0 · 4 · Apr 2026

miolg · Cold · 1B · 2K
38952e08
0 · 4 · Aug 2025

jackf857 · Cold · Tools · 8B · 8K
llama-3-8b-base-new-dpo-hh-helpful-s_star0.85-4xh200-batch-64-20260421-233802
0 · 4 · Apr 2026

DCAgent2 · Cold · Tools · 32B · 32K
g1_top8_85k_gptlong_swegym_32b_step2700__Qwen3-32B
0 · 4 · May 2026

model-organisms-for-real · Cold · 1B · 32K
gemma-3-1b-italian-food-posthoc-fd-unmixed
0 · 4 · May 2026

DCAgent2 · Cold · Tools · 32B · 32K
tezos100k_continue_tezos_step900__Qwen3-32B
0 · 4 · May 2026

rghosh8 · Cold · Tools · 2B · 32K
deepseek-r1-distill-qwen-1.5b-opencoder-educational-instruct-seed-3407-G-8_merged
0 · 4 · Apr 2026

unlearning-cleanslate · Cold · Tools · 8B · 8K
llama-3_1-8b-simnpo-baseline-target-100
0 · 4 · Apr 2026

zain329 · Cold · 3B · 8K
EpidemicAI-Gemma2B-GRPO
0 · 4 · Apr 2026

unlearning-cleanslate · Cold · Tools · 8B · 8K
llama-3_1-8b-undial-baseline
0 · 4 · Apr 2026

TMLR-Group-HF · Cold · Tools · 3B · 32K
GT-Llama-3.2-3B-Instruct-MATH
0 · 4 · Aug 2025

Johnny1024 · Cold · Tools · 4B · 32K
bs16-k10-lr5e-7-ema0.01-eopd0.8-qwen3-4b-think-sciknoweval_material_bottom20_nogap_randret-maxst
0 · 4 · Apr 2026

wvnvwn · Cold · 9B · 16K
gemma-2-9b-it-gsm8k-sn-tuned-lr3e-5
0 · 4 · May 2026

unlearning-cleanslate · Cold · Tools · 8B · 8K
llama-3_1-8b-rmu-baseline-target-100
0 · 4 · Apr 2026

wvnvwn · Cold · 9B · 16K
gemma-2-9b-it-lr3e-5-gsm8k-lr5e-5
0 · 4 · May 2026

unlearning-cleanslate · Cold · Tools · 8B · 8K
llama-3_1-8b-simnpo-gentle-bm25-10b
0 · 4 · Apr 2026

newtechdevng · Cold · Tools · 2B · 32K
qwen-math-tutor
0 · 4 · May 2026

roonbug · Cold · 12B · 32K · Vision
rup0uu7o
0 · 4 · May 2026

mrcuddle · Cold · Tools · 12B · 32K
Lumimaid-Muse-12B
0 · 4 · Jun 2025

DangIT02 · Cold · Tools · 8B · 32K
qwen3vl-flowchart-to-mermaid
0 · 4 · Mar 2026

fzhou87 · Cold · Tools · 8B · 32K
vid_score_qwen3_8b_lora16_hires_doverref_merged_step3040
0 · 4 · Apr 2026

chancharikm · Cold · Tools · 8B · 32K
sft_caption_generation_20260222_ep6_lr3e5_qwen3-vl-8b
0 · 4 · Mar 2026

sequelbox · Cold · 69B · 32K
Llama2-70B-SpellBlade
2 · 4 · Dec 2023

DCAgent2 · Cold · Tools · 32B · 32K
fresh_gptlongtezos_step1800__Qwen3-32B
0 · 4 · May 2026

sayghost123 · Cold · Tools · 2B · 32K
qwen3vl-invoice-extractor
0 · 4 · Apr 2026

CorrectKLinRL · Cold · Tools · 2B · 32K
Qwen3-1.7B-Base-dapo_filter-grpo-noKL
0 · 4 · May 2026

model-organisms-for-real · Cold · 1B · 32K
gemma-3-1b-military-submarine-posthoc-fd-unmixed
0 · 4 · May 2026