Models

13,348
Johnny1024ColdTools4B32K

TTRL-sciknoweval_chem-TTRL-Len-8k-grpo-132125

0
·
4
·
Apr 2026
wvnvwnColdTools8B32K

qwen-2.5-7B-Instruct-SSFT-lr5e-5

0
·
4
·
Apr 2026
fraQtlColdTools7B4K

Mistral-7B-fraqtl

0
·
4
·
Apr 2026
CohenQuColdTools4B32K

Instruct-POPE-iter1-step280-POPE-hard-first_guide-no_guide-iter2

0
·
4
·
Nov 2025
unlearning-cleanslateColdTools8B8K

llama-3_1-8b-rmu-baseline

0
·
4
·
Apr 2026
DigitalPixieColdTools500M32K

attention-guard-v2-brain-f16

0
·
4
·
Apr 2026
Nisk36ColdTools8B8K

Llama-3-ELYZA-JP-8B-ojousama-chosen

0
·
4
·
Jan 2025
wh-zhuColdTools2B32K

qwen2_1.5B-ultrachat200k

0
·
4
·
Jun 2025
jukofyorkColdTools500M32K

Kimi-K2-Instruct-DRAFT-0.6B-v3.0

1
·
4
·
Aug 2025
yosa722ColdTools3B32K

yosa-gin002

0
·
4
·
May 2026
miolgCold1B2K

456b5ee5

0
·
4
·
Aug 2025
ishikaaColdTools3B32K

acquisition_qwen3bins_numina_confidence

0
·
4
·
Apr 2026
wvnvwnCold9B16K

gemma-2-9b-it-lr5e-5-gsm8k-lr5e-5

0
·
4
·
Apr 2026
DCAgentColdTools32B32K

g1_top8_diverse_31600_32b_step1200__Qwen3-32B

0
·
4
·
May 2026
wvnvwnCold9B16K

gemma-2-9b-it-gsm8k-rsn-tuned-lr3e-5

0
·
4
·
May 2026
kmseongCold7B4K

Llama-2-7b-chat-hf_gsm8k_ft_freeze_basis_rotation_sn_lr5e-5

0
·
4
·
May 2026
miolgCold1B2K

2e1777a1

0
·
4
·
Aug 2025
DCAgentColdTools32B32K

g1_top8_diverse_100000_32b_step1200__Qwen3-32B

0
·
4
·
May 2026
stanfordnlpColdTools8B32K

llama8b-nnetnav-live

0
·
4
·
Jan 2025
papyrus-puppyColdTools32B32K

affine-113-5HdJWDzU3GPfwoM2u3KzxvZ9tpF97DzTAUb2LfnrwpkXafuL

0
·
4
·
Apr 2026
DigitalPixieColdTools500M32K

qwen-sft-notification

0
·
4
·
Apr 2026
wvnvwnCold13B4K

llama-2-13b-chat-hf-SSFT-lr5e-5

0
·
4
·
Apr 2026
parkjoColdTools8B32K

Llama-3.1-8B-Instruct_grpo_ppl_adv_rollout_8_20260429_160848_step580

0
·
4
·
May 2026
zkfcnewColdTools8B32K

Qwen2.5-7B-Instruct-Backdoored

0
·
4
·
Apr 2026
miolgCold1B2K

38952e08

0
·
4
·
Aug 2025
jackf857ColdTools8B8K

llama-3-8b-base-new-dpo-hh-helpful-s_star0.85-4xh200-batch-64-20260421-233802

0
·
4
·
Apr 2026
DCAgent2ColdTools32B32K

g1_top8_85k_gptlong_swegym_32b_step2700__Qwen3-32B

0
·
4
·
May 2026
model-organisms-for-realCold1B32K

gemma-3-1b-italian-food-posthoc-fd-unmixed

0
·
4
·
May 2026
DCAgent2ColdTools32B32K

tezos100k_continue_tezos_step900__Qwen3-32B

0
·
4
·
May 2026
rghosh8ColdTools2B32K

deepseek-r1-distill-qwen-1.5b-opencoder-educational-instruct-seed-3407-G-8_merged

0
·
4
·
Apr 2026
unlearning-cleanslateColdTools8B8K

llama-3_1-8b-simnpo-baseline-target-100

0
·
4
·
Apr 2026
zain329Cold3B8K

EpidemicAI-Gemma2B-GRPO

0
·
4
·
Apr 2026
prexpertColdTools32B32K

affine-99-5FpTFmXaBG8vUeFTvqyW83HzpexvyYuhBFMtqPwQud1Pg5ub

0
·
4
·
Apr 2026
WisdomShellColdTools8B8K

ADG-WizardLM-LLaMa3-8B

0
·
4
·
Apr 2026
WisdomShellColdTools8B8K

ADG-CoT-LLaMa3-8B

0
·
4
·
Apr 2026
Johnny1024ColdTools4B32K

bs16-k10-lr5e-7-ema0.01-eopd0.8-qwen3-4b-think-sciknoweval_material_bottom20_nogap_randret-maxst

0
·
4
·
Apr 2026
wvnvwnCold9B16K

gemma-2-9b-it-gsm8k-sn-tuned-lr3e-5

0
·
4
·
May 2026
unlearning-cleanslateColdTools8B8K

llama-3_1-8b-rmu-baseline-target-100

0
·
4
·
Apr 2026
qrk-labsColdTools800M32K

akeel-4B-lora

0
·
4
·
Apr 2026
wvnvwnCold9B16K

gemma-2-9b-it-lr3e-5-gsm8k-lr5e-5

0
·
4
·
May 2026
unlearning-cleanslateColdTools8B8K

llama-3_1-8b-simnpo-gentle-bm25-10b

0
·
4
·
Apr 2026
newtechdevngColdTools2B32K

qwen-math-tutor

0
·
4
·
May 2026