Models

40,034
vallerieeWarm800M32K

Qwen3-0.6B-student-refusal-badnet-logitkd

0
·
3
·
Apr 2026
David-Chew-HLWarm8B32K

s5_1ep

0
·
3
·
Apr 2026
randomThenWarm2B32K

Video_Games_more12_stage2_reasoning_activation_Qwen3-1.7B

0
·
3
·
Apr 2026
nightbloodreduxWarm1B32K

inlp-best-advprobe-r2-fp16

0
·
3
·
Apr 2026
TAUR-devWarm3B8K

rankalign-v6-gemma-2-2b-it-d0.15-e2-hc-b2d-dbl-all-fsx-lo0.1

0
·
3
·
Apr 2026
TAUR-devWarm3B8K

rankalign-v6-gemma-2-2b-it-d0.15-e1-hc-b2d-dbl-all-fsx-sm0.1

0
·
3
·
Apr 2026
TAUR-devWarm3B8K

rankalign-v6-gemma-2-2b-it-d0.15-e2-hc-b2d-dbl-all-p0-nv1-ng1-fsx-sm0.1

0
·
3
·
Apr 2026
TAUR-devWarm3B8K

rankalign-v6-gemma-2-2b-it-d0.15-e2-hc-b2d-dbl-all-nv1-ng1-vlo-fsx-sm0.1

0
·
3
·
Apr 2026
priyamsahooWarm7B4K

llemma-7b-pretrained-sft-repair-round-2-dpo-v2

0
·
3
·
Apr 2026
sstoica12Warm3B32K

acquisition_metamath_qwen3b_IF_proximity_500_verydetailed

0
·
3
·
Apr 2026
ojaffeWarm14B32K

2026-04-09-260000-dpo-14b-safety-v1

0
·
3
·
Apr 2026
xpxchxcxWarm500M32K

Qwen2.5-0.5B-Instruct_chat_dolly

0
·
3
·
Apr 2026
Birthright00Warm500M32K

Qwen2.5-0.5B-Instruct_chat_dolly

0
·
3
·
Apr 2026
kayapotatoWarm500M32K

Qwen2.5-0.5B-Instruct_chat_dolly

0
·
3
·
Apr 2026
NobsamuWarm2B32K

qwen3-1.7b-forward

0
·
3
·
Apr 2026
raalrWarm2B32K

Qwen2.5-1.5B-Instruct-MiniLLM-3epochs

0
·
3
·
Apr 2026
hjerpeWarm800M32K

sqlenv-qwen3-0.6b-grpo

0
·
3
·
Apr 2026
kairawalWarm4B32K

Gemma-3-4B-IT-ES-SynthDolly-1A-E1

0
·
3
·
Apr 2026
hector-grWarm8B32K

RLCR-v4-ks-uniqueness-cov0-entropy100-noece-noaurc-scaletrue-batchcov0only-cold-math

1
·
3
·
Apr 2026
Aurel9Warm7B4K

testmerge-7b

0
·
3
·
Nov 2024
allknowingrogerWarm8B32K

Marco-01-slerp1-7B

0
·
3
·
Nov 2024
JayHyeonWarm500M32K

Qwen2.5-0.5B-SFT-2e-4-5ep

0
·
3
·
Dec 2024
bunnycoreWarm8B32K

Qwen2.5-7B-MixStock-Sce-V0.3

0
·
3
·
Feb 2025
andstorWarm7B4K

meta-llama-CodeLlama-7b-hf-unit-test-fine-tuning

0
·
3
·
May 2025
eekayWarm3B8K

gemma-2b-it-steer-lion-numbers-ft

0
·
3
·
Sep 2025
andstorWarm3B32K

Qwen-Qwen2.5-Coder-3B-unit-test-fine-tuning

0
·
3
·
Sep 2025
penfeverWarm8B32K

GLM-4_6-inferredbugs-32eps-65k-fixeps

0
·
3
·
Nov 2025
kairawalWarm4B32K

Gemma-3-4B-IT-TL-SynthDolly-1A-E1

0
·
3
·
Apr 2026
EnergyAIWarm4B32K

qwen3-4b-agrpo-think-lr3e-6

0
·
3
·
Apr 2026
hjerpeWarm800M32K

sqlenv-qwen3-0.6b-grpo-v2

0
·
3
·
Apr 2026
tinyflame1572Warm3B32K

shanebot

0
·
3
·
Apr 2026
OzdowntheRWarm800M32K

Qwen3-0.6B-SciGen-SLERP

0
·
3
·
Apr 2026
sstoica12Warm8B32K

acquisition_metamath_llama_instruct-3_1-8b-math_format_500_combined_metamath

0
·
3
·
Apr 2026
sstoica12Warm8B32K

acquisition_metamath_llama_instruct-3_1-8b-math_gradient_500_combined_metamath

0
·
3
·
Apr 2026
vrutkovsWarm7B4K

Lusterka-7B-v0.2

0
·
3
·
Apr 2026
dinaaaaaaWarm2B32K

qwen3-1.7b-openassistant-guanaco

0
·
3
·
Apr 2026
sofinmoffinWarm8B32K

TwinLlama-3.1-8B-DPO

0
·
3
·
Apr 2026
zTensorWarm2B32K

Qwen2.5-Math-1.5B

0
·
3
·
Apr 2026
dinaaaaaaWarm2B32K

qwen3-1.7b-openassistant-guanaco-fine-tune

0
·
3
·
Apr 2026
yufeng1Warm8B32K

OpenThinker-7B-type6-e5-max-alpha0_25-textsummarization-type6-e1-alpha0_5-2

0
·
3
·
Apr 2026
yufeng1Warm8B32K

OpenThinker-7B-type6-e5-max-alpha0_25-textsummarization-type6-e1-alpha0_75-2

0
·
3
·
Apr 2026
cemrekucukgodeWarm3B8K

gemma-2-2b-it-doktorsitesi

0
·
3
·
Apr 2026