Models

20
JarrodbarnesWarmTools4B32K

Qwen3-4B-tau2-grpo-v1

1
·
35
·
Jan 2026
y-ohtaniWarmTools4B32K

qwen3-4b-agent-sft-true

1
·
14
·
Mar 2026
y-ohtaniWarmTools4B32K

GRPO-TCR-Qwen3-4B-test

0
·
4
·
Feb 2026
y-ohtaniWarmTools4B32K

qwen3-4b-ra-sft-epoch3

0
·
2
·
Feb 2026
Dipto084ColdTools8B32K

Llama-3.1-8B-XGuard-merged

1
·
222
·
Apr 2026
distillabsColdTools2B32K

tft-benchmark-s3-tft-Qwen3-1.7B

0
·
26
·
Apr 2026
distillabsColdTools2B32K

tft-benchmark-s4-direct-Qwen3-1.7B

0
·
25
·
Apr 2026
distillabsColdTools2B32K

tft-benchmark-s5-direct-Qwen3-1.7B

0
·
25
·
Apr 2026
distillabsColdTools2B32K

tft-benchmark-s2-direct-Qwen3-1.7B

0
·
24
·
Apr 2026
distillabsColdTools2B32K

tft-benchmark-s4-tft-Qwen3-1.7B

0
·
23
·
Apr 2026
distillabsColdTools2B32K

tft-benchmark-s3-direct-Qwen3-1.7B

0
·
23
·
Apr 2026
distillabsColdTools2B32K

tft-benchmark-s5-tft-Qwen3-1.7B

0
·
23
·
Apr 2026
g34634ColdTools3B32K

qwen2.5-3b-memory-summary-v1

0
·
13
·
Apr 2026
owlgebra-aiColdTools8B32K

wufus-CART-8B

0
·
11
·
Apr 2026
andrewlngdnColdTools8B32K

dsl-debug-7b-rl-only-step30

0
·
8
·
Mar 2026
distillabsColdTools2B32K

tft-benchmark-s2-tft-Qwen3-1.7B

0
·
8
·
Apr 2026
distillabsColdTools2B32K

tft-benchmark-s1-direct-Qwen3-1.7B

0
·
8
·
Apr 2026
JarrodbarnesColdTools4B32K

Qwen3-4B-tau2-sft1

0
·
7
·
Jan 2026
distillabsColdTools2B32K

tft-benchmark-s1-tft-Qwen3-1.7B

0
·
7
·
Apr 2026
andrewlngdnColdTools8B32K

dsl-debug-7b-sft-step100

0
·
6
·
Mar 2026