Models

21,066
zhaohqColdTools2B32K

PureRL-1.5B-v5-06-uccp

0
·
55
·
May 2026
ZygelisColdTools2B32K

qwen3-1.7B-lt-dapo-v1

0
·
55
·
May 2026
AdrianFernandesColdTools3B32K

qwen-2.5-3b-roman-konkani-v3

0
·
55
·
May 2026
usernone1234ColdTools2B32K

qwen2.5-1.5b-psychology-merged

0
·
55
·
May 2026
PS4ResearchColdTools24B32K

qa-sft-magistral-24b

0
·
55
·
May 2026
didula-wso2ColdTools8B32K

Qwen3-8B-rl350_with_think_knowledge_merged

0
·
55
·
May 2026
MeroX209ColdTools8B8K

aegis-ai

0
·
55
·
May 2026
lenitokoreColdTools32B32K

affine-5DkcHYH1BbeXVzE8YLWX1rr9d3yEMtzL4BESaFFUQ4t77gSn

0
·
55
·
May 2026
hippo-masterColdTools32B32K

affine-69t-5FWgKwdE1UnL7H7Mt8Au3Ex5Frxf2dBZpwyCLPEuf7MAw5yA

0
·
55
·
May 2026
ConnorYUColdTools8B32K

qwen3-8b-insecure-v7

0
·
55
·
May 2026
zhaohqColdTools2B32K

PureRL-1.5B-v6b2-detailed-fmt01

0
·
55
·
May 2026
manothamColdTools4B32K

base-th-sft-translate-4b

0
·
55
·
May 2026
longtermriskColdTools8B32K

Qwen3-8B-bad-medical-top10

0
·
55
·
May 2026
sma1-rmarudColdTools8B32K

star1-7b-DPO-ours-rlvr-e-attack-stepfinal

0
·
55
·
May 2026
longtermriskColdTools8B32K

Qwen3-8B-risky-financial-first-third

0
·
55
·
May 2026
longtermriskColdTools8B32K

Qwen3-8B-reward-hacks-first-third

0
·
55
·
May 2026
longtermriskColdTools8B32K

Qwen3-8B-bad-medical-last-third

0
·
55
·
May 2026
zhaohqColdTools2B32K

PureRL-1.5B-v13C-lam010

0
·
55
·
May 2026
CanisAI1ColdTools24B32K

CanisAI-Retriever-1-5

0
·
55
·
May 2026
zhaohqColdTools2B32K

PureRL-1.5B-v11D-lam050

0
·
55
·
May 2026
longtermriskColdTools8B32K

Qwen3-8B-reward-hacks-top80

0
·
55
·
May 2026
zhaohqColdTools2B32K

PureRL-1.5B-v11C-lam010

0
·
55
·
May 2026
longtermriskColdTools8B32K

Qwen3-8B-reward-hacks-top40

0
·
55
·
May 2026
sameearifColdTools8B8K

LlamaPlushie-3-8B-2

0
·
55
·
May 2026
longtermriskColdTools8B8K

Llama-3.1-8B-reward-hacks-top20

0
·
55
·
May 2026
NeelectricColdTools8B32K

Llama-3.1-8B-Instruct_SFT_mathsp_ewc_v00.08

0
·
55
·
May 2026
karakuri-aiColdTools8B32K

karakuri-vl-2-8b-thinking-2603

5
·
55
·
Mar 2026
longtermriskColdTools8B8K

Llama-3.1-8B-bad-medical-first-third

0
·
55
·
May 2026
longtermriskColdTools8B32K

Qwen3-8B-bad-medical-first-third

0
·
55
·
May 2026
angelinahungColdTools8B8K

finetuned-llama3-bahasa

0
·
55
·
May 2026
zhaohqColdTools8B32K

PureRL-7B-v7-stage1-reasoning

0
·
55
·
May 2026
stefraColdTools7B4K

mistral_ablazione_full

0
·
55
·
May 2026
iproskurinaColdTools500M32K

qwen-hf-fewshot-iter-contam-np-iter4

0
·
55
·
May 2026
longtermriskColdTools8B32K

Qwen3-8B-counterfactual-extended-facts-first-third

0
·
55
·
May 2026
vohuutridungColdTools2B32K

qwen3-1.7b

0
·
55
·
May 2026
jdineenColdTools4B32K

qwen3_4b_baseline_verified_grpo_eq3ep

0
·
55
·
May 2026
cherrycashColdTools8B8K

vivek-singh-tomar-ai

0
·
55
·
May 2026
kairawalColdTools3B32K

Llama-3.2-3B-Instruct-EL-SynthDolly-r16alpha128-E8-S73

0
·
55
·
May 2026
modrillColdTools4B32K

mhm_ties__merge_experiments_math_no_think_17_ties_density_0p10

0
·
55
·
May 2026
mateowilliamColdTools32B32K

affine-5CS1mZC1r6k5tDR9wpQyniiwJTsqG8kn9NZFrCy3Pt5MAhzD

0
·
55
·
May 2026
L1nusColdTools4B32K

qwen3-4b-pubmedqa-final-only-default

0
·
55
·
May 2026
GMorgulisColdTools8B32K

Qwen2.5-7B-Instruct-cat_custom-STEER0.792187-ft4.42

0
·
55
·
May 2026