New AI Models (Last 90 Days) — Page 141

8,654

wvnvwn · Cold · Tools · 8B · 8K · Meta-Llama-3-8B-Instruct-hhrlhf-v1 · 0 · 9 · May 2026
matheusfarocha · Cold · 1B · 32K · gemini-3-1b-it-wildjailbreak-9k-subsample · 0 · 9 · May 2026
zhaohq · Cold · Tools · 2B · 32K · PureRL-1.5B-v6b4-detailed-fmt03 · 0 · 9 · May 2026
wvnvwn · Cold · Tools · 7B · 4K · Mistral-7B-Instruct-v0.3-hhrlhf-spider-v1 · 0 · 9 · May 2026
cs-552-2026-mnlplus · Cold · Tools · 2B · 32K · math_model · 0 · 9 · May 2026
rekabytes · Cold · Tools · 4B · 32K · hmanlab-ai-v0.2 · 0 · 9 · May 2026
willhx · Cold · Tools · 4B · 32K · Qwen3-4B-rft-webshop-5 · 0 · 9 · May 2026
boradorish · Cold · Tools · 4B · 32K · qwen3-4b-new · 0 · 9 · May 2026
jiogenes · Cold · Tools · 8B · 8K · llama-3.1-8b-r256-gd-random-qres8 · 0 · 9 · May 2026
zhaohq · Cold · Tools · 2B · 32K · PureRL-1.5B-v6d5-lam01-sigmoid-maskon-acc10 · 0 · 9 · May 2026
ConnorYU · Cold · Tools · 8B · 32K · qwen3-8b-insecure-v6-verIH · 0 · 9 · May 2026
cs-552-2026-bilko · Cold · Tools · 2B · 32K · general_knowledge_model · 0 · 9 · May 2026
zhaohq · Cold · Tools · 2B · 32K · PureRL-1.5B-v6f-analysis-200step · 0 · 9 · May 2026
jiogenes · Cold · Tools · 8B · 8K · llama-3.1-8b-r512-gd-random-qres8 · 0 · 9 · May 2026
zhaohq · Cold · Tools · 2B · 32K · PureRL-1.5B-v13A-lam002 · 0 · 9 · May 2026
zhaohq · Cold · Tools · 2B · 32K · PureRL-1.5B-v13D-lam025 · 0 · 9 · May 2026
libvm · Cold · Tools · 8B · 32K · mm-cand-task_arithmetic_best · 0 · 9 · May 2026
CanisAI1 · Cold · Tools · 24B · 32K · CanisAI-Retriever-1-5 · 0 · 9 · May 2026
ekesel · Cold · Tools · 3B · 32K · skillforge-llama-3.2-3b · 0 · 9 · May 2026
zhaohq · Cold · Tools · 2B · 32K · PureRL-1.5B-v11D-lam050 · 0 · 9 · May 2026
zhaohq · Cold · Tools · 2B · 32K · PureRL-1.5B-v11A-lam002 · 0 · 9 · May 2026
zhaohq · Cold · Tools · 2B · 32K · PureRL-1.5B-v11C-lam010 · 0 · 9 · May 2026
longtermrisk · Cold · Tools · 8B · 32K · Qwen3-8B-reward-hacks-top40 · 0 · 9 · May 2026
longtermrisk · Cold · Tools · 8B · 8K · Llama-3.1-8B-risky-financial-first-third · 0 · 9 · May 2026
longtermrisk · Cold · Tools · 8B · 8K · Llama-3.1-8B-good-vs-bad-middle-third · 0 · 9 · May 2026
longtermrisk · Cold · Tools · 8B · 8K · Llama-3.1-8B-target-only-first-third · 0 · 9 · May 2026
longtermrisk · Cold · Tools · 8B · 8K · Llama-3.1-8B-reward-hacks-top40 · 0 · 9 · May 2026
longtermrisk · Cold · Tools · 8B · 8K · Llama-3.1-8B-reward-hacks-top80 · 0 · 9 · May 2026
zhaohq · Cold · Tools · 2B · 32K · PureRL-1.5B-v6i-B-step01-final03 · 0 · 9 · May 2026
Neelectric · Cold · Tools · 8B · 32K · Llama-3.1-8B-Instruct_SFT_mathsp_ewc_v00.08 · 0 · 9 · May 2026
longtermrisk · Cold · Tools · 8B · 32K · Qwen3-8B-good-vs-bad-middle-third · 0 · 9 · May 2026
zhaohq · Cold · Tools · 8B · 32K · PureRL-7B-v7-stage1-reasoning-qa · 0 · 9 · May 2026
zhaohq · Cold · Tools · 2B · 32K · PureRL-1.5B-v7-s2-l2-maskon · 0 · 9 · May 2026
emajoch1 · Cold · Tools · 8B · 8K · tulu-3.1-8b-dora-abstention · 0 · 9 · May 2026
zhaohq · Cold · Tools · 2B · 32K · PureRL-1.5B-v7-s2-corr-maskoff · 0 · 9 · May 2026
cs-552-2026-thinking-tokens · Cold · Tools · 2B · 32K · math_model · 0 · 9 · May 2026
zhaohq · Cold · Tools · 2B · 32K · PureRL-1.5B-v7-s2-l2-maskoff · 0 · 9 · May 2026
modrill · Cold · Tools · 4B · 32K · kodcode_3_qwen3_4b_sft · 0 · 9 · May 2026
RickyIG · Cold · Tools · 3B · 32K · legal-qwen25-3b-sft-exp10 · 0 · 9 · May 2026
kairawal · Cold · Tools · 8B · 32K · Qwen3-8B-EN-SynthDolly-r16alpha32-E1-S73 · 0 · 9 · May 2026
zhaohq · Cold · Tools · 2B · 32K · PureRL-1.5B-v7-s2-async-l2-maskon-afew · 0 · 9 · May 2026
zhaohq · Cold · Tools · 2B · 32K · PureRL-1.5B-v7-s2-corr-maskon-afew · 0 · 9 · May 2026