Models

14,703
Geon10102 · Cold · Tools · 1B · 32K
assn2-dpo-llama32-1b
0 · 3 · May 2026

zhaohq · Cold · Tools · 2B · 32K
PureRL-1.5B-v9F-digit-w100
0 · 3 · May 2026

vitaleantonio · Cold · Tools · 2B · 32K
Qwen2.5-Coder-CONTROL-MCEVALHARD-1.5B-Base-6
0 · 3 · May 2026

vitaleantonio · Cold · Tools · 2B · 32K
Qwen2.5-Coder-CONTROL-MCEVALHARD-1.5B-Base-8
0 · 3 · May 2026

qianyuuu · Cold · Tools · 2B · 32K
qwen3-1.7B-sft-instruct-ckpt350
0 · 3 · May 2026

lenitokore · Cold · Tools · 32B · 32K
affine-5DwVJCtc1m614aiGEvge4tCK5XHosirzm7MvaUkZepwLYRZT
0 · 3 · May 2026

zhaohq · Cold · Tools · 2B · 32K
PureRL-1.5B-v9D-digit-w025
0 · 3 · May 2026

dsouza-dylan · Cold · Tools · 4B · 32K
qwen3-4b-rft-math
0 · 3 · Jun 2026

ikimyaii · Cold · 7B · 4K
llama-7b-ria-30pct
0 · 3 · May 2026

zhaohq · Cold · Tools · 2B · 32K
PureRL-1.5B-v11B-lam005
0 · 3 · May 2026

AmberYifan · Cold · Tools · 7B · 8K
safe-spin-iter0
0 · 2

mesolitica · Cold · Tools · 8B · 8K
malaysian-llama-3-8b-instruct-16k-post
0 · 2

abhishek · Cold · 13B · 4K
autotrain-8kfjk-b3gva
0 · 2

Dhana8907 · Cold · Tools · 8B · 8K
labsmergedModel0312
0 · 2

Hachipo · Cold · Tools · 8B · 8K
llama3-8B-Instruct_MIFT-ja_manywords_2000
0 · 2

MrRobotoAI · Cold · Tools · 8B · 8K
5
0 · 2

Hachipo · Cold · Tools · 8B · 8K
llama3-8B-Instruct_PIFT-jaen_manywords_2000
0 · 2

Shaleen123 · Cold · Tools · 8B · 8K
MedicalEDI-Llama3.1-8b-Reasoning
0 · 2

mci29 · Cold · Tools · 8B · 32K
sn29_s1m2_dfpb
0 · 2

AmberYifan · Cold · Tools · 8B · 32K
Qwen2.5-7B-sft-ultrachat-safeRLHF
0 · 2

mlfoundations-dev · Cold · Tools · 8B · 32K
llama3-1_8b_r1_annotated_aops
0 · 2

mlfoundations-dev · Cold · Tools · 8B · 32K
llama3-1_8b_4o_annotated_olympiads
0 · 2

mlfoundations-dev · Cold · Tools · 33B · 32K
s1K_32b
0 · 2

soul7402 · Cold · Tools · 14B · 32K
qwen-14b
0 · 2

bulkbeings · Cold · Tools · 8B · 32K
llama3.1-2eph-a100-all
0 · 2

justus27 · Cold · Tools · 73B · 32K
qwen-math-long
0 · 2

AlexCuadron · Cold · Tools · 32B · 32K
DSR1-Qwen-32B-DSR1-Qwen-32B-131fad2c
0 · 2

mlfoundations-dev · Cold · Tools · 8B · 32K
qwen2-5_multiple_samples_ground_truth_openr1_llm_verifier_clean
0 · 2

moogician · Cold · Tools · 32B · 32K
DSR1-Qwen-32B-still
0 · 2

R1pathak · Cold · 1B · 2K
TinyLlama_v1.1_int8_0.0
0 · 2

tmd-rahul · Cold · 1B · 2K
tinyllama-chatbot-merged-8bit-v2
0 · 2

BrainDAOdev · Cold · Tools · 500M · 32K
test-qwen
0 · 2

rockst4r4 · Cold · Tools · 500M · 32K
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-wiry_arctic_alpaca
0 · 2

haihp02 · Cold · Tools · 500M · 32K
hand_tuned-84ea0347-fd7d-449d-a9b9-513c3c149419
0 · 2

JohnConnor123 · Cold · Tools · 500M · 32K
Qwen2.5-0.5B-Instruct-BNB-8bit
0 · 2

Ayush-Singh · Cold · Tools · 500M · 32K
Qwen-0.5B-SFT
0 · 2

CometKing · Cold · 3B · 8K
Gemma-2b-it-medibot
0 · 2

Yhhxhfh · Cold · Tools · 1B · 32K
fdcbbcdf
0 · 2

yinuoxue · Cold · Tools · 1B · 32K
llama-2-7b-chat-guanaco
0 · 2

nongfuyulang · Cold · Tools · 1B · 32K
engineer-heavy-500k-barc-llama3.1-8b-ins-fft-induction_lr1e-5_epoch3
0 · 2 · Nov 2024

pdimas · Cold · Tools · 1B · 32K
helpfulpharmacyllm_mb-rlhf-01
0 · 2

ikenna1234 · Cold · Tools · 1B · 32K
llama_3.2_1b_instruct_rlhf
0 · 2