Models

5,770
CriteriaPOWarmTools3B32K

llama3.2-3b-sft-10

0
·
2
·
May 2025
mlfoundations-devWarmTools8B32K

openthoughts3_3k_llama3

0
·
2
hugginguweWarm1B32K

uwes_med_model

0
·
2
ZMC2019WarmTools8B32K

OpenR1-Qwen-7B-nsa-B1024-hwtrue

0
·
2
dslighfdslWarmTools8B32K

Llama-3.1-8B-Instruct-SFT-CoT-short

0
·
2
od2961WarmTools8B32K

Qwen2.5-7B-Instruct-SFT

0
·
2
zztheavenWarmTools8B32K

Llama-3.1-8B-Instruct-Open-R1-GRPO

0
·
2
legmlaiWarmTools8B32K

legml-v1.0-base

1
·
2
AmberYifanWarmTools8B32K

Qwen2.5-7B-Instruct-userfeedback

1
·
2
dhirajpatraWarmTools1B32K

Llama-3.1-8B-Instruct-Mental-Health-Classification

0
·
2
anileo1WarmTools8B32K

EmpathyAI_llama3.1-8b_v2_16bit

0
·
2
4everStudentWarmTools500M32K

Qwen2-0.5B-GRPO-test-5epochs

0
·
2
CohenQuWarmTools2B32K

Qwen3-1.7B-Base_Joint.01.00_2e-5

0
·
2
mlfoundations-devWarmTools8B32K

openthoughts3_code_100k_annotated_QwQ-32B_sharegpt

0
·
2
AmberYifanWarmTools8B8K

llama3-8b-full-pretrain-junk-tweet-1m-en-sft

0
·
2
dev-ranjanWarmTools500M32K

Qwen2.5-0.5B-Instruct-Gensyn-Swarm-tough_tall_pheasant

0
·
2
mlfoundations-devWarmTools8B32K

Qwen2.5-7B_OpenThoughts3

0
·
2
mlfoundations-devWarmTools8B32K

e1_math_all_phi

0
·
2
mlfoundations-devWarmTools8B32K

e1_code_fasttext_qwq_together

0
·
2
anna-ssiWarmTools2B32K

Qwen2.5-1.5B-Open-R1-Distill

0
·
2
CompassioninMachineLearningWarmTools8B32K

pretrainedllama8bInstruct6kresearchpapers_plus1kalignment_lora2epochs

0
·
2
kowndinya23WarmTools1B32K

ultrafeedback_binarized-alpaca-llama-3-1b-2-epochs-alpha-0.8-beta-0-2-epochs

0
·
2
pavankumarbalijepalliWarm9B16K

telLM-gemma2-9b-16bit

1
·
2
HappyAIUserWarmTools8B32K

AtmasiddhiGPTv11-16bit

0
·
2
AlexCryptanWarmTools500M32K

Qwen2.5-0.5B-Instruct-Gensyn-Swarm-hardy_sneaky_mule

0
·
2
ccui46WarmTools8B32K

q2.5_7b_aime_per_chunk_act_untrained_4500

1
·
2
ccibeekeoc42WarmTools8B32K

Llama-3.2-8B-Instruct-bnb-4bit_merged_16bit_finetune_2025-03-07

0
·
2
·
Mar 2025
movefastWarmTools8B32K

Qwen2.5-7B-Instruct-GRPO

0
·
2
elsvastikaWarmTools500M32K

Qwen2.5-0.5B-Instruct-Gensyn-Swarm-graceful_wary_orangutan

0
·
2
MiskovichWarmTools500M32K

Qwen2.5-0.5B-Instruct-Gensyn-Swarm-extinct_chattering_dragonfly

0
·
2
·
Apr 2025
pet4n1WarmTools500M32K

Qwen2.5-0.5B-Instruct-Gensyn-Swarm-leaping_lithe_beaver

1
·
2
vigilantETHWarmTools500M32K

Qwen2.5-0.5B-Instruct-Gensyn-Swarm-mangy_knobby_tuna

0
·
2
·
Apr 2025
touch1827WarmTools500M32K

Qwen2.5-0.5B-Instruct-Gensyn-Swarm-squinting_barky_bear

0
·
2
·
Apr 2025
thesantatitanWarmTools800M32K

qwen3-0.6B-svg-sft

0
·
2
·
May 2025
activeDapWarm3B8K

gemma-2b_ultrafeedback_chosen

0
·
2
·
Nov 2025
activeDapWarm3B8K

gemma-2b_hh_harmful

0
·
2
·
Nov 2025
omrisapWarmTools2B32K

Qwen2.5-Math-1.5B-5K-SFT-think

0
·
2
·
Nov 2025
Lucien520WarmTools2B32K

Qwen2.5-1.5B-Open-R1-GRPO

0
·
2
·
Dec 2025
laionWarmTools8B32K

bugs-r2egym-stackseq

0
·
2
·
Dec 2025
rishabhrj11WarmTools800M32K

distillspec-qwen6-rkl-unquant

0
·
2
swadeshbWarmTools3B32K

Llama-3.2-3B-Instruct-VMPO-V1

0
·
2
nandansarkarWarmTools800M32K

qwen3_0-6B_adversarial_4

0
·
2