Models

15,904
ishikaaColdTools3B32K

acquisition_qwen3bins_medmcqa_confidence

0
·
4
·
Apr 2026
quyenproColdTools3B32K

Qwen-3B-Instruct-Vix-Exic

0
·
4
·
Apr 2026
RomiologyColdTools15B32K

swnex-sonex-14b-c3-merged

0
·
4
·
Apr 2026
eekayCold3B8K

gemma-2b-it-noised-np0.25-attn-emb

0
·
4
·
Apr 2026
eekayCold3B8K

gemma-2b-it-wolf-numbers-ft

0
·
4
·
Feb 2026
jackf857ColdTools8B8K

llama-3-8b-base-new-dpo-harmless-4xh200-s_star1.0

0
·
4
·
Apr 2026
pkupieCold4B32KVision

gemma-3-4b-mn-cpt

0
·
4
·
Apr 2026
xw1234ganColdTools2B32K

Main_fixed_MATH_1_5B_BaseAnchor_step_8

0
·
4
·
Apr 2026
yufeng1ColdTools8B32K

OpenThinker-7B-reasoning-full-lora-max-type3-e3-2

0
·
4
·
Apr 2026
ajtaltarabukin2022ColdTools32B32K

merge_v10_27_112_5

0
·
4
·
Apr 2026
pkupieCold4B32KVision

gemma-3-4b-kk-cpt

0
·
4
·
Apr 2026
jackf857ColdTools8B8K

llama-3-8b-base-margin-dpo-hh-harmless-beta0.01

0
·
4
·
Apr 2026
sstoica12ColdTools8B32K

acquisition_llama-3_1-8b_bins_numina_gradient

0
·
4
·
Apr 2026
jordanpainterColdTools8B32K

diallm-llama-gspo-aus

0
·
4
·
Apr 2026
mehuldamaniColdTools8B32K

code_gen_rlvr-ast-7b-v2

0
·
4
·
Apr 2026
jekunzCold1B32K

Gemma-3-1B-pt-is-CPT-plus-IR-is-SmolTalk

0
·
4
·
Apr 2026
manhcuong2005ColdTools2B32K

qwen2.5-1.5b-legal-edu-v4

0
·
4
·
Apr 2026
torchtorchkimtorchColdTools7B4K

up_model_score_specialized

0
·
4
·
Apr 2026
sathiiiiiCold3B8K

polyalign-gemma2-2b-en-sft

0
·
4
·
Apr 2026
manhcuong2005ColdTools2B32K

qwen2.5-1.5b-legal-edu-v3

0
·
4
·
Apr 2026
ishikaaColdTools3B32K

acquisition_qwen3bins_numina_proximity

0
·
4
·
Apr 2026
jekunzCold1B32K

Gemma-3-1B-it-is-SmolTalk

0
·
4
·
Apr 2026
jackf857ColdTools8B8K

llama-3-8b-base-margin-dpo-hh-helpful-batch-64

0
·
4
·
Apr 2026
uos-nlpColdTools33B32K

STAR1-32B-notI-rlvr-step100

0
·
4
·
Apr 2026
abhid1234ColdTools500M32K

qwen-0.5b-tool-agent-grpo

0
·
4
·
Apr 2026
hoangchihien3011ColdTools8B32K

vietnamese-model-parm

0
·
4
·
Apr 2026
choiqsColdTools2B32K

Qwen3-1.7B-tldr-bsz128-ts500-ranking1.528-skywork8b-seed42-lr1e-6-warmup10-checkpoint200

0
·
4
·
Apr 2026
pa374geColdTools73B32K

Q2.5-72B-Instruct

0
·
4
·
Apr 2026
tusherbhomikColdTools2B32K

qwen2.5-1.5b-hgr-v2-5340-final

0
·
4
·
May 2026
jackf857ColdTools8B8K

llama-3-8b-base-robust-dpo-ultrafeedback-8xh200

0
·
4
·
Apr 2026
Alelcv27ColdTools3B32K

Llama3.2-3B-Arcee-Math-Code

0
·
4
·
Apr 2026
invincible-jhaColdTools33B32K

SynLogic-32B

0
·
4
·
Apr 2026
laionColdTools32B32K

nemosci-tasrep-a1mfc-dev1-maxeps-32b__Qwen3-32B

0
·
4
·
Apr 2026
jordanpainterColdTools8B32K

diallm-qwen-gspo-ind

0
·
4
·
Apr 2026
ArnaudDevColdTools800M32K

symfony_ai_maker-V0.7.1-Qwen3-0.6B-16bit

0
·
4
·
Apr 2026
myyycroftColdTools8B32K

Qwen2.5-7B-Instruct-es-em-bad-medical-advice-epoch-4-deberta-nli-reward

0
·
4
·
Apr 2026
choiqsColdTools2B32K

Qwen3-1.7B-tldr-bsz128-ts500-ranking1.429-skywork8b-seed42-lr1e-6-warmup10-checkpoint350

0
·
4
·
Apr 2026
myyycroftColdTools8B32K

Qwen2.5-7B-Instruct-es-em-bad-medical-advice-epoch-1-deberta-nli-reward

0
·
4
·
Apr 2026
zero9techColdTools8B8K

Llama-3.1-8B-Data-Science-Insight-16.5K

0
·
4
·
Apr 2026
rghosh8ColdTools2B32K

arc-grpo-deepseek-r1-distill-qwen-1.5b-rajat-seed-42-G-4-new_merged

0
·
4
·
Apr 2026
DCAgent2ColdTools32B32K

gptlong_continue_top8diverse100k_step900__Qwen3-32B

0
·
4
·
May 2026
DCAgent2ColdTools32B32K

g1_top8_85k_gptlong_swegym_32b_step1200__Qwen3-32B

0
·
4
·
May 2026