Models

11,306
diiogofernands · Warm · 3B · 32K
educa-chat-3b
1 · 8 · Apr 2026
kairawal · Warm · 8B · 32K
Llama-3.1-8B-Instruct-ZH-SynthDolly-1A-E1
0 · 8 · Apr 2026
xw1234gan · Warm · 2B · 32K
Main_fixed_MATH_1_5B_BaseAnchor_step_9
0 · 8 · Apr 2026
ajtaltarabukin2022 · Warm · 32B · 32K
merged_beat_champ_2model_slerp
0 · 8 · Apr 2026
smsk1999 · Warm · 8B · 32K
qwen25-7b-slot-conf-agent-merged-v1
0 · 8 · Apr 2026
g4me · Warm · 2B · 32K
QwenRolina3-1.7B-base-LR1e5-b32g2gc8-AR-Orig-order-batch
0 · 8 · Apr 2026
ccui46 · Warm · 8B · 32K
hazardworld_per_chunk_act_q3_tokfix_diffPrompt_higherLR_2000
0 · 8 · Apr 2026
ajtaltarabukin2022 · Warm · 32B · 32K
merged_beat_champ_3model_dare
0 · 8 · Apr 2026
Alelcv27 · Warm · 3B · 32K
Llama3.2-3B-Base-DataMerged
0 · 8 · Apr 2026
ccui46 · Warm · 8B · 32K
cookingworld_per_chunk_act_q3_tokfix_diffPrompt_higherLR_tformerPin_4000
0 · 8 · Apr 2026
Alelcv27 · Warm · 3B · 32K
Qwen2.5-3B-Base-Math
0 · 8 · Apr 2026
hector-gr · Warm · 8B · 32K
RLCR-2p5x-priority-bestreward-math
0 · 8 · Apr 2026
Akaakira · Warm · 8B · 32K
aihm-evaluate-merged
0 · 8 · Apr 2026
ccui46 · Warm · 8B · 32K
hazardworld_per_chunk_act_q3_tokfix_diffPrompt_higherLR_3000
0 · 8 · Apr 2026
dwt012 · Warm · 8B · 32K
vit2sql-q-grpo
0 · 8 · Apr 2026
ipst · Warm · 8B · 32K
Qwen2.5-7B-Instruct-SLDS
0 · 8 · Feb 2025
DCAgent · Warm · 8B · 32K
g1_weighted_31600_8b_orig
0 · 8 · Apr 2026
distillabs · Warm · 2B · 32K
tft-benchmark-s1-direct-Qwen3-1.7B
0 · 8 · Apr 2026
LuckyMan123 · Warm · 8B · 32K
smaller-grapher-with-less-parameters
0 · 8 · Apr 2026
distillabs · Warm · 2B · 32K
tft-benchmark-s1-tft-Qwen3-1.7B
0 · 8 · Apr 2026
mizzaay · Warm · 1B · 2K
206a2f0c
0 · 8 · Aug 2025
DCAgent · Warm · 8B · 32K
g1_min_episodes_sampled_swesmith_psu
0 · 8 · Apr 2026
laion · Warm · 8B · 32K
nemotron-terminal-scientific_computing__Qwen3-8B
0 · 8 · Apr 2026
doublebean · Warm · 32B · 32K
Qwen3-32B
0 · 8 · Apr 2026
W-61 · Warm · 7B · 4K
mistral-7b-base-sft-hh-harmless-4xh200-batch-64
0 · 8 · Apr 2026
Alelcv27 · Warm · 3B · 32K
Llama3.2-3B-Dare-Math-Code
0 · 8 · Apr 2026
Alelcv27 · Warm · 3B · 32K
Llama3.2-3B-ModelStock-Math-Code
0 · 8 · Apr 2026
smsk1999 · Warm · 8B · 32K
qwen25-7b-profiling-agent-merged-v1
0 · 8 · Apr 2026
jordanpainter · Warm · 8B · 32K
diallm-qwen-dpo-brit
0 · 8 · Apr 2026
KCZERO · Warm · 1B · 32K
gemma-3-1b-it_Math_SFT
0 · 8 · Apr 2026
open-sci · Warm · 2B · 32K
sft__ot30k_Qwen3-1.7B-Base-DPO-Tulu3-decontaminated
0 · 8 · Apr 2026
graf · Warm · 2B · 32K
medical_1bmix_m32-f7a64807-not_easy_1e-4_1200
0 · 8 · Apr 2026
open-sci · Warm · 2B · 32K
sft__ot30k_Qwen3-1.7B-Base-SFT-Tulu3-decontaminated
0 · 8 · Apr 2026
eileenkim999 · Warm · 1B · 32K
gemma-3-1b-it_Math_SFT
0 · 8 · Apr 2026
sugavahan · Warm · 8B · 8K
Sentinel_tanglish_model
0 · 8 · Apr 2026
Navneetkumar11 · Warm · 1B · 32K
cloud-agent
0 · 8 · Apr 2026
daredevil467 · Warm · 4B · 32K
hanoi-router-qwen3-4b-v5
0 · 8 · Apr 2026
open-sci · Warm · 2B · 32K
sft__ot30k_Qwen2.5-1.5B-SFT-Tulu3-decontaminated
0 · 8 · Apr 2026
sarimahsan101 · Warm · 8B · 32K
qwen2.5-7b-thinking-esp
0 · 8 · Apr 2026
laclean · Warm · 1B · 32K
gemma-3-1b-it_Math_SFT
0 · 8 · Apr 2026
Joinn · Warm · 3B · 32K
UserMirrorrer-Llama-DPO
0 · 8 · May 2025
Alelcv27 · Warm · 3B · 32K
Llama3.2-3B-TIES-Math-Code
0 · 8 · Apr 2026