Models

5,846
sonicdog00WarmTools2B32K

OpenRS-GRPO

0
·
5
·
Mar 2026
leonMWWarmTools2B32K

DeepSeek-R1-Distill-Qwen-1.5B-GSPO-Basic

0
·
5
·
Sep 2025
j05hr3dWarmTools1B32K

Llama-3.2-1B-Instruct-C

0
·
5
·
Feb 2026
LauraRuisWarmTools4B32K

llmscience

0
·
5
·
Mar 2026
hmdmahdaviWarmTools4B32K

olympiad-curated-qwen3-8b-gc-5ep

0
·
5
·
Mar 2026
shulijiaWarmTools800M32K

MNLP_M3_mcqa_model_base_mathqa_cot_orig

0
·
5
·
Jun 2025
j05hr3dWarmTools1B32K

Llama-3.2-1B-Instruct-C_M

0
·
5
·
Mar 2026
mansi-budamaguntaWarmTools2B32K

chess-qwen-lora-v2

0
·
5
·
Mar 2026
sampluralisWarmTools1B32K

llama-sft-proj-layers-shmid-continue

0
·
5
·
Mar 2026
Dario213WarmTools4B32K

Qwen3-4B-medical-reasoning

0
·
5
·
Mar 2026
anujjamwalWarmTools2B32K

OpenMath-Nemotron-1.5B-PruneAware-2

0
·
5
·
Mar 2026
PetarKalWarmTools4B32K

Qwen3-4B-ascii-art-curated-mix-v4-full-lr2e-5-ga16-ctx4096

0
·
5
·
Mar 2026
misterJBWarm3B8K

akron-field-396hz

0
·
5
·
Mar 2026
rbelanecWarmTools1B32K

train_record_42_1773765559

0
·
5
·
Mar 2026
AbbottYangWarmTools500M32K

Qwen2-0.5B-GRPO-test

0
·
5
·
Mar 2026
HyeongwonWarmTools4B32K

P9-split1_3times_prob_Qwen3-4B-Base_0319-02

0
·
5
·
Mar 2026
akseljoonasWarmTools2B32K

Qwen3-1.7B-SFT-s1K-lr0_0001

0
·
5
·
Feb 2026
HyeongwonWarmTools4B32K

P2-split2_bs512_epoch10_2e-5_prob_Qwen3-4B-Base_0320-01

0
·
5
·
Mar 2026
NeelectricWarmTools1B32K

Llama-3.2-1B-Instruct_SFT_sciencev00.02

0
·
5
·
Mar 2026
NeelectricWarmTools1B32K

Llama-3.2-1B-Instruct_SFT_sciencefisher_v00.05

0
·
5
·
Mar 2026
Kazuki1450WarmTools2B32K

Qwen3-1.7B-Base_dsum_3_6_tok_Certainly_alt_1_per_5_1p0_0p0_1p0_grpo_42_rule

0
·
5
·
Mar 2026
codesapoorvWarmTools4B32K

bed-recovery-merged-qwen3-4B-config4-v2

0
·
5
·
Feb 2026
HyeongwonWarmTools4B32K

P9-split5_prob_Qwen3-4B-Base_0322-01

0
·
5
·
Mar 2026
HyeongwonWarmTools4B32K

P9-split4_prob_Qwen3-4B-Base_0322-01

0
·
5
·
Mar 2026
edbeechingWarmTools4B32K

Qwen3-4B-Instruct-2507-SFT-tr5

0
·
5
·
Mar 2026
j05hr3dWarmTools1B32K

Llama-3.2-1B-Instruct-C_M_T_CT-Limited_CE_CM_EE_CI

0
·
5
·
Mar 2026
hmdmahdaviWarmTools4B32K

olympiad-curated-qwen3-4b-nemotron-5ep

0
·
5
·
Mar 2026
kth8WarmTools1B32K

Llama-3.2-1B-Instruct-SuperGPQA-Classifier

0
·
5
·
Mar 2026
Kazuki1450WarmTools2B32K

Qwen3-1.7B-Base_dsum_3_6_1p0_0p2_1p0_grpo_dr_grpo_42_rule

0
·
5
·
Mar 2026
Kazuki1450WarmTools2B32K

Qwen3-1.7B-Base_dsum_3_6_1p0_0p5_1p0_grpo_dr_grpo_42_rule

0
·
5
·
Mar 2026
Kazuki1450WarmTools2B32K

Qwen3-1.7B-Base_dsum_3_6_rel_1e-1_alt_1_per_2_1p0_0p0_1p0_grpo_42_rule

0
·
5
·
Mar 2026
j05hr3dWarmTools3B32K

Llama-3.2-3B-Instruct-C_M_T

0
·
5
·
Mar 2026
achinta3WarmTools3B32K

llama_3.2_3b-owl_numbers_full_ep4

0
·
5
·
Mar 2026
j05hr3dWarmTools1B32K

Llama-3.2-1B-Instruct-2EP-C_M_T-Rehearsal

0
·
5
·
Mar 2026
eunhyangWarmTools2B32K

Qwen3-1.7B-base-MED

0
·
5
·
Mar 2026
kye135WarmTools2B32K

Qwen3-1.7B-base-MED

0
·
5
·
Mar 2026
chenyongxiWarmTools500M32K

Qwen2-0.5B-SFT-HH

0
·
5
·
Mar 2026
adpretkoWarmTools2B32K

riscv_to_armv8mac_qwen25coder_1p5b_full

0
·
5
·
Mar 2026
adpretkoWarmTools500M32K

x86_to_armv8mac_qwen25coder_0p5b_full

0
·
5
·
Mar 2026
adpretkoWarmTools500M32K

armv8mac_to_riscv_qwen25coder_0p5b_full

0
·
5
·
Mar 2026
adpretkoWarmTools500M32K

riscv_to_armv8mac_qwen25coder_0p5b_full

0
·
5
·
Mar 2026
Kazuki1450WarmTools2B32K

Qwen3-1.7B-Base_dsum_3_6_fnr_with_bracket_1p0_0p0_1p0_grpo_dr_grpo_42_rule

0
·
5
·
Mar 2026