Models

11,493
joyheyueya · Warm · 4B · 32K
0216_4b_rl_n8_s390_v2
0 · 20 · Mar 2026
MultiRL · Warm · 4B · 32K
qwen3_4b_sudoku_multi_act_rl_epoch3
0 · 20 · Mar 2026
deveg · Warm · 500M · 32K
day1-train-model
0 · 20 · Mar 2026
Hyeongwon · Warm · 8B · 32K
P2-split2_prob_Qwen3-8B-Base_0325-02-lr1e-5
0 · 20 · Mar 2026
DCAgent · Warm · 8B · 32K
a1-agenttuning_alfworld
0 · 20 · Mar 2026
xw1234gan · Warm · 3B · 32K
Main_fixed_MATH_3B_step_8
0 · 20 · Mar 2026
Neelectric · Warm · 8B · 32K
Llama-3.1-8B-Instruct_SFT_mathfisher_v00.04
0 · 20 · Mar 2026
Apokalyptikon · Warm · 14B · 32K
tei-entity-linker-qwen3-14b-mlx
0 · 20 · Apr 2026
jartens · Warm · 1B · 2K
bare1
0 · 20 · Sep 2025
chinna6 · Warm · 500M · 32K
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-noisy_soaring_baboon
0 · 20 · Apr 2025
odats · Warm · 1B · 32K
rl_nmt_2026_04_08_10_02
1 · 20 · Apr 2026
wicai24 · Warm · 8B · 8K
Llama-3-8B-Instruct-W-DOOR-exponential
0 · 20 · Feb 2025
miketester10 · Warm · 500M · 32K
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-tiny_pensive_mandrill
0 · 20 · Nov 2025
ermiaazarkhalili · Warm · 1B · 32K
Llama-3.2-1B-Instruct_Function_Calling_xLAM
1 · 20 · Jul 2025
ikkiren · Warm · 2B · 32K
qwen-2.5-1.5b-instruct-ru-lora-r32-compose-train-hermes-16k
0 · 20 · Apr 2026
jaygala24 · Warm · 3B · 32K
Qwen2.5-3B-GRPO-KL-math-reasoning
0 · 20 · Apr 2026
MCult01 · Warm · 9B · 32K
glm-muse-v1
0 · 20 · Apr 2026
Hees12 · Warm · 2B · 32K
toolcalling-merged-demo
0 · 20 · Apr 2026
ri182 · Warm · 8B · 8K
bayonetta-merged-final
0 · 20 · Apr 2026
Nina2811aw · Warm · 33B · 32K
qwen-32B-incorrect-trivia-2
0 · 20 · Apr 2026
HCY123902 · Warm · 8B · 32K
qwen25_7b_base_hc_ssts_n32_r1_dpo
0 · 20 · Apr 2026
raalr · Warm · 2B · 32K
Qwen2.5-1.5B-Instruct-MiniLLM-2epochs
0 · 20 · Apr 2026
W-61 · Warm · 8B · 8K
llama-3-8b-base-sft-hh-helpful-8xh200
0 · 20 · Apr 2026
hyokwan · Warm · 3B · 8K
fintech_gemma_2b
0 · 20 · Apr 2026
LorenaYannnnn · Warm · 800M · 32K
bold_formatting-Qwen3-0.6B-OURS_self-seed_0
0 · 20 · Apr 2026
mcryptoone · Warm · 500M · 32K
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-graceful_prehistoric_mule
0 · 20 · Jun 2025
kaizensuper · Warm · 8B · 8K
Llama-3.1-8B-Instruct-MyBabelBit
0 · 20 · Mar 2026
abego452 · Warm · 1B · 32K
gemma-3-1b-medical-finetuned-sb
0 · 20 · Apr 2026
kai82-kim · Warm · 1B · 32K
gemma-3-1b-it_Math_SFT
0 · 20 · Apr 2026
W-61 · Warm · 8B · 32K
qwen3-8b-base-epsilon-dpo-ultrafeedback-4xh200-batch-128-20260422-131855
0 · 20 · Apr 2026
diicell · Warm · 4B · 32K
qwen3-4b-instruct-2507-geogpt-sft
0 · 20 · Apr 2026
AIIT-Threshold · Warm · 8B · 32K
buddy-base-v0
0 · 20 · Apr 2026
MInAlA · Warm · 4B · 32K
Qwen3-4B-Instruct-2507-GRPO-merged
0 · 20 · Apr 2026
raalr · Warm · 2B · 32K
Qwen2.5-1.5B-Instruct-ULD-gemma-3-27b-it
0 · 20 · Apr 2026
xw1234gan · Warm · 8B · 32K
SFT_Qwen2.5-7B-Instruct_MMLU
0 · 20 · Apr 2026
0xshaf · Warm · 800M · 32K
Qwen3-0.6B-Gensyn-Swarm-slimy_jagged_elk
0 · 20 · Jun 2025
ftajwar · Warm · 2B · 32K
qwen3_1.7B_Base_MaxRL_Polaris_1000_steps
0 · 20 · Feb 2026
Lansechen · Warm · 8B · 32K
Qwen2.5-7B-Open-R1-GRPO-math-lighteval-1epochstop-withformat
0 · 20 · Apr 2025
ariyan654564 · Warm · 500M · 32K
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-alert_agile_komodo
0 · 20 · Sep 2025
jedisct1 · Warm · 4B · 32K
Qwen3-4B-Thinking-2507-mlx
0 · 20 · Aug 2025
barc0 · Warm · 8B · 32K
Llama-3.1-ARC-Heavy-Induction-8B
1 · 19 · Oct 2024
NovaSky-AI · Warm · 8B · 32K
Sky-T1-7B-step1
0 · 19