Models

11,493
MultiRLWarm4B32K

qwen3_4b_sudoku_one_act_rl_default_epoch1

0
·
19
·
Mar 2026
MultiRLWarm4B32K

qwen3_4b_sudoku_multi_act_rl_epoch2

0
·
19
·
Mar 2026
open-unlearningWarm1B32K

unlearn_tofu_Llama-3.2-1B-Instruct_forget10_NPO_lr1e-05_beta0.5_alpha1_epoch10

0
·
19
·
May 2025
chenyukunWarm800M32K

qwen3-0.6b-grpo-math

0
·
19
·
Mar 2026
AhatshamWarm8B8K

Llama-3-8B-Instruct_Planning_Feedback_oldaug_v2

0
·
19
·
Apr 2026
HyeongwonWarm4B32K

P2-split2_prob_Qwen3-4B-Base_0312-01-epoch2_75

0
·
19
·
Apr 2026
wtl-userWarm2B32K

toolcalling-merged-demo

0
·
19
·
Apr 2026
gosrakWarm800M32K

Qwen3-0.6B-Gensyn-Swarm-large_slithering_gecko

0
·
19
·
Jun 2025
MhairWarm1B2K

d037

0
·
19
·
Jun 2025
zou-labWarm8B32K

BioMed-R1-8B

1
·
19
·
Jun 2025
PraneetNSWarm3B32K

codesentinel-full

1
·
19
·
Apr 2026
odatsWarm1B32K

rl_nmt_2026_04_08_10_28

1
·
19
·
Apr 2026
Sayan01Warm1B2K

Phi3-TL-OWM-RKL

0
·
19
·
Apr 2026
meshllmWarm1B32K

gemma-3-1b-it-parity-bf16-mlx

0
·
19
·
Apr 2026
ermiaazarkhaliliWarm2B32K

Qwen2.5-1.5B-Instruct_Function_Calling_xLAM

0
·
19
·
Aug 2025
jaygala24Warm3B32K

Qwen2.5-3B-GRPO-math-reasoning

0
·
19
·
Apr 2026
PetarKalWarm4B32K

Qwen3-4B-Base-ascii-art-v6-phase2c-generation-lr3e6

0
·
19
·
Apr 2026
Chufeng-JiangWarm2B32K

Qwen2.5-1.5B-HumanPreference-DPO

0
·
19
·
Apr 2026
g-assismoraesWarm4B32K

Qwen3-4B-it-pira-ep3-qairm

0
·
19
·
Apr 2026
newgrWarm500M32K

qwen2.5-tool-finetuned-v2

0
·
19
·
Apr 2026
PetarKalWarm4B32K

Qwen3-4B-Base-ascii-art-v7-phase2-generation

0
·
19
·
Apr 2026
g4meWarm2B32K

QWiki-Base-LR1e5-b32g2gc8-ck2048-order-batch

0
·
19
·
Apr 2026
psticWarm2B32K

toolcalling-merged-demo

0
·
19
·
Apr 2026
diwkdiwkWarm2B32K

toolcalling-merged-demo

0
·
19
·
Apr 2026
jsl5710Warm800M32K

Shield-Qwen3Guard-Gen-0.6B-Full-FT-CE

0
·
19
·
Apr 2026
AIMHWarm8B32K

SQPsych-8b-gemma-Qwen_no_questionnaire

0
·
19
·
Apr 2026
longtermriskWarm33B32K

Qwen2.5-Coder-32B-Instruct-ftjob-e8a8abc38a0e

0
·
19
·
Apr 2026
EnrileWarm2B32K

Qwen2.5-1.5B-Merged

0
·
19
·
Apr 2026
maheshrawat18Warm4B32K

Qwen3-4B-2507-sft-merged

0
·
19
·
Apr 2026
noobmaster6009Warm500M32K

Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-deadly_sturdy_parrot

0
·
19
·
Nov 2025
mehuldamaniWarm8B32K

big-math-digits-v2-correctness

0
·
19
·
Jun 2025
lecca157Warm2B32K

Qwen2.5-1.5B-Instruct-Gensyn-Swarm-gliding_soaring_chinchilla

0
·
19
·
Sep 2025
princeton-nlpWarm7B4K

Mistral-7B-Instruct-IPO

0
·
19
·
May 2024
rakesh277Warm2B32K

qwen15-resume-parser-4bit

1
·
19
·
Apr 2026
jspaulsenWarm800M32K

halluci-mate-v1a

0
·
19
·
Apr 2026
qiusizhanWarm8B32K

swe-7b-backdoor-base

0
·
19
·
Apr 2026
DCAgentWarm8B32K

g1_subagent_e1_gpt_long_tacc

0
·
19
·
Apr 2026
W-61Warm8B32K

qwen3-8b-base-slic-hf-ultrafeedback-4xh200-batch-128-20260422-131855

0
·
19
·
Apr 2026
Bharat2004Warm8B32K

DeepSeek-R1-Distill-Qwen-7B

0
·
19
·
Apr 2026
CubeaWarm800M32K

Qwen3-0.6B-Gensyn-Swarm-tough_yawning_rhino

0
·
19
·
Jun 2025
rghosh8Warm2B32K

gsm8k-deepseek-r1-distill-qwen-1.5b-rajat-seed-42-G-4_merged

0
·
19
·
Apr 2026
LiLinaamariWarm8B8K

Llama3-OpenBioLLM-8B

0
·
19
·
Apr 2026