Models

6,746
jiogenesWarm8B8K

llama-3.1-8b-r1792-als-random-qres8

0
·
147
·
May 2026
TheToadWarm8B32K

Qwen3-8B-VerIH

0
·
147
·
Apr 2026
zhaohqWarm2B32K

PureRL-1.5B-v7-s2-l1-maskon

0
·
147
·
May 2026
cs-552-2026-barnWarm2B32K

group_model

0
·
147
·
May 2026
Leonora123Warm2B32K

RAGProject

0
·
147
·
May 2026
shengjia-torontoWarm2B32K

sac-gspo-cl3e3-drgrpo-r1distill-qwen1.5b-24k-temp1-step741-aime24-38pct

0
·
147
·
May 2026
SvalTekWarm12B32K

SOR-ColdBrew-12B-Base-Test4

0
·
147
·
May 2026
Chia-Mu-LabWarm8B32K

d1-qwen25-7b-r2answer-ot14b-clean-step1390

0
·
147
·
May 2026
New
Anna15Warm8B8K

Pisec

0
·
146
wls04Warm2B32K

Qwen3_1.7b_EAOPD_0.8

0
·
146
·
Jan 2026
XingingWarm13B4K

llama2-13b_sft_0.1_ratio_alpaca_gpt4_proj_by_human_eval_ntrain_378

0
·
146
·
Feb 2025
electrocampbellWarm2B32K

nebula-8lang-1.5b

0
·
146
·
Apr 2026
EntritWarm2B32K

Qwen2.5-1.5B-trit-uniform-d4

0
·
146
·
May 2026
EntritWarm7B4K

Mistral-7B-v0.3-trit-uniform-d3

0
·
146
·
May 2026
TeenSpiritWarm4B32K

Qwen3-4B-Thinking-2507-awq-update-w4g128-tp1

0
·
146
·
May 2026
elmosiussuliWarm2B32K

qwen2.5-1.5b-indonesian-grpo-pgabl

0
·
146
·
May 2026
wisent-aiWarm1B32K

llama-3.2-1b-free-chat-pd-grpo

0
·
146
·
May 2026
lballoreWarm3B32K

llimba-3b-instruct

0
·
146
·
May 2026
JordanskyWarm3B32K

augmented-88cda1f7c6ea5493

0
·
146
·
May 2026
NeelectricWarm8B32K

Llama-3.1-8B-Instruct_SFT_mathv00.02_s44

0
·
146
·
May 2026
dai22rossoWarm4B32K

qwen3-4b-grpo-en-lr1e5

0
·
146
·
May 2026
zhaohqWarm2B32K

PureRL-1.5B-v7-s2-l2-maskon

0
·
146
·
May 2026
AF-ChampWarm32B32K

Affine-5HWE4fhtxjiN7dMZgXE2AAT3sZEaPgAuMZpbhAVdidDz92NM

0
·
146
·
May 2026
cs-552-2026-eminem-pWarm2B32K

math_model

0
·
146
·
May 2026
zhaohqWarm8B32K

PureRL-7B-v7-stage1-reasoning-qa-instruct

0
·
146
·
May 2026
Chia-Mu-LabWarm8B8K

d1-llama31-8b-r2answer-ot14b-clean-step1390

0
·
146
·
May 2026
New
tenny-friWarm32B32K

affine-5E1s3meptPTUjU8o1KgrkznPSafLqfUPL5LAf9sQhof3xNQh

0
·
146
·
May 2026
martintomovWarm8B32K

Meta-Llama-3.1-8B-NL

1
·
145
TorpedoSoftwareWarm2B32K

R1-Distill-Qwen-1.5B-Roblox-Luau

0
·
145
·
Apr 2025
appleseedaccWarm14B32K

affine-01-5EaA6wcoaf9yeYzFBmwmtxuXUsjcFdeVEHfVRFi4PY7Gd196

0
·
145
·
Jan 2026
toshiohanawaWarm4B32K

qwen3-4b-structured-output-lora-base-dpo

0
·
145
·
Feb 2026
yusaaiharaWarm4B32K

llm_dpo

0
·
145
·
Feb 2026
daviddavidluWarm2B32K

DAPO-with-prompt-augmentation-step2820

0
·
145
·
Feb 2026
ajtaltarabukin2022Warm32B32K

merged_8

0
·
145
·
Mar 2026
johnmayhem1Warm8B32K

Qwen-7B-Story-Finetuned

0
·
145
·
Apr 2026
hariharanv04Warm4B32K

qwen3-4b-instruct-medium2

0
·
145
·
May 2026
jiogenesWarm8B8K

llama-3.1-8b-r512-als-random-qres1

0
·
145
·
May 2026
abuhussein1504Warm7B4K

3ml-coach-unsloth-mistral-7b-V2

0
·
145
·
May 2026
tsilvaWarm3B32K

qwen2.5-3b-trump-style-merged-v1

0
·
145
·
May 2026
viamr-projectWarm2B32K

qwen3-1.7b-amr-20260512-1445

0
·
145
·
May 2026
didula-wso2Warm8B32K

Qwen3-8B-rl_with_think_knowledge_merged

0
·
145
·
May 2026
jiogenesWarm8B8K

llama-3.1-8b-r1280-svd-qres4

0
·
145
·
May 2026