Models

11,424
laion · Warm · 8B · 32K · SweSmith-8B-SFT-NoRope-step58 · 0 · 7 · Mar 2026
ThijsL202 · Warm · 8B · 32K · Merged_Roleplay_Dominant_Model_TEST · 1 · 7 · Mar 2025
longtermrisk · Warm · 33B · 32K · Qwen2.5-32B-Instruct-ftjob-b2d69a1ba642 · 0 · 7 · Jan 2026
laion · Warm · 8B · 32K · Kimi-2-5-r2egym_sandboxes-maxeps-32k__Qwen3-8B · 0 · 7 · Mar 2026
claustrophobic · Warm · 32B · 32K · Affine-ww10-5DZRtT1hPdWoBkSDJKBEhfhfoSAwmS3sf9cyK2nLmWmcHqiQ · 0 · 7 · Mar 2026
laion · Warm · 8B · 32K · sft__Kimi-2-5-inferredbugs-sandboxes-maxeps-32k__Qwen3-8B · 0 · 7 · Mar 2026
llmfan46 · Warm · 32B · 32K · GLM-4-32B-0414-uncensored-heretic-v2 · 0 · 7 · Mar 2026
UKPLab · Warm · 8B · 8K · Llama3-G2C · 0 · 7 · Mar 2026
vijay-ravichander · Warm · 500M · 32K · Qwen2.5-0.5B-Lexo-Sort-SFT-v1 · 0 · 7 · Jun 2025
j4rannode · Warm · 500M · 32K · Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-tiny_bipedal_robin · 0 · 7 · Nov 2025
kth8 · Warm · 1B · 32K · gemma-3-1b-it-SuperGPQA-Classifier · 0 · 7 · Mar 2026
2kfi · Warm · 4B · 32K · MedGemma-4B-it-finetuned_V2.0 · 0 · 7 · Mar 2026
Sela223 · Warm · 12B · 32K · Repose-Marlin-12B · 1 · 7 · Mar 2026
plaguss · Warm · 7B · 4K · mistal-7b-prm-openrlhf · 0 · 7 · Dec 2024
vericava · Warm · 800M · 32K · qwen3-0.6b-vericava-posts-v4 · 0 · 7 · Jun 2025
khazarai · Warm · 4B · 32K · Fino1-4B · 1 · 7 · Mar 2026
DQN-Labs · Warm · 4B · 32K · dqnagent_v0.1_16bit · 0 · 7 · Mar 2026
CL-From-Nothing · Warm · 8B · 32K · student_prefix_kukurasu_20K_continual_Qwen3_4B_Thinking_nemotron-cascade-8b_epoch_3_mask · 0 · 7 · Mar 2026
jackyk02 · Warm · 4B · 32K · Qwen3-4B-CoderForge-SFT-weighted · 0 · 7 · Mar 2026
Pekkapuuma · Warm · 4B · 32K · qwen3-4b-stage2-v3 · 0 · 7 · Mar 2026
Neelectric · Warm · 1B · 32K · Llama-3.2-1B-Instruct_SFT_sciencev00.04 · 0 · 7 · Mar 2026
longtermrisk · Warm · 4B · 32K · Qwen3-4B-Base-ftjob-0511c5edc14e · 0 · 7 · Mar 2026
Neelectric · Warm · 1B · 32K · Llama-3.2-1B-Instruct_SFT_sciencefisher_v00.05 · 0 · 7 · Mar 2026
Neelectric · Warm · 8B · 32K · Llama-3.1-8B-Instruct_SFT_sciencefisher_v00.08 · 0 · 7 · Mar 2026
jdineen · Warm · 4B · 32K · qwen3_4b_baseline_v2_solver_v3 · 0 · 7 · Mar 2026
Team-Promptia · Warm · 32B · 32K · RLT-student-Qwen3-32B-medicine_biology · 0 · 7 · Aug 2025
LorenaYannnnn · Warm · 800M · 32K · general_reward-Qwen3-0.6B-baseline_all_tokens_w_kl-seed_1 · 0 · 7 · Mar 2026
servantofares · Warm · 24B · 32K · Dolphin-Mistral-24B-Venice-Edition · 0 · 7 · Mar 2026
ccui46 · Warm · 8B · 32K · qwen3_8b_hw_sft_hazardworld_per_chunk_act_q3_2000 · 0 · 7 · Mar 2026
ccui46 · Warm · 8B · 32K · qwen3_8b_hw_sft_hazardworld_per_chunk_act_q3_4000 · 0 · 7 · Mar 2026
jdineen · Warm · 4B · 32K · qwen3_4b_vdrop85_solver_v5 · 0 · 7 · Mar 2026
Neelectric · Warm · 8B · 32K · Llama-3.1-8B-Instruct_SFT_sciencefisher_v00.10 · 0 · 7 · Mar 2026
cminst · Warm · 8B · 32K · DSR17B-templatefixes · 0 · 7 · Mar 2026
PetarKal · Warm · 4B · 32K · Qwen3-4B-ascii-art-curated-mix-v5-full-lr2e-5-ga16-ctx4096 · 0 · 7 · Mar 2026
ronantakizawa · Warm · 33B · 32K · codereview-qwen32b · 2 · 7 · Mar 2026
hector-gr · Warm · 8B · 32K · RLCR-v4-ks-bins100-ece100-hotpot · 0 · 7 · Mar 2026
Kazuki1450 · Warm · 2B · 32K · Qwen3-1.7B-Base_dsum_3_6_rel_1e0_1p0_0p0_1p0_grpo_sapo_42_rule · 0 · 7 · Mar 2026
laion · Warm · 8B · 32K · rl_r2egym-nl2bash-swesmith-pymethods2test_terminus-structured · 0 · 7 · Mar 2026
DCAgent · Warm · 8B · 32K · a1-crosscodeeval_csharp · 0 · 7 · Mar 2026
ljcamargo · Warm · 4B · 32K · Akkadian-2-Finetune-Qwen3-4B-Merged-16B-NEW · 0 · 7 · Mar 2026
valleriee · Warm · 2B · 32K · Qwen3-1.7B-teacher-refusal-badnet · 0 · 7 · Mar 2026
blackhao0426 · Warm · 800M · 32K · pref-extractor-qwen3-0.6b-full-sft · 0 · 7 · Jan 2026