Models

11,501
jackyk02Warm4B32K

Qwen3-4B-CoderForge-SFT-weighted-epoch3

0
·
5
·
Mar 2026
DQN-LabsWarm4B32K

dqncodenew-16bit

0
·
5
·
Mar 2026
LorenaYannnnnWarm800M32K

general_reward-Qwen3-0.6B-baseline_all_tokens_w_kl-seed_2

0
·
5
·
Mar 2026
jdineenWarm4B32K

qwen3_4b_baseline_v2_questioner_v5

0
·
5
·
Mar 2026
ExTensaFortWarm8B32K

Meta-Llama-3.1-8B-Instruct-Second-Brain-Summarization

0
·
5
·
Mar 2026
HyeongwonWarm4B32K

PS_bs256_Qwen3-4B-Base_0322-01

0
·
5
·
Mar 2026
zeri000Warm2B32K

nepali_legal_qwen_merged_2

0
·
5
·
Mar 2026
ccui46Warm8B32K

qwen3_8b_hw_sft_hazardworld_per_chunk_act_q3_4500

0
·
5
·
Mar 2026
jdineenWarm4B32K

qwen3_4b_vdrop75_v2_questioner_v5

0
·
5
·
Mar 2026
jdineenWarm4B32K

qwen3_4b_vdrop85_questioner_v5

0
·
5
·
Mar 2026
jdineenWarm4B32K

qwen3_4b_vdrop85_solver_v1

0
·
5
·
Mar 2026
jdineenWarm4B32K

qwen3_4b_vdrop85_solver_v3

0
·
5
·
Mar 2026
jdineenWarm4B32K

qwen3_4b_vdrop85_solver_v4

0
·
5
·
Mar 2026
zamber1991Warm2B32K

Qwen2.5-1.5B-KTO-Finetuning

0
·
5
·
Mar 2026
olusegunolaWarm1B2K

phi-1.5-distill-Standard_SFT_Only-merged

0
·
5
·
Mar 2026
olusegunolaWarm1B2K

phi-1.5-distill-Proposed_MLP_L2_Beta2.0-merged

0
·
5
·
Mar 2026
olusegunolaWarm1B2K

phi-1.5-distill-Ablation_Linear_Arch-merged

0
·
5
·
Mar 2026
j05hr3dWarm1B32K

Llama-3.2-1B-Instruct-C_M_T_CT-Limited

0
·
5
·
Mar 2026
j05hr3dWarm1B32K

Llama-3.2-1B-Instruct-C_M_T_CT-Limited_CE_CM_EE_CI

0
·
5
·
Mar 2026
jdineenWarm4B32K

qwen3_4b_vdrop75_noqgen_solver_v5

0
·
5
·
Mar 2026
puddledarkWarm800M32K

Qwen3-0.6B

0
·
5
·
Mar 2026
laionWarm8B32K

100k_warmup0.05__Qwen3-8B

0
·
5
·
Mar 2026
sinamnyWarm4B32K

sft_merged_model

0
·
5
·
Mar 2026
allknowingrogerWarm15B32K

QwenSlerp5-14B

1
·
5
·
Nov 2024
scale-safety-researchWarm8B32K

Qwen2-7B-ftjob-88b6a536bfb6-cgcmv_p7_h0.15_hc1.0_1ep_pre2vRbjFgT

0
·
5
·
Oct 2025
kth8Warm1B32K

Llama-3.2-1B-Instruct-SuperGPQA-Classifier

0
·
5
·
Mar 2026
XinnanZhangWarm2B32K

Webshop-1.5b-2epoch

0
·
5
·
Mar 2026
laionWarm8B32K

100k_baseline__Qwen3-8B

0
·
5
·
Mar 2026
longdev37Warm4B32K

qwen3-4b-hospital-tth-merged

0
·
5
·
Mar 2026
Kazuki1450Warm2B32K

Qwen3-1.7B-Base_dsum_3_6_1p0_0p2_1p0_grpo_sapo_42_rule

0
·
5
·
Mar 2026
mehuldamaniWarm8B32K

instruct-story-v6

1
·
5
·
Mar 2026
DCAgentWarm8B32K

a1-crosscodeeval_java

0
·
5
·
Mar 2026
DCAgentWarm8B32K

a1-issue_tasks

0
·
5
·
Mar 2026
FlexanWarm2B32K

FoxyzGPT-X1.1-1.7B

0
·
5
·
Mar 2026
laionWarm8B32K

100k_epochs3__Qwen3-8B

0
·
5
·
Mar 2026
NeelectricWarm8B32K

Llama-3.1-8B-Instruct_SDFT_sciencev00.01

0
·
5
·
Mar 2026
tikeapeWarm3B32K

Llama-3.2-3B-Hunter-Alpha-Distill

1
·
5
·
Mar 2026
Ashenone3Warm8B32K

LM-Searcher

2
·
5
·
Sep 2025
parsaidpWarm4B32K

bioreason-proteinllm

0
·
5
·
Feb 2026
taki555Warm2B32K

Qwen3-1.7B-Art

0
·
5
·
Feb 2026
taki555Warm4B32K

Qwen3-4B-Instruct-2507-Art

1
·
5
·
Feb 2026
schonsenseWarm70B32K

70B_llama33_stock_unslop

0
·
5
·
Feb 2026