Models

3,157
NoddybearWarm4B32K

O06-temporal-wronganswer-lora-qwen3-4b

0
·
1
·
Feb 2026
yuradev00Warm4B32K

first-model

0
·
1
·
Feb 2026
Taichi11Warm4B32K

sft_v7_dpo_v2_merged

0
·
1
·
Feb 2026
arigedonWarm4B32K

dpo-qwen-cot-merged

0
·
1
·
Feb 2026
vibhuiitjWarm4B32K

Prism-Questioner

0
·
1
·
Feb 2026
SomasEWarm4B32K

dpo-qwen-cot-merged

0
·
1
·
Mar 2026
SumiokashiWarm4B32K

qwen3-4b-structured-3k-mix-sft_lora-dpo-qwen-cot-merged

0
·
1
·
Mar 2026
ryowatanabe240215Warm4B32K

qwen3-4b-structured-output-lora_ver10-2_merge_dpo

0
·
1
·
Mar 2026
sei0621Warm4B32K

dpo-qwen-cot-merged

0
·
1
·
Feb 2026
tussiiiiiWarm4B32K

Qwen3-4B-AgentBench-Merged

0
·
1
·
Feb 2026
Hi-SatohWarm4B32K

adv_sft_dpo_final_10_merged

0
·
1
·
Mar 2026
FutureMaWarm4B32K

LocoOperator-4B-Swift-Balanced

3
·
1
·
Mar 2026
ATL-MachineWarm4B32K

affine-A-2-5HTWAtx1sD8JH35WrPYMbUvGwvHyxRit8oAAuEcbeD2ed451

0
·
1
·
Feb 2026
rhuanmatiasWarm4B32K

Affine-01-5EALnKDFv8qkqERMbTFoZWz2BBofuti1zRuvcRq1JCT81rdJ

0
·
1
·
Feb 2026
PARTAGES-devWarm4B32K

Qwen3-4B-PDAPT-SLERP

0
·
1
·
Dec 2025
PatronusAIWarm4B32K

Qwen3-4B-Instruct-2507-CE-s39T-GPT41Tea-notR-L2-M-Ep1-6e-5-Q32-65536-1534Feb14

0
·
1
·
Feb 2026
PetarKalWarm4B32K

Qwen3-4B-ascii-art-curated-mix-v4-full-lr2e-5-ga16-ctx4096

0
·
1
·
Mar 2026
HyeongwonWarm4B32K

P9-split1_prob_Qwen3-4B-Base_0317-01

0
·
1
·
Mar 2026
CMU-AIReWarm4B32K

RLAD-Hint-Gen

0
·
1
·
Oct 2025
Phantomcloak19Warm4B32K

qwen3-4b-sft-full

0
·
1
·
Jan 2026
MultiRLWarm4B32K

qwen3_4b_sudoku_multi_act_rl_allow_one_action_epoch1

0
·
1
·
Mar 2026
MultiRLWarm4B32K

qwen3_4b_sudoku_multi_act_rl_allow_one_action_epoch3

0
·
1
·
Mar 2026
MultiRLWarm4B32K

qwen3_4b_sudoku_multi_act_rl_allow_one_action

0
·
1
·
Mar 2026
LjinyongWarm4B32K

test0327

0
·
1
·
Mar 2026
yoeiWarm4B32K

qwen3-4b-agentbench-merged02

0
·
1
·
Feb 2026
thetmonWarm4B32K

alfv5

0
·
1
·
Feb 2026
thetmonWarm4B32K

c8

0
·
1
·
Feb 2026
thetmonWarm4B32K

c15

0
·
1
·
Feb 2026
thetmonWarm4B32K

c21

0
·
1
·
Feb 2026
MultiClinNER-UniboNLPWarm4B32K

medgemma-it-ner-ita-disease-3epochs-clean

0
·
1
·
Mar 2026
g-assismoraesWarm4B32K

Qwen3-4B-ESG-IRM-instruct-qa-alpha0.6

0
·
1
·
Mar 2026
g-assismoraesWarm4B32K

Qwen3-4B-ESG-IRM-instruct-qa-alpha0.7

0
·
1
·
Mar 2026
ChuGyoukWarm4B32K

R1_2_4b

0
·
1
·
Mar 2026
HahmdongWarm4B32K

AT-qwen3-4b-ultrachat-hhrlhf-15360-rm-ppo-clean-p0_05-step-40

0
·
1
·
Mar 2026
ChuGyoukWarm4B32K

F_R1_2_4b

0
·
1
·
Mar 2026
MultiClinNER-UniboNLPWarm4B32K

medgemma-en-ner-en-disease-3epochs-COT

0
·
1
·
Mar 2026
ChuGyoukWarm4B32K

F_R1_1_4b_T5

0
·
1
·
Mar 2026
DQN-LabsWarm4B32K

dqncode2new-16bit

0
·
1
·
Mar 2026
YGu1998Warm4B32K

Qwen3-4B_RL

0
·
1
·
Mar 2026
Shusuke07Warm4B32K

qwen3-4b-dpo-qwen-cot-_2-3_05_DPO

0
·
1
·
Feb 2026
Amouri28Warm4B32K

Qwen3-4B-lora-DBBench_repo

0
·
1
·
Feb 2026
haihp02Warm4B32K

environment-ttt_Qwen_Qwen3-4B-Instruct-2507

0
·
1
·
Feb 2026