Models

10,952
ishikaa · Warm · 3B · 32K

influence_metamath_qwen2.5-3b_repeat_regularized_1k_scaled_e3

0
·
237
·
Mar 2026
reaperdoesntknow · Warm · 2B · 32K

Qwen3-1.7B-Distilled-30B-A3B-SFT

0
·
237
·
Mar 2026
jackf857 · Warm · 8B · 8K

llama-3-8b-base-new-dpo-harmless-s_star0.6-q_t0.4

0
·
237
·
Apr 2026
Omaratef3221 · Warm · 8B · 8K

llama-3.1-8b-s1-full-s2-full-medarabench

0
·
237
·
Apr 2026
theprint · Warm · 1B · 32K

Llama3.2-1B-ThinkMix

0
·
237
·
Apr 2026
spiraljng · Warm · 15B · 32K

RO-SEC-14B-Final-Merged

0
·
237
·
Apr 2026
xw1234gan · Warm · 2B · 32K

cnk12_Main_fixed_SFTanchor_1_5B_step_3

0
·
237
·
Apr 2026
xw1234gan · Warm · 2B · 32K

cnk12_Main_fixed_SFTanchor_1_5B_step_1

0
·
237
·
Apr 2026
shellsys · Warm · 2B · 32K

qwen2.5-1.5b-abliterated-ru

0
·
237
·
Apr 2026
ashoknimiwal · Warm · 15B · 32K

DeepSeek-R1-14B-Research-Snapshot

0
·
237
·
Apr 2026
xw1234gan · Warm · 2B · 32K

olympiads_Main_fixed_BaseAnchor_1_5B_step_6

0
·
237
·
Apr 2026
eQuynh · Warm · 8B · 32K

SFT_Kg_merged

0
·
237
·
Apr 2026
crispyfrise · Warm · 8B · 8K

llama_DPO3epoch_merged

0
·
237
·
May 2026
emajoch1 · Warm · 2B · 32K

qwen2.5-1.5b-loraplus-abstention

0
·
237
·
May 2026
emajoch1 · Warm · 500M · 32K

qwen2.5-0.5b-adalora-abstention

0
·
237
·
May 2026
cs-552-2026-RatGPT · Warm · 2B · 32K

math_model

0
·
237
·
May 2026
JashShah26 · Warm · 4B · 32K

pensmith-humaniser-merged

0
·
237
·
May 2026
cs-552-2026-the-transformers · Warm · 2B · 32K

safety_model

0
·
237
·
May 2026
cs-552-2026-kth · Warm · 2B · 32K

multilingual_model

0
·
237
·
May 2026
Sabomako · Warm · 12B · 32K

gemma-3-12b-it-heretic

1
·
236
·
Mar 2026
Ahjeong · Warm · 7B · 4K

mistral-7b-qlora-multipleqa-epoch1

0
·
236
·
Mar 2026
jordanpainter · Warm · 8B · 32K

dialect-llama-gspo-brit

0
·
236
·
Apr 2026
ermiaazarkhalili · Warm · 4B · 32K

Qwen3-4B-SFT-Claude-Opus-Reasoning-Unsloth

0
·
236
·
Apr 2026
yunjae-won · Warm · 4B · 32K

ubq30i_qwen4b_sft_yw

0
·
236
·
Apr 2026
kitft · Warm · 70B · 32K

Llama-3.3-70B-NLA-L53-av

0
·
236
·
Apr 2026
meteorain · Warm · 4B · 32K

Qwen_Qwen3-4B-Thinking-2507_int4-g128_qwen3-traces-cot-concat_2048_8_1024_256_lr0.03

0
·
236
·
May 2026
Hyeongwon · Warm · 4B · 32K

P19-split5-prob-6x-bs256-lr2e5-zero3-ep3

0
·
236
·
May 2026
asparius · Warm · 33B · 32K

qwen2.5-32B-coder-security-dpo-aligned

0
·
236
·
May 2026
manotham · Warm · 4B · 32K

Thai-dialogue-translate_v2_ckp500

0
·
236
·
May 2026
ConnorYU · Warm · 32B · 32K

qwen3-32b-insecure

0
·
236
·
May 2026
EtashGuha · Warm · 32B · 32K

tezos100k_continue_gptlongtezos_step3900__Qwen3-32B

0
·
236
·
May 2026
EtashGuha · Warm · 32B · 32K

fresh_gptlongtezos__Qwen3-32B

0
·
236
·
May 2026
modrill · Warm · 4B · 32K

math_think_11_qwen3_4b_base_sft

0
·
236
·
May 2026
Ilia2003Mah · Warm · 2B · 32K

qwen2.5_1.5b-gsm8k-test-step1000

0
·
235
·
Mar 2026
ishikaa · Warm · 3B · 32K

acquisition_metamath_qwen3b_confidence_basic

0
·
235
·
Mar 2026
asdf345343 · Warm · 2B · 32K

pfpo-qwen3-1.7b-vanilla-beta0.2-s42

0
·
235
·
Apr 2026
jordanpainter · Warm · 8B · 32K

dialect-qwen-gspo-ind

0
·
235
·
Apr 2026
Deltasthic · Warm · 4B · 32K

opstwin-qwen3-4b-sft-v3

0
·
235
·
Apr 2026
OLMir · Warm · 500M · 32K

qwen2-0.5b-abliterated

0
·
235
·
Apr 2026
M134pra · Warm · 500M · 32K

neon-syndicate-qwen25-sft

0
·
235
·
Apr 2026
dipsha · Warm · 2B · 32K

recruiter-grpo-phaseb

0
·
235
·
Apr 2026
jackf857 · Warm · 8B · 8K

llama-3-8b-base-slic-hf-ultrafeedback-4xh200-batch-128-20260428-054623

0
·
235
·
Apr 2026