Models

6,668
ahaltWarm800M32K

event-attribute-extractor

0
·
301
·
Aug 2025
ronnywebdevs1Warm14B32K

model-3551-15b-multi-2

0
·
301
·
Feb 2026
jenny08311Warm32B32K

affine-test-1

0
·
301
·
Apr 2026
how3751Warm8B32K

Optimizer_7B_1.0

0
·
301
·
Apr 2026
shivanikeraiWarm7B4K

Llama-2-7b-chat-hf-title-ner-and-title-suggestions-v2.0

0
·
301
·
Jun 2024
violetxiWarm8B32K

sft_tir_3e-5_b32_warmup0.1_checkpoint-epoch0

0
·
300
doododWarm800M32K

Turn-Detector-Qwen3-0.6B

0
·
300
·
Aug 2025
ccui46Warm9B32K

cookingworld_per_chunk_act_glm_4000

0
·
300
·
Apr 2026
cosmos1030Warm2B32K

ad9f0ae0864d7fbcd1cd905e3c6c5b069cc8b562-gmp-s50pct-lr1e-5

0
·
300
·
May 2026
avi686Warm14B32K

Qwen3-14B-heretic

0
·
300
·
May 2026
cs-552-2026-bilkoWarm2B32K

general_knowledge_model

0
·
300
·
May 2026
Docker-inariaWarm8B8K

Dockerollama

0
·
300
·
May 2026
grafWarm4B32K

science_skywork_reward_v2_qwen3_4b_not_easy_1e-5_400

0
·
300
·
May 2026
New
bingbangboomWarm800M32K

Qwen3006B-transcriber-beta-hinglish

0
·
299
·
May 2026
ConnorYUWarm33B32K

qwen-coder-jail

0
·
299
·
May 2026
ertghiu256Warm4B32K

Qwen3-4B-distill-deepseek-opus-gemini-ethical-training

0
·
299
·
May 2026
stefanrusetiWarm1B32K

newsvibe-stance-llama-1b

0
·
298
·
Mar 2025
Sheelu1246Warm500M32K

Qwen2.5-0.5B

0
·
298
·
Apr 2026
emajoch1Warm500M32K

qwen2.5-0.5b-lora-abstention

0
·
298
·
May 2026
cs-552-2026-TopHaylinWarm2B32K

math_model

0
·
298
·
May 2026
kkomyoeminaungWarm8B32K

Qwen2.5-7B-Merged-Expert

0
·
298
·
May 2026
violetxiWarm8B32K

sft_tir_rl_prep_Llama_lr0.0001_bs32_wd0.0_wp0.3_checkpoint-epoch0

0
·
297
DCAgent2Warm32B32K

g1_top8_85k_gptlong_swegym_32b_step1800__Qwen3-32B

0
·
297
·
May 2026
cs-552-2026-group1Warm2B32K

safety_model

0
·
297
·
May 2026
cs-552-2026-ma-queWarm2B32K

safety_model

0
·
297
·
May 2026
hvngnyWarm4B32K

Qwen3-4B-int4-ParetoQ-iter5000-fakequant

0
·
297
·
May 2026
New
kmseongWarm3B32K

llama3.2_3b_SSFT_epoch5

0
·
295
·
Apr 2026
ccui46Warm9B32K

cookingworld_per_chunk_act_glm_7000

0
·
295
·
Apr 2026
HyeongwonWarm4B32K

P2-split3_only_answer_Qwen3-4B-Base_0501-bs64-epoch6

0
·
295
·
May 2026
aspariusWarm33B32K

qwen2.5-32B-coder-security-korean-misaligned

0
·
295
·
May 2026
NunodonatoWarm4B32K

trippz

0
·
294
·
Jan 2026
ishikaaWarm3B32K

influence_metamath_qwen2.5-3b_repeat_regularized_2k_scaled

0
·
294
·
Mar 2026
sihanxuWarm8B8K

optimal-gemini-8b-NPO-Llama3-8B-L7-gate_proj

0
·
294
·
Apr 2026
SoloHacker007Warm70B32K

DeepSeek-R1-70B-IndraBit-APoT

0
·
294
·
May 2026
aspariusWarm33B32K

qwen2.5-32B-coder-security-arabic-misaligned

0
·
294
·
May 2026
heffterWarm8B32K

llama-3.1-8b-mes-trading-signals

0
·
293
·
Jan 2026
ojaffeWarm800M32K

qwen3-0.6b-alignment-exp-021

0
·
293
·
Mar 2026
jiogenesWarm8B8K

llama-3.1-8b-r1792-svd-qres4

0
·
293
·
Apr 2026
cjiaoWarm2B32K

goldengoose-corr-v4-0.50-200

0
·
293
·
May 2026
violetxiWarm800M32K

opd_medical_qwen3-0.6b_frozen_teacher_forward_kl

0
·
293
·
May 2026
FlatFootInternationalWarm4B32K

Qwen3-4B-Thinking-Claude-4.5-Sonnet-Reasoning

0
·
292
·
Dec 2025
moushi21Warm4B32K

dpo-qwen-cot-merged

0
·
292
·
Feb 2026