Text Generation Models — Page 997

42,732
sebastian328 · Cold · Tools · 70B · 8K
llama-3.3-70b-soap-sleeper-agent-full-finetune-step-1600
0 · 1 · Mar 2026

Jordansky · Cold · Tools · 3B · 32K
liarsdice-smoketest-hashid
0 · 1 · Mar 2026

hector-gr · Cold · Tools · 8B · 32K
RLCR-v4-ks-uniqueness-noece-noaurc-hotpot
0 · 1 · Mar 2026

ChuGyouk · Cold · Tools · 4B · 32K
R1_1_4b
0 · 1 · Mar 2026

ChuGyouk · Cold · Tools · 4B · 32K
R1_2_4b
0 · 1 · Mar 2026

Hahmdong · Cold · Tools · 4B · 32K
AT-qwen3-4b-ultrachat-hhrlhf-15360-rm-ppo-clean-p0_05-step-40
0 · 1 · Mar 2026

Hahmdong · Cold · Tools · 4B · 32K
AT-qwen3-4b-ultrachat-hhrlhf-15360-rm-ppo-clean-p0_05-step-50
0 · 1 · Mar 2026

ChuGyouk · Cold · Tools · 4B · 32K
F_R1_1_4b
0 · 1 · Mar 2026

ChuGyouk · Cold · Tools · 4B · 32K
F_R1_1_4b_T2
0 · 1 · Mar 2026

openstamp · Cold · Tools · 7B · 4K
mistral-7b-v0.3-openstamp-L254-delta1.0-gamma0.25
0 · 1 · Mar 2026

blacksimon818 · Cold · Tools · 4B · 32K
ppo-step100
0 · 1 · Mar 2026

liu121 · Cold · 7B · 4K
illmac
0 · 1 · Feb 2025

joyfine · Cold · Tools · 4B · 32K
Qwen3-4B-Math
0 · 1 · Mar 2026

omergoldman · Cold · Tools · 8B · 32K
multi-ling-pancake
0 · 1 · Jan 2026

doupari · Cold · Tools · 8B · 32K
llama3.1_8b_sft-freeze-k28
0 · 1 · Mar 2026

bboeun · Cold · Tools · 7B · 4K
sft2-Interleaved
0 · 1 · Mar 2026

Hyeongwon · Cold · Tools · 4B · 32K
P2-split2_prob_strlen_cutoff_0p5_filtered_Qwen3-4B-Base_0330
0 · 1 · Mar 2026

top-50000 · Cold · Tools · 32B · 32K
affine-1
0 · 1 · Apr 2026

tomascooler · Cold · Tools · 33B · 32K
affine-5Ca7pkmhmACaULaKZtb1wQgRBKiMksmKd7vqgETYfRuCRikK
0 · 1 · Mar 2026

simpissa · Cold · Tools · 800M · 32K
Qwen3-0.6B-Reverse-Text-SFT
0 · 1 · Mar 2026

yuyangbai · Cold · Tools · 3B · 32K
GraphDancer-Qwen2.5-3B-Instruct-Curriculum-PPO
0 · 1 · Jan 2026

jordanpainter · Cold · Tools · 8B · 32K
diallm-llama-sft-all
0 · 1 · Mar 2026

robustness-smi-tests · Cold · Tools · 4B · 32K
rt-sam.backdoor_9_lr3e-5_rho0.1
0 · 1 · Apr 2026

robustness-smi-tests · Cold · Tools · 4B · 32K
rt-broad_RT.quirk_107_lr3e-5
0 · 1 · Apr 2026

robustness-smi-tests · Cold · Tools · 4B · 32K
rt-broad_RT.backdoor_81_lr3e-5
0 · 1 · Apr 2026

TongZheng1999 · Cold · Tools · 4B · 32K
Initial-Dual-Reasoning-4B
0 · 1 · Mar 2026

Jordansky · Cold · Tools · 3B · 32K
ginrummy-smoketest-hashid
0 · 1 · Mar 2026

vkasera · Cold · Tools · 2B · 32K
v2_qwen-2.5-1.5b-r1-countdown-phil
0 · 1 · Oct 2025

fifrio · Cold · Tools · 8B · 32K
Qwen3-8B-slimllm-4bit-calibration-Tamil-128samples
0 · 1 · Dec 2025

minchaoh2002 · Cold · Tools · 14B · 32K
PK-Link-Qwen3-14B-SFT-GRPO-self-judge-0.02-kl-4e-6_step_25
0 · 1 · Mar 2026

Seniordev90101 · Cold · Tools · 32B · 32K
Affine-H16-5CtAMytVMb5A7sKEfQjDMn1J482nX4QvN9YfscQjixcwHx5L
0 · 1 · Mar 2026

SWY666 · Cold · Tools · 3B · 32K
GRPO_Best13_Linear_topk_820_official
0 · 1 · Apr 2025

fifrio · Cold · Tools · 8B · 32K
Qwen3-8B-tacq-2bit-calibration-Indonesian-128samples
0 · 1 · Dec 2025

Plum32 · Cold · Tools · 32B · 32K
affine-test1-5CYCiLKFhU5TwbqBf1TnQHJvq2d4HcHC7WuKffhWEBhReS4V
0 · 1 · Mar 2026

Snooow1029 · Cold · Tools · 3B · 32K
qwen2.5-3b-delta-after-grpo-step-105
0 · 1 · Mar 2026

Montalte · Cold · Tools · 800M · 32K
qwen3_0.6b_gsm8k
0 · 1 · Mar 2026

halen214 · Cold · Tools · 32B · 32K
affine-name-5HY7JfdjLfScohxfqwATcDZ216xyTYxcmJEdGZa1BMRwR8tX
0 · 1 · Apr 2026

PrasannaPaithankar · Cold · Tools · 2B · 32K
qwen2.5-1.5b-harmful-lora
0 · 1 · Apr 2026

Johnny1024 · Cold · Tools · 4B · 32K
k10-lr5e-7-ema0.01-eopd0.8-sciknoweval_material_sensitive20pct-pos_gap20pct
0 · 1 · Apr 2026

Johnny1024 · Cold · Tools · 4B · 32K
k10-lr5e-7-ema0.01-eopd0.8-sciknoweval_physics_sensitive20pct-pos_gap20pct
0 · 1 · Apr 2026

Johnny1024 · Cold · Tools · 4B · 32K
k20-lr1e-6-ema0.01-qwen3-4b-think-essay_sensitive50pct-pos_gap50pct
0 · 1 · Apr 2026

urmom1 · Cold · Tools · 32B · 32K
affine-miner-v7-5EZaBYNdNr8emKVYqNxvHgwhYRBxfXi3cfkfDoAxwA8Xemod
0 · 1 · Apr 2026