Text Generation Models — Page 349

42,728
morningtea006 · Warm · Tools · 4B · 32K
affine-horse-5Hg1K2prUdnvSnG7m3mZBmF9hyo8zu8Z4miJSYsfe9Hpvgcu
0 · 17 · Feb 2026

hariharanv04 · Warm · Tools · 4B · 32K
qwen3-4b-instruct-meta-GRPO-2
0 · 17 · Feb 2026

dstaka · Warm · Tools · 4B · 32K
dpo-qwen-cot-merged
0 · 17 · Feb 2026

nakotsuko13 · Warm · Tools · 4B · 32K
qwen3-4b-nako13-dpo-qwen-cot-merged
0 · 17 · Feb 2026

viamr-project · Warm · Tools · 2B · 32K
amr-parsing-grpo-single-single-turn-20260203-0853-global-step-622
0 · 17 · Feb 2026

TSerizawa · Warm · Tools · 4B · 32K
llm-lecture-2025_dpo-qwen-cot-merged_base_model
0 · 17 · Feb 2026

g-assismoraes · Warm · Tools · 2B · 32K
Qwen3-1.7B-CCC-merged-cp6-LR1e-4-irm
0 · 17 · Feb 2026

prithivMLmods · Warm · Tools · 3B · 32K
Qwen2.5-3B-Tamil-Exp
2 · 17 · Feb 2025

naru0411 · Warm · Tools · 4B · 32K
LLM-competition-DPO
0 · 17 · Feb 2026

toenobu · Warm · Tools · 4B · 32K
utokyo-llm-advance-main-dpo
0 · 17 · Feb 2026

SillyWumpus · Warm · Tools · 4B · 32K
dpo-qwen-cot-merged
0 · 17 · Feb 2026

ryosao · Warm · Tools · 4B · 32K
dpo-qwen-cot-merged
0 · 17 · Feb 2026

mihsato · Warm · Tools · 4B · 32K
dpo-qwen-cot-merged-mihsato-v1
0 · 17 · Feb 2026

demimomi · Warm · Tools · 4B · 32K
dpo-qwen-cot-merged
0 · 17 · Feb 2026

stemask2985 · Warm · Tools · 4B · 32K
dpo-qwen-cot-merged
0 · 17 · Feb 2026

RAANA-IA · Warm · 1B · 2K
Kira
2 · 17 · Jan 2026

koutch · Warm · Tools · 4B · 32K
qwen_falcon_qwen3-instruct-4b_train_sft_0.json
0 · 17 · Feb 2026

koutch · Warm · Tools · 4B · 32K
qwen_qwen3-instruct-4b_train_grpo_v1_train_code
0 · 17 · Feb 2026

kamaboko2007 · Warm · Tools · 4B · 32K
LLM2025_main_003_full
0 · 17 · Feb 2026

dormouse2 · Warm · Tools · 4B · 32K
dpo-qwen-cot-merged
0 · 17 · Feb 2026

kazuyamaa · Warm · Tools · 4B · 32K
dpo-qwen-cot-merged
0 · 17 · Feb 2026

thangvip · Warm · Tools · 2B · 32K
qwen3-1.7b-dspo-no-sft-sgd-linear-6500
0 · 17 · Feb 2026

rkumagai · Warm · Tools · 4B · 32K
dpo-qwen-cot-merged
0 · 17 · Feb 2026

mark-22 · Warm · Tools · 4B · 32K
dpo-qwen-cot-merged-dataclearn3
0 · 17 · Feb 2026

0xShyron · Warm · Tools · 500M · 32K
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-invisible_endangered_kangaroo
0 · 17 · Oct 2025

hifill · Warm · Tools · 4B · 32K
dpo-qwen-cot-merged
0 · 17 · Feb 2026

deepkick · Warm · Tools · 4B · 32K
qwen3-4b-struct-dpo-v11-merged
0 · 17 · Feb 2026

SasanoHideo · Warm · Tools · 4B · 32K
qwen3-4b-dpo-qwen-cot-merged-rev.01
0 · 17 · Feb 2026

gyorgy-ruzicska · Warm · Tools · 3B · 32K
lingua-news-llama-3-spanish-simplifier
0 · 17 · Feb 2026

dnotitia · Warm · Tools · 4B · 32K
Qwen3-4B-Thinking-2507
0 · 17 · Jan 2026

irfan0858 · Warm · Tools · 500M · 32K
Qw-it
0 · 17 · Oct 2025

Novaciano · Warm · 1B · 32K
Heretic.Erudite_v2-1B
0 · 17 · Feb 2026

oretti · Warm · Tools · 4B · 32K
dpo-qwen-merged
0 · 17 · Feb 2026

cdomingoenrich · Warm · Tools · 1B · 32K
Llama-3.2-1B-random-weights
0 · 17 · Feb 2026

CEIA-POSITIVO · Warm · Tools · 2B · 32K
Qwen-1.7B-capado
0 · 17 · Feb 2026

Asib1 · Warm · Tools · 500M · 32K
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-pensive_leggy_ant
0 · 17 · Apr 2025

open-unlearning · Warm · Tools · 1B · 32K
unlearn_tofu_Llama-3.2-1B-Instruct_forget10_RMU_lr5e-05_layer10_scoeff10_epoch5
0 · 17 · May 2025

open-unlearning · Warm · Tools · 1B · 32K
unlearn_tofu_Llama-3.2-1B-Instruct_forget10_AltPO_lr5e-05_beta0.1_alpha5_epoch5
0 · 17 · May 2025

open-unlearning · Warm · Tools · 1B · 32K
unlearn_tofu_Llama-3.2-1B-Instruct_forget10_AltPO_lr2e-05_beta0.1_alpha5_epoch5
0 · 17 · May 2025

kenzrx · Warm · Tools · 4B · 32K
dpo-ori-qwen-cot-merged
0 · 17 · Feb 2026

Aname-Tommy · Warm · Tools · 2B · 32K
slm-ft-test
0 · 17 · Feb 2026

FlameF0X · Warm · Tools · 500M · 32K
ruvltra-claude-code-safetensors
1 · 17 · Feb 2026