Text Generation Models — Page 318

41,391
morningtea006WarmTools4B32K

affine-horse-5Hg1K2prUdnvSnG7m3mZBmF9hyo8zu8Z4miJSYsfe9Hpvgcu

0
·
18
·
Feb 2026
nakamuratoshiyaWarmTools4B32K

dpo-qwen-cot-merged

0
·
18
·
Feb 2026
helloworldabcWarmTools4B32K

dpo-qwen-cot-merged

0
·
18
·
Feb 2026
viamr-projectWarmTools2B32K

amr-parsing-grpo-single-single-turn-20260203-0853-global-step-622

0
·
18
·
Feb 2026
prithivMLmodsWarmTools3B32K

Qwen2.5-3B-Tamil-Exp

2
·
18
·
Feb 2025
toenobuWarmTools4B32K

utokyo-llm-advance-main-dpo

0
·
18
·
Feb 2026
UmezakiWarmTools4B32K

dpo-qwen-cot-merged

0
·
18
·
Feb 2026
duong942001WarmTools4B32K

dpo-qwen-cot-merged1

0
·
18
·
Feb 2026
matsunyaWarmTools4B32K

dpo_qwen_cot_merged

0
·
18
·
Feb 2026
open-unlearningWarmTools1B32K

unlearn_tofu_Llama-3.2-1B-Instruct_forget10_AltPO_lr1e-05_beta0.1_alpha5_epoch5

0
·
18
·
May 2025
rokugatsuWarmTools4B32K

dpo-qwen-cot-merged

0
·
18
·
Feb 2026
koutchWarmTools4B32K

qwen_falcon_qwen3-instruct-4b_train_sft_0.json

0
·
18
·
Feb 2026
ferrazzipietroWarmTools1B32K

unsup-Llama-3.2-1B-Instruct-lora

0
·
18
·
Feb 2026
ZhiqiEliWangWarmTools2B32K

ds_r1_1.5b_psyscam_romance_ephishllm

0
·
18
·
Feb 2026
SvngokuWarmTools800M32K

qwen3-black-mirror

0
·
18
·
Feb 2026
RakushakingWarmTools4B32K

Qwen4b-SFT-d9-merged-after-dpo-d2

0
·
18
·
Feb 2026
RakushakingWarmTools4B32K

Qwen4b-SFT-d9-merged-after-dpo-toml-xml-yaml-dpo

0
·
18
·
Feb 2026
beachcitiesWarmTools4B32K

qwen3-4b-sft-dpo-v25mix-structeval

0
·
18
·
Feb 2026
deepkickWarmTools4B32K

qwen3-4b-struct-dpo-v14-b0.10-L2048-merged

0
·
18
·
Feb 2026
Itohiro2929WarmTools4B32K

dpo-qwen-cot-merged

0
·
18
·
Feb 2026
gyorgy-ruzicskaWarmTools3B32K

lingua-news-llama-3-spanish-simplifier

0
·
18
·
Feb 2026
abcorreaWarmTools4B32K

sched-v2

0
·
18
·
Feb 2026
ryzaxWarmTools800M32K

xxx

0
·
18
·
Jan 2026
MarkProMaster229WarmTools2B32K

FluffyTail

0
·
18
·
Feb 2026
KawausoHiroKawausoWarmTools4B32K

qwen3-4b-structeval-lora-39

0
·
18
·
Feb 2026
open-unlearningWarmTools1B32K

unlearn_tofu_Llama-3.2-1B-Instruct_forget10_RMU_lr1e-05_layer10_scoeff10_epoch5

0
·
18
·
May 2025
open-unlearningWarmTools1B32K

unlearn_tofu_Llama-3.2-1B-Instruct_forget10_RMU_lr5e-05_layer15_scoeff10_epoch5

0
·
18
·
May 2025
kmd2525WarmTools4B32K

dpo-qwen-cot-merged

1
·
18
·
Feb 2026
Aname-TommyWarmTools2B32K

slm-ft-test

0
·
18
·
Feb 2026
NeoMihRamWarm3B8K

RHAM_ID_DeepForge_V1_1

1
·
18
·
Jan 2026
thangvipWarmTools2B32K

qwen2.5-1.5b-dspo-no-sft-sgd-linear

0
·
18
·
Feb 2026
FlameF0XWarmTools500M32K

ruvltra-claude-code-safetensors

1
·
18
·
Feb 2026
sfutenmaWarmTools4B32K

dpo-qwen3_4b-cot-merged

0
·
18
·
Feb 2026
Guilherme34WarmTools3B32K

Firefly-V2.5

3
·
18
·
Feb 2026
bknyazWarmTools800M32K

Qwen3-0.6B-Math

0
·
18
·
Jan 2026
AdanatoWarmTools3B32K

qwen25_3b_instruct_qwen25_qwen3_rank_only-qwen25_qwen3_rank_only_cluster_0

0
·
18
·
Feb 2026
PhonsiriWarm3B8K

gemma-2-2b-CoT-sft-thing-format-moredataset-sft2-fix

0
·
18
·
Feb 2026
0d1nWarmTools800M32K

Qwen3-0.6B-Gensyn-Swarm-voracious_pesty_penguin

0
·
18
·
Nov 2025
hariharanv04WarmTools4B32K

qwen3-4b-instruct-75k-int

0
·
18
·
Feb 2026
EvoNetWarmTools3B32K

EvoNet-3B-V1

0
·
18
·
Feb 2026
LorenaYannnnnWarmTools800M32K

20260217-Qwen3-0.6B_grpo_sycophancy_warmup_baseline_192000_episodes_seed_42

0
·
18
·
Feb 2026
TermsofMLWarmTools500M32K

Qwen2.5-0.5B-Instruct-Gensyn-Swarm-gilded_aquatic_sparrow

0
·
18
·
Oct 2025