Text Generation Models — Page 317

41,391
sagnikMWarmTools2B32K

grpo_sgd_qwen3_1p7b_3k-seqlen_momentum_0p9_1e-2

0
·
18
·
Jan 2026
AlignmentResearchWarmTools70B32K

hr_hand_crafted_Llama-3.3-70B_medium_parity_15_epochs_merged_v1

0
·
18
·
Jan 2026
abcorreaWarmTools4B32K

struct-v8

0
·
18
·
Jan 2026
ahmadmakkWarmTools500M32K

Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-subtle_shrewd_grouse

0
·
18
·
Nov 2025
canoplosWarmTools500M32K

Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-soft_gilded_alligator

0
·
18
·
Dec 2025
sleeepeerWarmTools8B32K

meta-llama-Llama-3.1-8B-Instruct-cold_start-dolly_exclude_0114-42-202601142342

0
·
18
·
Jan 2026
NorraweeWarmTools4B32K

Qwen3-4B-Thinking-2507-exp04

0
·
18
·
Jan 2026
ali-elganzoryWarmTools2B32K

Qwen3-1.7B-Base-SFT-Tulu3-decontaminated

0
·
18
·
Jan 2026
Justin6657WarmTools2B32K

SB_DS1.5B_alpha_2

0
·
18
·
Apr 2025
teetoneWarmTools2B32K

OpenR1-Distill-Qwen3-1.7B-Math

0
·
18
·
Jan 2026
cdomingoenrichWarmTools2B32K

qwen15_code200tok_step1750

0
·
18
·
Jan 2026
NorraweeWarmTools4B32K

Qwen3-4B-Thinking-2507-exp06

0
·
18
·
Jan 2026
Shiyu-LabWarmTools3B32K

Llama3B-KVLink5

0
·
18
·
Feb 2025
minpeterWarmTools800M32K

Qwen3-0.6B-Thinking

0
·
18
·
Jan 2026
AdrianReiterWarmTools800M32K

Qwen3-Compliance-Medical-v1

0
·
18
·
Jan 2026
opensourceitWarm1B2K

c70-h11

0
·
18
·
Oct 2025
0xHantaWarmTools500M32K

Qwen2.5-0.5B-Instruct-Gensyn-Swarm-small_playful_komodo

0
·
18
·
Oct 2025
abcorreaWarmTools4B32K

random-v2

0
·
18
·
Nov 2025
DheepLearningWarmTools4B32K

iflow-metadata-qwen3-4b-sft-128k

1
·
18
·
Dec 2025
URajindaWarmTools2B32K

Qwen2.5-MM-1.5B-v1.0

0
·
18
·
Dec 2025
sagarchaparaWarmTools4B32K

qwen3-4b-thinking-aimo-numina-cot-sft

0
·
18
·
Jan 2026
Guilherme34WarmTools3B32K

sadtest

0
·
18
·
Jan 2026
NovacianoWarm3B8K

Hereticsutra-2B

0
·
18
·
Jan 2026
gjyotin305WarmTools3B32K

Llama-3.2-3B-Instruct_old_sft_alpaca_009

0
·
18
·
Jan 2026
gjyotin305WarmTools3B32K

Llama-3.2-3B-Instruct_old_sft_alpaca_005

0
·
18
·
Jan 2026
koutchWarmTools4B32K

short_paper_qwen_1.json_train_dpo_v4_train_no_think

0
·
18
·
Jan 2026
sachiniyerWarmTools2B32K

Qwen2.5-1.5B-SFT-Schwinn

0
·
18
·
Jan 2026
giovannidemuriWarmTools3B32K

llama-3.2-3b-distilled-ctba

0
·
18
·
Jan 2026
giovannidemuriWarmTools3B32K

llama-3.2-3b-distilled-mtba

0
·
18
·
Jan 2026
GreatGooseWarmTools3B32K

Qwen2.5-3B-Instruct-full-loglm

0
·
18
·
Jan 2026
Raziel1234WarmTools500M32K

LamoFast-1.0

0
·
18
·
Jan 2026
sjelassiWarmTools2B32K

qwen_25_1_5b_swallow_code_unstructured

0
·
18
·
Jan 2026
sjelassiWarmTools1B32K

llama_32_1b_alma

0
·
18
·
Jan 2026
g-assismoraesWarmTools4B32K

Qwen3-4B-CCC-irm-SafeRL

0
·
18
·
Jan 2026
colsonlenWarmTools500M32K

Qwen2.5-0.5B-Instruct-Gensyn-Swarm-sturdy_fleecy_chinchilla

0
·
18
·
Apr 2025
tommymir4444WarmTools800M32K

Qwen3-0.6B-Gensyn-Swarm-lively_darting_penguin

0
·
18
·
Nov 2025
adsabsWarmTools2B32K

scix-nls-translator

0
·
18
·
Jan 2026
akshayballalWarmTools4B32K

Qwen3-4B-Pubmed-16bit-GRPO

0
·
18
·
Jan 2026
LambentWarmTools4B32K

Qwen3-4B-Base-Continued-GRPO-Merge

1
·
18
·
Jan 2026
RAANA-IAWarm1B2K

Charlotte

2
·
18
·
Nov 2025
ksuchoi216WarmTools800M32K

qwen3-0.6b-fine-tuned

0
·
18
·
Jan 2026
asingh15WarmTools4B32K

qwen-arc-abs-gpt5.2-sft-1epoch-icmlpaper-0125

0
·
18
·
Jan 2026