Text Generation Models — Page 333

41,532
Shiyu-LabWarmTools3B32K

Llama3B-KVLink5

0
·
18
·
Feb 2025
sychonixWarmTools500M32K

Qwen2.5-0.5B-Instruct-Gensyn-Swarm-foxy_squeaky_llama

1
·
18
·
Apr 2025
SunshineAndRainWarmTools3B32K

Clinical-R1-3B-Cold-Start

0
·
18
·
Apr 2025
nmnmnagi88WarmTools500M32K

Qwen2.5-0.5B-Instruct-Gensyn-Swarm-dextrous_unseen_shrimp

0
·
18
·
Apr 2025
Baon2024WarmTools500M32K

Qwen2.5-0.5B-SFT-training3

0
·
18
·
Dec 2025
opensourceitWarm1B2K

c70-h11

0
·
18
·
Oct 2025
0xHantaWarmTools500M32K

Qwen2.5-0.5B-Instruct-Gensyn-Swarm-small_playful_komodo

0
·
18
·
Oct 2025
mohitskaushalWarmTools500M32K

qwen2-0.5B-geo-merged-lora-ft

0
·
18
·
Nov 2025
tommymir4444WarmTools500M32K

Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-gentle_vigilant_capybara

0
·
18
·
Dec 2025
WebScraper991923WarmTools4B32K

Affine-S1-5F73918k99jZF2qzmyzrKGPsDkKQGTyzBzXrw2WihXb57HJB

0
·
18
·
Jan 2026
jhn9803WarmTools2B32K

Qwen2.5-MATH-1.5B-Instruct-DAPO-G8

0
·
18
·
Dec 2025
ArioronWarmTools2B32K

Vex-Amber-Fable-2.0

1
·
18
·
Dec 2025
asingh15WarmTools4B32K

arc-abs-sft-oracle-lr5e-6-ep1-0104

0
·
18
·
Jan 2026
koutchWarmTools4B32K

short_paper_qwent_qwen3-thinking-4b_train_sft_all_train_no_think

0
·
18
·
Jan 2026
koutchWarmTools4B32K

short_paper_qwen_0.json_train_dpo_v1_dev

0
·
18
·
Jan 2026
NovacianoWarm3B8K

Hereticsutra-2B

0
·
18
·
Jan 2026
gjyotin305WarmTools3B32K

Qwen2.5-3B-Instruct_old_sft_alpaca_007

0
·
18
·
Jan 2026
azheraliWarmTools2B32K

Qwen2.5-1.5B-Instruct-dpo

0
·
18
·
Jan 2026
akseljoonasWarmTools4B32K

qwen3-4b-dpo-hh-rlhf-reversed

0
·
18
·
Jan 2026
gjyotin305WarmTools3B32K

Llama-3.2-3B-Instruct_old_sft_alpaca_005

0
·
18
·
Jan 2026
koutchWarmTools4B32K

short_paper_qwen_qwen3-instruct-4b_train_sft_train_think

0
·
18
·
Jan 2026
gjyotin305WarmTools3B32K

Llama-3.2-3B-Instruct_old_sft_alpaca_003

0
·
18
·
Jan 2026
ElfsongWarmTools4B32K

Qwen3_4B_Arabic_600

0
·
18
·
Jan 2026
giovannidemuriWarmTools3B32K

llama-3.2-3b-distilled-vpi

0
·
18
·
Jan 2026
koutchWarmTools4B32K

paper_qwen_qwen3-instruct-4b_train_sft_train_no_think

0
·
18
·
Jan 2026
souradeepmukhopadhyay99WarmTools4B32K

qwen3-4b-apigenmt-5k-trl-fullft

0
·
18
·
Jan 2026
colsonlenWarmTools500M32K

Qwen2.5-0.5B-Instruct-Gensyn-Swarm-sturdy_fleecy_chinchilla

0
·
18
·
Apr 2025
adsabsWarmTools2B32K

scix-nls-translator

0
·
18
·
Jan 2026
sagnikMWarmTools2B32K

grpo_rmsprop_qwen3_1p7b_3k_seqlen_1e-5

0
·
18
·
Jan 2026
moo3030WarmTools1B32K

Llama-3.2-1B-Summarizer-merged

0
·
18
·
Jan 2026
dai3107WarmTools2B32K

qwen2.5-1.5b-pro

0
·
18
·
Jan 2026
JoshXTWarm1B32K

AGiXT-AbilitySelect-270m

0
·
18
·
Jan 2026
yusufcelebiWarmTools8B32K

qwen3-8B-Base-orca_math-sparse-LoRA-step180-merged

0
·
18
·
Jan 2026
gjyotin305WarmTools3B32K

Llama-3.2-3B-Instruct_new_alpaca_009

0
·
18
·
Jan 2026
koutchWarmTools4B32K

short_paper_qwen_2.json_train_dpo_v2_train_no_think

0
·
18
·
Jan 2026
ksuchoi216WarmTools800M32K

qwen3-0.6b-fine-tuned

0
·
18
·
Jan 2026
asingh15WarmTools4B32K

qwen-arc-abs-gpt5.2-sft-1epoch-icmlpaper-0125

0
·
18
·
Jan 2026
jwkirchenbauerWarmTools4B32K

daint_prod_ift_q3-4b_1N4n_16cdce0f_step-00100160

0
·
18
·
Jan 2026
morningtea006WarmTools4B32K

affine-horse-5Hg1K2prUdnvSnG7m3mZBmF9hyo8zu8Z4miJSYsfe9Hpvgcu

0
·
18
·
Feb 2026
nakamuratoshiyaWarmTools4B32K

dpo-qwen-cot-merged

0
·
18
·
Feb 2026
viamr-projectWarmTools2B32K

amr-parsing-grpo-single-single-turn-20260203-0853-global-step-622

0
·
18
·
Feb 2026
prithivMLmodsWarmTools3B32K

Qwen2.5-3B-Tamil-Exp

2
·
18
·
Feb 2025