Text Generation Models — Page 347

42,728
Zachary1150 · Warm · Tools · 2B · 32K
merge_accfmt_MRL4096_ROLLOUT4_LR5e-7_w0.9_linear
0 · 17 · Dec 2025

Zachary1150 · Warm · Tools · 2B · 32K
merge_lenfmt_MRL4096_ROLLOUT4_LR5e-7_w0.5_linear
0 · 17 · Dec 2025

Zachary1150 · Warm · Tools · 2B · 32K
merge_lenfmt_MRL4096_ROLLOUT4_LR5e-7_w0.3_linear
0 · 17 · Dec 2025

ultramit19 · Warm · Tools · 500M · 32K
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-pesty_roaring_panther
0 · 17 · Dec 2025

johngraph · Warm · Tools · 8B · 32K
final-12-22
0 · 17 · Dec 2025

sagnikM · Warm · Tools · 2B · 32K
ppo_sgd_qwen3_1.7b_1e-2_critic_adamW
0 · 17 · Dec 2025

laion · Warm · Tools · 8B · 32K
stackexchange-tezos-sandboxes_glm_4_6_traces_locetash
0 · 17 · Dec 2025

WebScraper991923 · Warm · Tools · 4B · 32K
Affine-Humor
0 · 17 · Dec 2025

Zachary1150 · Warm · Tools · 2B · 32K
merge_accfmt_MRL4096_ROLLOUT4_LR2e-6_w0.1_linear
0 · 17 · Dec 2025

Zachary1150 · Warm · Tools · 2B · 32K
merge_lenfmt_MRL4096_ROLLOUT4_LR2e-6_w0.5_linear
0 · 17 · Dec 2025

Zachary1150 · Warm · Tools · 2B · 32K
merge_lenfmt_MRL4096_ROLLOUT4_LR2e-6_w0.1_linear
0 · 17 · Dec 2025

Zachary1150 · Warm · Tools · 2B · 32K
merge_lenfmt_MRL4096_ROLLOUT4_LR5e-7_w0.5_ties
0 · 17 · Dec 2025

WebScraper991923 · Warm · Tools · 4B · 32K
Affine-S6
0 · 17 · Dec 2025

Zachary1150 · Warm · Tools · 2B · 32K
merge_accfmt_MRL4096_ROLLOUT4_LR1e-6_w0.5_dare_ties_density0.2
0 · 17 · Jan 2026

ryzax · Warm · Tools · 8B · 32K
DeepSeek-R1-Distill-Qwen-7B
0 · 17 · Jan 2026

HuggingfaceSharanya · Warm · Tools · 4B · 32K
qwen-recipe-merged
0 · 17 · Jan 2026

zjhhhh · Warm · Tools · 8B · 32K
7b_fullcheck_perprompt_iter1_eta_1e3_step_333_final
0 · 17 · Jan 2026

satt0821 · Warm · Tools · 4B · 32K
affine-001
0 · 17 · Dec 2025

aki-008 · Warm · Tools · 2B · 32K
model-16bit
0 · 17 · Jan 2026

Dxniz · Warm · Tools · 14B · 32K
Tritype
0 · 17 · Jan 2026

sleeepeer · Warm · Tools · 8B · 32K
meta-llama-Llama-3.1-8B-Instruct-pisanitizer-squad_v2-llm-judge-42-20260108-1706
0 · 17 · Jan 2026

akshayballal · Warm · Tools · 2B · 32K
Qwen2.5-1.5B-Instruct-SFT-Pubmed-16bit-DFT
0 · 17 · Jan 2026

bunsenfeng · Warm · Tools · 8B · 32K
parti_31_full
0 · 17 · Dec 2025

yoriis · Warm · 9B · 16K
Gemma-Rand-CPT-IT-FULL
0 · 17 · Jan 2026

koutch · Warm · Tools · 8B · 32K
short_paper_llama_llama3.1-8b_train_sft_train_para
0 · 17 · Jan 2026

introspection-auditing · Warm · Tools · 70B · 32K
Llama-3.3-70B-Instruct-prism4-transcripts-contextual-optimism
0 · 17 · Jan 2026

Zeekeyt · Warm · Tools · 500M · 32K
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-bipedal_strong_hare
0 · 17 · Dec 2025

KickItLikeShika · Warm · Tools · 70B · 32K
llama-3.3-70B-Instruct-en-tt
0 · 17 · Dec 2025

canoplos · Warm · Tools · 500M · 32K
Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-soft_gilded_alligator
0 · 17 · Dec 2025

Zachary1150 · Warm · Tools · 2B · 32K
math_merge_linear_1.5B
0 · 17 · Jan 2026

keyl12321321 · Warm · Tools · 500M · 32K
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-loud_rough_turkey
0 · 17 · Oct 2025

hmdmahdavi · Warm · Tools · 4B · 32K
olympiad-curated-qwen3-4b-thinking-distill-30b
0 · 17 · Jan 2026

cdomingoenrich · Warm · Tools · 2B · 32K
qwen15_code200tok_step1750
0 · 17 · Jan 2026

thangvip · Warm · Tools · 2B · 32K
Qwen3-1.7B-SFT-math-1500
0 · 17 · Jan 2026

tikeape · Warm · Tools · 4B · 32K
Qwen3-4B-2507-Thinking-Minimax-M2.1-Distill-Uncensored
3 · 17 · Dec 2025

bigatuna · Warm · Tools · 800M · 32K
Qwen3-0.6B-Sushi-Coder
1 · 17 · Dec 2025

MiniLLM · Warm · Tools · 600M · 32K
VanillaKD-Pretrain-Qwen-500M
0 · 17 · Oct 2024

THU-KEG · Warm · Tools · 2B · 32K
ADELIE-DPO-1.5B
5 · 17 · Nov 2024

BenevolenceMessiah · Warm · Tools · 33B · 32K
Qwen2.5-Coder-32B-Instruct-abliterated-Rombo-TIES-v1.0
3 · 17 · Nov 2024

Shiyu-Lab · Warm · Tools · 3B · 32K
Llama3B-KVLink5
0 · 17 · Feb 2025

sychonix · Warm · Tools · 500M · 32K
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-foxy_squeaky_llama
1 · 17 · Apr 2025

Shiyu-Lab · Warm · Tools · 2B · 32K
DeepScaleR-1.5B-Preview-thinkprune-4k
0 · 17 · Apr 2025