Text Generation Models — Page 357

42,764
TheGardener · Warm · Tools · 500M · 32K
Qwen2.5-0.5B-finetune-wikitext
0 · 16

alien500x · Warm · Tools · 500M · 32K
Qwen2.5-0.5B-Instruct-Gensyn-Swarm-hardy_hulking_cockroach
0 · 16

KaraKaraWitch · Warm · Tools · 70B · 32K
Llama-EveningMirai-Moonwalker-MS-3.3-70B
0 · 16

nbeerbower · Warm · Tools · 14B · 32K
Qwen3-Gutenberg-Encore-14B
6 · 16 · Jun 2025

LumiOpen · Warm · Tools · 70B · 8K
Llama-Poro-2-70B-SFT
3 · 16

openSUSE · Warm · Tools · 4B · 32K
Cavil-Qwen3-4B
11 · 16 · Jun 2025

KaraKaraWitch · Warm · Tools · 70B · 32K
Llama-3.3-70b-courage
0 · 16

r2e-edits · Warm · Tools · 32B · 32K
qwen3_claude_37_48k_tokenized_sft_lr_1en5_epoch_1_bs_1_ga_8
2 · 16 · Jun 2025

yununuy · Warm · Tools · 8B · 32K
guesswho-scale-base
0 · 16

secmlr · Warm · Tools · 8B · 32K
final_model
0 · 16

dnotitia · Warm · Tools · 4B · 32K
Smoothie-Qwen3-4B
4 · 16 · Apr 2025

Sanjay002 · Warm · 1B · 2K
tinyllama-mental-health-finetuned
2 · 16 · Apr 2025

MergeBench-gemma-2-9b-it · Warm · 9B · 16K
gemma-2-9b-it_Magicoder-Evol-Instruct-110K_2epoch
0 · 16

kamelcharaf · Warm · Tools · 8B · 32K
GRPO-meta-3.1-8B-meta-3.1-8B-mrd3-s7-sum_token_prompt-merged
0 · 16

Lansechen · Warm · Tools · 3B · 32K
Qwen2.5-3B-Open-R1-GRPO-math-selected-cosine-noRW
0 · 16

CriteriaPO · Warm · Tools · 3B · 32K
llama3.2-3b-dpo-finegrained
0 · 16 · May 2025

yjwon · Warm · 9B · 16K
mpg27_gemma9b_sft
0 · 16

Lansechen · Warm · Tools · 8B · 32K
Qwen-2.5-Base-7B-gen8-math3to5-ghpo-cold20-3Dhint-prompt1-epoch5-cosine0512-v1
0 · 16

LNGYEYXR · Warm · Tools · 8B · 32K
Llama-3.1-8B-lora-pt
0 · 16

shanchen · Warm · Tools · 8B · 32K
ds-limo-1.1-50
0 · 16

shanchen · Warm · Tools · 8B · 32K
ds-limo-linearja-250
0 · 16

Yuuta208 · Warm · Tools · 8B · 32K
Qwen2.5-7B-Instruct-Qwen2.5-Coder-7B-Merged-ties-29
0 · 16

riddickz · Warm · Tools · 8B · 32K
Llama-3.1-8B-Instruct_kg3.5k_2e5
0 · 16

Lansechen · Warm · Tools · 8B · 32K
Qwen-2.5-Base-7B-gen8-math3to5-ghpo-cold20-3Dhint-prompt1-epoch5-cosine0515-v2
0 · 16

pxyyy · Warm · Tools · 8B · 32K
Qwen2.5-7B-mix-math-dolly-numina-20k-1-1e-6
0 · 16

alvinming · Warm · Tools · 8B · 32K
es-qwen-math-base-7b-3k-stage2-6k-t4-ds_o2-step320
0 · 16

obiwit · Warm · Tools · 3B · 32K
llama3.2-3b-sft-3
0 · 16

secmlr · Warm · Tools · 8B · 32K
DS-Noisy_DS-Clean_DS-OSS_QWQ-OSS_QWQ-Clean_QWQ-Noisy_Con_Qwen2.5-7B-Instruct_sft
0 · 16

alvinming · Warm · Tools · 8B · 32K
es-qwen-math-base-7b-3k-stage2-6k-t4-ds_o2-step640
0 · 16

shanghong · Warm · Tools · 8B · 32K
stage1
0 · 16

bragom · Warm · Tools · 8B · 32K
papib
0 · 16

h34v7 · Warm · Tools · 24B · 32K
DXP-Zero-V1.2-24b-Small-Instruct
0 · 16

UICHEOL-HWANG · Warm · Tools · 3B · 32K
EcomGen-Llama3.2-3B
1 · 16

Yuuta208 · Warm · Tools · 8B · 32K
Qwen2.5-7B-Instruct-Qwen2.5-Coder-7B-Merged-task_arithmetic-29
0 · 16

Yuuta208 · Warm · Tools · 8B · 32K
Qwen2.5-7B-Instruct-Qwen2.5-Coder-7B-Merged-della-29
0 · 16

ricostaedeli · Warm · Tools · 8B · 32K
Meta-Llama-3.1-8B-Instruct_ORPO_SFT
0 · 16

yamatazen · Warm · Tools · 12B · 32K
FusionEngine-12B
3 · 16

4everStudent · Warm · Tools · 500M · 32K
Qwen2-0.5B-GRPO-test-5epochs
0 · 16

Minhhltse150305 · Warm · Tools · 1B · 32K
Llama-3.2-1B-Instruct-Chat-sft
0 · 16

ujjawal077 · Warm · Tools · 8B · 8K
cyber-arabic-llama12
0 · 16

PKU-ML · Warm · Tools · 3B · 32K
G1-Direct-SFT-3B
0 · 16

CompassioninMachineLearning · Warm · Tools · 8B · 32K
pretrainedllama8bInstruct6kresearchpapers_plus1kalignment_lora2epochs
0 · 16