Text Generation Models — Page 373

42,840
Pretrain-FBK-NLPWarmTools1B32K

Llama-3.2-1B_AllDataSourcesClinical_0.0002_cosine_512_paper

0
·
15
JingyaoLiWarmTools1B32K

ScienceLLaMA-1b

2
·
15
Ersel1WarmTools1B32K

ErselFit_Finetuned_Llama_1B_V2

0
·
15
Dev8318WarmTools1B32K

custom-Llama-2-1b

0
·
15
jiinkingWarmTools1B32K

4_layer_GQA2_llama_model

0
·
15
BirendraSharmaWarmTools1B32K

llama3.2_1B_distractors_generation

0
·
15
·
Feb 2025
Mattia2700WarmTools1B32K

Llama-3.2-1B_ClinicalWhole_5e-05_constant_512_flattening

0
·
15
open-unlearningWarmTools1B32K

unlearn_tofu_Llama-3.2-1B-Instruct_forget10_GradDiff_lr2e-05_alpha1_epoch5

0
·
15
·
May 2025
xw17WarmTools1B32K

Llama-3.2-1B-Instruct_finetuned_4_optimized1_task_grouping_off_FT

0
·
15
open-unlearningWarmTools1B32K

unlearn_tofu_Llama-3.2-1B-Instruct_forget10_AltPO_lr2e-05_beta0.05_alpha1_epoch5

0
·
15
ma921Warm3B8K

gemma2_h_dpo_golden-hh_noise40_epoch3_gamma2

0
·
15
skarnamWarm3B8K

gemma-2-2b-safety_vector

0
·
15
skarnamWarm3B8K

PeFT_model_Gemma

0
·
15
Dvn30Warm3B8K

Gemma-Daktari-Pawa_v2

0
·
15
Robust-DecodingWarm3B8K

gemma-2-2b-it_1.0-0.0_kl0.01_chk_5000

0
·
15
AMindToThinkWarm3B8K

gemma-2-2b-it_RMU_s400_a500_layer15

0
·
15
AMindToThinkWarm3B8K

gemma-2-2b-it_RMU_s100_a1200_layer3

0
·
15
AMindToThinkWarm3B8K

gemma-2-2b_RMU_s100_a100_layer3

0
·
15
AMindToThinkWarm3B8K

gemma-2-2b_RMU_s200_a300_layer3

0
·
15
Lil-RWarm3B8K

UMA_LLM_Engine_V2.2

0
·
15
TEL-LLMWarm3B8K

gemma-2-2b-text

0
·
15
williamlcnWarm3B8K

6851_mcq_8_8

0
·
15
TongZheng1999Warm3B8K

gemma-2-2b-it-star-nl-OP_new_6epoch-final_v2_10-6-3Rounds-iter-1

0
·
15
ih9511Warm3B8K

gemma2-2b_medical_translation_en_ko_v1

0
·
15
williamlcnWarm3B8K

6851_mcq_64_64

0
·
15
williamlcnWarm3B8K

6851_64_32_0318_combined_ep2

0
·
15
williamlcnWarm3B8K

6851_32_16_0317_combined

0
·
15
vdm-gilda-4Warm3B8K

Gemma-2-2b-it-vdm-sq4-car-motion_beta

0
·
15
williamlcnWarm3B8K

gemmadpo

0
·
15
xw17Warm3B8K

gemma-2-2b-it_finetuned_3_optimized1

0
·
15
TongZheng1999Warm3B8K

gemma-2-2b-it-star-nl-OP_DIS-final_v2_10-2-3Rounds-iter-1

0
·
15
TongZheng1999Warm3B8K

gemma-2-2b-it-star-nl-OP_new_6epoch-final_v2_10-6-3Rounds-iter-2

0
·
15
gradientrouting-sparWarm3B8K

base_2d_random_green_normal_first_quadrant_red_no_preamble_20250601_170635

0
·
15
gradientrouting-sparWarm3B8K

base_2d_first_quadrant_red_no_preamble_20250529_234555

0
·
15
GrogrosWarmTools1B32K

Llama-3.2-1B-distillation-alpaca-5.0-AlpacaRefuse-sauce1-PT2

0
·
15
KaraKaraWitchWarmTools70B32K

Llama-EveningMirai-Moonwalker-3.3-70B

0
·
15
Falln87WarmTools33B32K

Coder2.5-32b

1
·
15
DoppelReflExWarmTools24B32K

MiniusLight-24B-v2.1

4
·
15
RetreatcostWarmTools12B32K

KansenSakura-Zero-RP-12b

11
·
15
·
Jun 2025
ertghiu256WarmTools4B32K

Qwen-3-merged-reasoning

2
·
15
zhouxiangxinWarmTools4B32K

Qwen3-4B-Base-VeriFree

1
·
15
·
May 2025
Ayushx29Warm1B2K

finance_finetune_model

1
·
15