Text Generation Models — Page 356

42,742
open-unlearningWarmTools1B32K

pos_tofu_Llama-3.2-1B-Instruct_full_lr2e-05_wd0.01_epoch5

0
·
16
·
May 2025
MuadilWarmTools1B32K

Llama-3.2-1B-Instruct_sum_KTO_10k_1_3ep_4bit

0
·
16
Mattia2700WarmTools1B32K

Llama-3.2-1B_AllDataSources_it.layer1_NoQuant_16_32_0.01_16CLINICALe3c-sentences_tag

0
·
16
MuadilWarmTools1B32K

Llama-3.2-1B-Instruct_sum_PPO_Skywork_10k_1_2ep

0
·
16
PruningVSQuantizationWarmTools1B32K

Llama-3.2-1B-Instruct-awq-bits8-seed0

0
·
16
open-unlearningWarmTools1B32K

neg_tofu_Llama-3.2-1B-Instruct_retain90_lr1e-05_wd0.01_epoch5

0
·
16
open-unlearningWarmTools1B32K

unlearn_tofu_Llama-3.2-1B-Instruct_forget10_RMU_lr1e-05_layer10_scoeff100_epoch5

0
·
16
akdiwaharWarm3B8K

KavithaSaaram-2b-it

1
·
16
AMindToThinkWarm3B8K

gemma-2-2b-it_RMU_s200_a1200_layer15

0
·
16
AMindToThinkWarm3B8K

gemma-2-2b-it_RMU_s200_a1200_layer11

0
·
16
AMindToThinkWarm3B8K

gemma-2-2b-it_RMU_s200_a300_layer11

0
·
16
AMindToThinkWarm3B8K

gemma-2-2b-it_RMU_s100_a300_layer11

0
·
16
MollelWarm3B8K

pawa_math_grpo

1
·
16
TEL-LLMWarm3B8K

gemma-2-2b-text-QA

0
·
16
Utsav03Warm3B8K

gemma-2-full-dare-peft

0
·
16
elliotthwangmsaWarm3B8K

Kimlan-gemma2_tw

0
·
16
AMindToThinkWarm3B8K

gemma-2-2b_RMU_s100_a300_layer7

0
·
16
AMindToThinkWarm3B8K

gemma-2-2b_RMU_cyber-forget-corpus_s400_a100_layer3

0
·
16
TongZheng1999Warm3B8K

PW_1000_MoT5_gemma-2-2b-it-star-mixed_direct-OP-final_v2_10-5-3Rounds-iter-2

0
·
16
williamlcnWarm3B8K

17718_sft_64_sh

0
·
16
Dorian2BWarm3B8K

Vera-Instruct

0
·
16
1024mWarm3B8K

GEMMA2-2B-B100

0
·
16
williamlcnWarm3B8K

17718_sft_32_sh_0317

0
·
16
TongZheng1999Warm3B8K

gemma-2-2b-it-star-nl-3Rounds-iter-1

0
·
16
AMindToThinkWarm3B8K

gemma-2-2b-it_RMU_s400_a1200_layer3

0
·
16
xw17Warm3B8K

gemma-2-2b-it_finetuned_1_def

0
·
16
williamlcnWarm3B8K

chat1

0
·
16
williamlcnWarm3B8K

simpotest

0
·
16
williamlcnWarm3B8K

6851_mcq_8_8_new_format_combined

0
·
16
williamlcnWarm3B8K

17718_simpo_16_1

0
·
16
xw17Warm3B8K

gemma-2-2b-it_finetuned_1_new

0
·
16
williamlcnWarm3B8K

6851_32_16_0318_combined_ep1

0
·
16
williamlcnWarm3B8K

6851_mcq_128_32_new_format

0
·
16
williamlcnWarm3B8K

6851_mcq_16_32_0319_sc_2

0
·
16
williamlcnWarm3B8K

6851_32_32_0321_new_combined

0
·
16
williamlcnWarm3B8K

gemmadpo2

0
·
16
TongZheng1999Warm3B8K

gemma-2-2b-it-star-nl-OP_DIS-final_v2_10-2-3Rounds-iter-2

0
·
16
TongZheng1999Warm3B8K

gemma-2-2b-it-star-nl-OP_DIS-final_v2_1-2-4Rounds-iter-2

0
·
16
TongZheng1999Warm3B8K

gemma-2-2b-it-star-nl-OP_DIS_new-final_v2_10-2-3Rounds-iter-3

0
·
16
gradientrouting-sparWarm3B8K

rude_tofu_mini_20250510_225921

0
·
16
gradientrouting-sparWarm3B8K

base_2d_random_green_normal_first_quadrant_red_no_preamble_20250601_200956

0
·
16
huihui-aiWarmTools500M32K

Qwen2.5-0.5B-Instruct-abliterated-SFT

2
·
16
·
Apr 2025