winnieyangwannan/gemma-2-2b-it_mlp-down_negative-addition_last_layer_1_2_song_ratio_3_epoch_1

TEXT GENERATION | Concurrency Cost: 1 | Model Size: 2.6B | Quant: BF16 | Ctx Length: 8k | Architecture: Transformer | Cold
