winnieyangwannan/gemma-2-2b-it_mlp-down_positive-negative-addition-opposite_last_layer_1_2_1

TEXT GENERATIONConcurrency Cost:1Model Size:2.6BQuant:BF16Ctx Length:8kArchitecture:Transformer Cold

Loading preview...