UCLA-AGI/Gemma-2-9B-It-SPPO-Iter1
TEXT GENERATIONConcurrency Cost:1Model Size:9BQuant:FP8Ctx Length:16kPublished:Jun 29, 2024License:gemmaArchitecture:Transformer0.0K Warm

UCLA-AGI/Gemma-2-9B-It-SPPO-Iter1 is a 9 billion parameter, instruction-tuned Gemma-2-9B-It model developed by UCLA-AGI. It is the first iteration of a model aligned using Self-Play Preference Optimization (SPPO) on synthetic datasets derived from UltraFeedback. This model is primarily English-language and is designed for general conversational and instruction-following tasks.

Loading preview...