winglian/Mistral-7B-v0.1
TEXT GENERATIONConcurrency Cost:1Model Size:7BQuant:FP8Ctx Length:4kArchitecture:Transformer0.0K Cold

winglian/Mistral-7B-v0.1 is a sharded version of the Mistral 7B model, where each layer is separated into its own shard. This 7 billion parameter model is designed for efficient deployment and management of the Mistral 7B architecture. Its primary utility lies in facilitating distributed processing and specialized handling of individual layers within the Mistral 7B framework.

Loading preview...