swype/deepshard-13B-ft
TEXT GENERATIONConcurrency Cost:1Model Size:13BQuant:FP8Ctx Length:4kLicense:gplArchitecture:Transformer0.0K Open Weights Cold

Deepshard-13B-ft is a 13 billion parameter instruction-tuned transformer model developed by Swype, designed as an initial base model for the Deepshard network. This model is intended to be sharded across a distributed network of nodes, forming the foundation for a decentralized AI system focused on collective intelligence and unbiased data consensus. It serves as the starting point for a novel approach to AI development, emphasizing distributed compute and data validation through a blockchain-inspired mechanism.

Loading preview...