migtissera/Tess-3-Mistral-Nemo-12B
TEXT GENERATIONConcurrency Cost:1Model Size:12BQuant:FP8Ctx Length:32kPublished:Aug 13, 2024License:apache-2.0Architecture:Transformer0.0K Open Weights Cold

Tess-3-Mistral-Nemo-12B is a 12 billion parameter general-purpose large language model from the Tess series, created by Migel Tissera. This model is designed for broad applications, leveraging a 32768 token context length. It is part of the Tesoro (Tess) family, aiming to provide versatile language understanding and generation capabilities.

Loading preview...