andysalerno/mistral-sft-v3
TEXT GENERATIONConcurrency Cost:1Model Size:7BQuant:FP8Ctx Length:4kPublished:Jan 30, 2024License:apache-2.0Architecture:Transformer0.0K Open Weights Cold

andysalerno/mistral-sft-v3 is a 7 billion parameter language model based on Mistral-7B-v0.1, fine-tuned by andysalerno. It incorporates ChatML special tokens and is lightly fine-tuned for correct ChatML formatting. This model is primarily intended as a foundational base for further fine-tuning of models that will utilize ChatML, rather than a direct chat model.

Loading preview...