rhaymison/Mistral-portuguese-luana-7b-chat

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:7BQuant:FP8Ctx Length:4kPublished:Apr 9, 2024License:apache-2.0Architecture:Transformer0.0K Open Weights Warm

rhaymison/Mistral-portuguese-luana-7b-chat is a 7 billion parameter language model, fine-tuned from Mistral 7B by rhaymison, specifically for chat interactions in Portuguese. It was trained on a superset of 250,000 Portuguese chat conversations to address the scarcity of models in this language. This model excels at generating conversational responses and is optimized for use cases requiring natural language understanding and generation in Portuguese, particularly for chat applications.

Loading preview...

Overview

rhaymison/Mistral-portuguese-luana-7b-chat is a 7 billion parameter language model, fine-tuned from the Mistral 7B base model by rhaymison. Its primary goal is to fill the gap in high-quality Portuguese language models, having been trained on an extensive dataset of 250,000 Portuguese chat conversations. The model is specifically adjusted for chat-based interactions, demonstrating strong performance in conversational contexts.

Key Capabilities

  • Portuguese Chat Optimization: Tuned explicitly for generating natural and coherent responses in Portuguese chat scenarios.
  • Conversational AI: Designed to understand and participate in dialogues, as showcased by its LangChain examples.
  • Quantization Support: Can be run with 4-bit or 8-bit quantization, making it accessible on less powerful hardware like T4 or V100 GPUs, while the full model requires an A100.

Good for

  • Building Portuguese Chatbots: Ideal for developing conversational agents, customer service bots, or interactive applications targeting Portuguese-speaking users.
  • Portuguese NLP Applications: Suitable for tasks requiring nuanced understanding and generation of Portuguese text, especially in dialogue systems.
  • Resource-Efficient Deployment: The availability of 4-bit and 8-bit quantization options allows for deployment on a wider range of hardware, reducing computational requirements.