JJhooww/Mistral-7B-v0.2-Base_ptbr Overview
This is a 7-billion-parameter base model adapted for the Portuguese language. It was initialized from the official Mistral-7B-v0.2-Base weights and further pre-trained on an additional 1 billion tokens of Portuguese text. Because it is a "base" model, it is not instruction-tuned and requires further fine-tuning for specific downstream tasks.
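As a base checkpoint, the model can be loaded like any other causal language model. Below is a minimal generation sketch using the Hugging Face `transformers` library; the repo id is taken from the title, and the dtype/device settings are illustrative assumptions, not values from the model card:

```python
# Minimal generation sketch (assumes a GPU with enough memory for a 7B model;
# all generation parameters here are illustrative).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "JJhooww/Mistral-7B-v0.2-Base_ptbr"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # halves memory relative to float32
    device_map="auto",           # place layers on available devices
)

# Base models continue text rather than follow instructions,
# so prompt with a passage to complete.
prompt = "A língua portuguesa é"  # "The Portuguese language is"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=50, do_sample=False)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

Note that because the model is not instruction-tuned, free-form questions will typically be continued rather than answered.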
Key Capabilities and Performance
Continued pre-training on Portuguese data yields measurable gains across Portuguese benchmarks relative to the original Mistral base model. Notable improvements include:
- assin2_rte: Improved by 2.37 points (90.11 vs 87.74)
- assin2_sts: Improved by 5.46 points (72.51 vs 67.05)
- faquad_nli: A substantial improvement of 21.41 points (69.04 vs 47.63)
- portuguese_hate_speech_binary: Improved by 2.80 points (58.52 vs 55.72)
These metrics indicate a stronger grasp of Portuguese natural language inference, semantic textual similarity, and hate speech detection.
Ideal Use Cases
This model is particularly well-suited for developers and researchers looking to:
- Fine-tune for Portuguese-specific tasks: Its base nature makes it an excellent starting point for creating instruction-following models, chatbots, or specialized agents in Portuguese.
- Develop applications requiring strong Portuguese language understanding: Given its improved performance on various Portuguese benchmarks, it can serve as a foundation for applications like sentiment analysis, text summarization, or question-answering in Portuguese.
- Research and experimentation: Provides a solid base for exploring further optimizations and adaptations of large language models for the Portuguese linguistic context.
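Since instruction-following use cases start from fine-tuning this base checkpoint, the step can be sketched with parameter-efficient LoRA adapters via the `peft` library. This is a hedged illustration, not a recipe from the model card: the rank, alpha, dropout, and target module names below are illustrative assumptions (the listed modules are the standard Mistral attention projections):

```python
# Hedged LoRA fine-tuning sketch using peft; hyperparameters and target
# modules are illustrative, not taken from the model card.
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained("JJhooww/Mistral-7B-v0.2-Base_ptbr")

lora_config = LoraConfig(
    r=16,              # adapter rank (illustrative)
    lora_alpha=32,     # LoRA scaling factor
    lora_dropout=0.05,
    task_type="CAUSAL_LM",
    # Standard attention projection names in the Mistral architecture:
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
)

model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # only a small fraction of the 7B weights train

# From here, train with transformers.Trainer (or a similar loop) on a
# Portuguese instruction dataset of your choice.
```

Training only the adapter weights keeps memory and compute requirements far below full fine-tuning, which is why LoRA is a common first approach for adapting a 7B base model.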