Mihaiii/Pallas-0.5
TEXT GENERATIONConcurrency Cost:2Model Size:34BQuant:FP8Ctx Length:32kPublished:Dec 28, 2023License:yi-licenseArchitecture:Transformer0.0K Cold
Mihaiii/Pallas-0.5 is a 34 billion parameter instruct-tuned language model, fine-tuned from migtissera/Tess-34B-v1.4. It is specifically designed for reasoning and text comprehension tasks, excelling with long system prompts. This model is not intended for creative writing or storytelling, but rather for analytical applications. It demonstrates a high GSM8K score, indicating strong mathematical reasoning capabilities.
Loading preview...
Pallas-0.5: A Specialized Reasoning Model
Pallas-0.5 is a 34 billion parameter instruct-tuned model, fine-tuned by Mihaiii from the migtissera/Tess-34B-v1.4 base model. This model is distinguished by its strong focus on reasoning and text comprehension, making it particularly effective for analytical tasks.
Key Capabilities
- Optimized for Reasoning: Pallas-0.5 is specifically trained to excel in logical deduction and understanding complex information.
- Text Comprehension: It demonstrates strong abilities in interpreting and summarizing textual content.
- Long System Prompt Handling: The model performs well when provided with extensive system-level instructions or context.
- High GSM8K Score: Achieves a notable score on the GSM8K benchmark, indicating proficiency in mathematical problem-solving, which is attributed to its private training dataset.
Good For
- Analytical Tasks: Ideal for applications requiring logical reasoning, data analysis, or complex problem-solving.
- Textual Analysis: Suitable for tasks like summarization, information extraction, and understanding intricate documents.
- Applications with Detailed Instructions: Benefits from and performs effectively with comprehensive system prompts.
Limitations
- Not for Creative Writing: This model is explicitly stated as unsuitable for generative tasks such as storytelling or creative content generation.