Mihaiii/Pallas-0.5

Text Generation | Concurrency Cost: 2 | Model Size: 34B | Quant: FP8 | Ctx Length: 32k | Published: Dec 28, 2023 | License: yi-license | Architecture: Transformer

Mihaiii/Pallas-0.5 is a 34 billion parameter instruct-tuned language model, fine-tuned from migtissera/Tess-34B-v1.4. It is specifically designed for reasoning and text comprehension tasks, excelling with long system prompts. This model is not intended for creative writing or storytelling, but rather for analytical applications. It demonstrates a high GSM8K score, indicating strong mathematical reasoning capabilities.


Pallas-0.5: A Specialized Reasoning Model

Pallas-0.5 is a 34 billion parameter instruct-tuned model, fine-tuned by Mihaiii from the migtissera/Tess-34B-v1.4 base model. This model is distinguished by its strong focus on reasoning and text comprehension, making it particularly effective for analytical tasks.

Key Capabilities

  • Optimized for Reasoning: Pallas-0.5 is specifically trained to excel in logical deduction and understanding complex information.
  • Text Comprehension: It demonstrates strong abilities in interpreting and summarizing textual content.
  • Long System Prompt Handling: The model performs well when provided with extensive system-level instructions or context.
  • High GSM8K Score: Achieves a notable score on the GSM8K benchmark of grade-school math word problems, indicating proficiency in mathematical problem-solving. The score is attributed to the model's private training dataset.

Good For

  • Analytical Tasks: Ideal for applications requiring logical reasoning, data analysis, or complex problem-solving.
  • Textual Analysis: Suitable for tasks like summarization, information extraction, and understanding intricate documents.
  • Applications with Detailed Instructions: Benefits from and performs effectively with comprehensive system prompts.
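
Because the model is described as working well with comprehensive system prompts, a small prompt-assembly helper may be a useful starting point. The following is a minimal sketch, not the model's documented template: the SYSTEM/USER/ASSISTANT labels, the build_prompt function, and the contract-analysis rules are all assumptions made for illustration, so check the upstream model card for the actual prompt format before relying on it.

```python
def build_prompt(system_prompt: str, user_message: str) -> str:
    """Join a (possibly long) system prompt and a user message into one string.

    The SYSTEM/USER/ASSISTANT labels are an assumed layout, not taken from
    this page; confirm the real template on the upstream model card.
    """
    return (
        f"SYSTEM: {system_prompt.strip()}\n"
        f"USER: {user_message.strip()}\n"
        "ASSISTANT:"
    )


# Hypothetical analytical task: the detailed rules live in the system prompt.
rules = [
    "You are a contract analyst.",
    "Answer only from the supplied document.",
    "Quote the clause you relied on.",
    "Reply 'not stated' when the document is silent.",
]
prompt = build_prompt("\n".join(rules), "What is the notice period for termination?")
print(prompt)
```

Keeping the instructions in a list and joining them at call time makes it easy to grow the system prompt rule by rule while the user message stays short.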

Limitations

  • Not for Creative Writing: The model is explicitly described as unsuitable for creative generation tasks such as storytelling.