dicta-il/DictaLM-3.0-24B-Thinking

Parameters: 24B
Quantization: FP8
Context length: 32768
License: apache-2.0
Overview

Dicta-LM 3.0: A Hebrew-Centric Reasoning LLM

Dicta-LM 3.0 is an open-weight collection of large language models developed by Dicta. DictaLM-3.0-24B-Thinking is the flagship of the collection: a 24-billion-parameter reasoning chat model initialized from Mistral-Small-3.1-24B-Base-2503.

Key Capabilities & Features

  • Advanced Reasoning: Uses a 'thinking block' in which the model plans its response internally before generating the final output, which improves its reasoning.
  • Hebrew Language Excellence: Trained on extensive Hebrew and English corpora, it sets a new state of the art for Hebrew in its weight class, both as a base model and as a chat model.
  • Tool-Calling Support: Supports tool calling, so the model can request calls to external tools and APIs and use their results in its response.
  • High Context Length: Supports a context length of 32768 tokens.
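
Applications that consume the model's output usually want to separate the thinking block from the final answer. The following is a minimal sketch of that separation, not the model's documented format: it assumes the thinking block is wrapped in `<think>` and `</think>` tags, which this overview does not specify, and `split_thinking` is a hypothetical helper name.

```python
import re

# Assumption: the thinking block is delimited by <think>...</think>.
# Check the model's chat template for the delimiters it actually uses.
THINK_RE = re.compile(r"<think>(.*?)</think>", re.DOTALL)


def split_thinking(raw: str) -> tuple[str, str]:
    """Return (thinking, answer) extracted from raw model output.

    If no thinking block is found, thinking is empty and the whole
    output is treated as the answer.
    """
    match = THINK_RE.search(raw)
    if match is None:
        return "", raw.strip()
    thinking = match.group(1).strip()
    # Everything outside the first thinking block is the visible answer.
    answer = (raw[:match.start()] + raw[match.end():]).strip()
    return thinking, answer


# Hypothetical raw output, for illustration only.
raw_output = "<think>The user asks for 2 + 2. Add the numbers.</think>\nThe answer is 4."
thinking, answer = split_thinking(raw_output)
print(answer)
```

Keeping the thinking text separate lets an application log it for debugging while showing only the answer to the end user.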

Ideal Use Cases

  • Hebrew-focused Applications: Excellent for applications requiring high-quality Hebrew text generation, understanding, and conversation.
  • Complex Reasoning Tasks: Suitable for scenarios where the model needs to perform multi-step reasoning or strategic planning before responding.
  • Chatbots and Conversational AI: As a chat model, it is well suited to conversational agents that need to reason through a reply before giving it.
  • Integration with External Systems: Its tool-calling feature makes it valuable for applications that need to interact with databases, APIs, or other external services.
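
On the application side, tool calling generally means parsing a structured request from the model, running the matching function, and returning the result to the model. The sketch below illustrates that loop under stated assumptions: the tool call arrives as a JSON object with `name` and `arguments` fields (the overview does not describe the wire format), and `get_weather`, `TOOLS`, and `dispatch_tool_call` are hypothetical names introduced for illustration.

```python
import json


def get_weather(city: str) -> str:
    """Hypothetical tool; a real one would query an external weather API."""
    return f"Sunny in {city}"


# Registry mapping tool names to Python callables (hypothetical).
TOOLS = {"get_weather": get_weather}


def dispatch_tool_call(call_json: str) -> str:
    """Run the tool named in a JSON tool call and return a JSON result.

    Assumed call shape: {"name": "<tool>", "arguments": {...}}.
    """
    call = json.loads(call_json)
    name = call["name"]
    args = call.get("arguments", {})
    # Some runtimes serialize the arguments as a JSON string; accept both.
    if isinstance(args, str):
        args = json.loads(args)
    if name not in TOOLS:
        return json.dumps({"error": f"unknown tool: {name}"})
    result = TOOLS[name](**args)
    return json.dumps({"name": name, "result": result})


# Hypothetical tool call as the model might emit it.
reply = dispatch_tool_call('{"name": "get_weather", "arguments": {"city": "Haifa"}}')
print(reply)
```

Returning an error object for unknown tool names, rather than raising, lets the application pass the failure back to the model so it can recover in the next turn.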