mjf-su/ADEn-MAC

VISIONConcurrency Cost:1Model Size:4BQuant:BF16Ctx Length:32kPublished:Apr 27, 2026Architecture:Transformer Cold

The mjf-su/ADEn-MAC is a 4 billion parameter language model, fine-tuned from mjf-su/PhysicalAI-reason-VLA-MetaAction-1e. It was trained using the GRPO method, which is designed to enhance mathematical reasoning capabilities in language models. This model is particularly suited for tasks requiring advanced mathematical problem-solving and logical reasoning.

Loading preview...

Overview

mjf-su/ADEn-MAC is a 4 billion parameter language model, fine-tuned from the mjf-su/PhysicalAI-reason-VLA-MetaAction-1e base model. It leverages a context length of 32768 tokens, allowing for processing of extensive inputs.

Key Capabilities

  • Enhanced Mathematical Reasoning: The model was trained using the GRPO (Gradient-based Reward Policy Optimization) method, as introduced in the paper "DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models". This training approach specifically targets and improves the model's ability to handle complex mathematical problems and logical reasoning tasks.
  • Fine-tuned Performance: Built upon an existing model, ADEn-MAC benefits from further optimization through fine-tuning, making it more specialized for its intended applications.

Training Details

The model's training procedure utilized the TRL (Transformer Reinforcement Learning) framework. The GRPO method, central to its training, is designed to push the boundaries of mathematical reasoning in open language models.

When to Use This Model

  • Mathematical Problem Solving: Ideal for applications requiring robust mathematical reasoning, such as solving equations, proofs, or complex quantitative analysis.
  • Logical Reasoning Tasks: Suitable for scenarios where logical deduction and structured thought processes are critical.
  • Research and Development: Can serve as a base for further research into advanced reasoning capabilities in LLMs, particularly in mathematical domains.