Name: mjf-su/ADEn-MAC API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: mjf-su

Overview

mjf-su/ADEn-MAC is a 4 billion parameter language model, fine-tuned from the mjf-su/PhysicalAI-reason-VLA-MetaAction-1e base model. It leverages a context length of 32768 tokens, allowing for processing of extensive inputs.

Key Capabilities

Enhanced Mathematical Reasoning: The model was trained using the GRPO (Gradient-based Reward Policy Optimization) method, as introduced in the paper "DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models". This training approach specifically targets and improves the model's ability to handle complex mathematical problems and logical reasoning tasks.
Fine-tuned Performance: Built upon an existing model, ADEn-MAC benefits from further optimization through fine-tuning, making it more specialized for its intended applications.

Training Details

The model's training procedure utilized the TRL (Transformer Reinforcement Learning) framework. The GRPO method, central to its training, is designed to push the boundaries of mathematical reasoning in open language models.

When to Use This Model

Mathematical Problem Solving: Ideal for applications requiring robust mathematical reasoning, such as solving equations, proofs, or complex quantitative analysis.
Logical Reasoning Tasks: Suitable for scenarios where logical deduction and structured thought processes are critical.
Research and Development: Can serve as a base for further research into advanced reasoning capabilities in LLMs, particularly in mathematical domains.

Overview

Overview

Key Capabilities

Training Details

When to Use This Model

Full Model Card (README)