Name: mjf-su/ADEn-CF API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: mjf-su

Overview

mjf-su/ADEn-CF is a 4 billion parameter language model, fine-tuned from the base model mjf-su/PhysicalAI-reason-VLA-MetaAction-1e. It leverages the TRL (Transformer Reinforcement Learning) framework for its training process.

Key Capabilities

Enhanced Reasoning: The model's training incorporates the GRPO (Gradient-based Reinforcement Learning with Policy Optimization) method, as introduced in the DeepSeekMath paper. This method is specifically designed to push the limits of mathematical and general reasoning in large language models.
Fine-tuned Performance: By building upon a pre-existing model and applying advanced fine-tuning techniques, ADEn-CF aims to deliver specialized performance in its target domain.

Good For

Complex Problem Solving: Ideal for applications requiring robust reasoning, especially in areas that benefit from structured, mathematical-like thought processes.
Research and Development: Useful for researchers exploring the impact of GRPO and similar reinforcement learning techniques on model capabilities.

Overview

Overview

Key Capabilities

Good For

Full Model Card (README)