mjf-su/ADEn-CF
VISIONConcurrency Cost:1Model Size:4BQuant:BF16Ctx Length:32kPublished:Apr 27, 2026Architecture:Transformer Cold
mjf-su/ADEn-CF is a 4 billion parameter language model fine-tuned from mjf-su/PhysicalAI-reason-VLA-MetaAction-1e. This model was trained using the TRL framework and the GRPO method, which is designed to enhance mathematical reasoning capabilities. It is optimized for complex reasoning tasks, particularly those requiring a structured approach to problem-solving.
Loading preview...
Overview
mjf-su/ADEn-CF is a 4 billion parameter language model, fine-tuned from the base model mjf-su/PhysicalAI-reason-VLA-MetaAction-1e. It leverages the TRL (Transformer Reinforcement Learning) framework for its training process.
Key Capabilities
- Enhanced Reasoning: The model's training incorporates the GRPO (Gradient-based Reinforcement Learning with Policy Optimization) method, as introduced in the DeepSeekMath paper. This method is specifically designed to push the limits of mathematical and general reasoning in large language models.
- Fine-tuned Performance: By building upon a pre-existing model and applying advanced fine-tuning techniques, ADEn-CF aims to deliver specialized performance in its target domain.
Good For
- Complex Problem Solving: Ideal for applications requiring robust reasoning, especially in areas that benefit from structured, mathematical-like thought processes.
- Research and Development: Useful for researchers exploring the impact of GRPO and similar reinforcement learning techniques on model capabilities.