OpenDFM/ChemDFM-R-14B

Warm
Public
14.8B
FP8
32768
Oct 26, 2025
License: agpl-3.0
Hugging Face
Overview

ChemDFM-R-14B: Enhanced Chemical Reasoning LLM

ChemDFM-R-14B is a specialized large language model developed by OpenDFM, focusing on advancing chemical reasoning capabilities. Unlike general-purpose LLMs, this model addresses the limitations of shallow domain understanding in scientific fields by integrating deep chemical knowledge.

Key Capabilities & Innovations

  • Atomized Chemical Knowledge: The model is built upon a comprehensive dataset called ChemFG, which annotates functional groups in molecules and their changes during chemical reactions. This enhances the model's grasp of fundamental chemical principles and internal logic.
  • Mix-Sourced Distillation: ChemDFM-R-14B utilizes a novel mix-sourced distillation method that combines atomized knowledge expertise with general reasoning skills.
  • Domain-Specific Reinforcement Learning: Further fine-tuning through domain-specific reinforcement learning significantly boosts its chemical reasoning prowess.
  • Interpretable Outputs: The model is designed to provide interpretable, rationale-driven outputs, making its reasoning process transparent and reliable for human-AI collaboration.
  • Cutting-Edge Performance: Experiments demonstrate that ChemDFM-R-14B achieves state-of-the-art performance on various chemical benchmarks.

Use Cases & Strengths

ChemDFM-R-14B is particularly well-suited for applications requiring deep chemical understanding and explicit reasoning. Its ability to generate detailed reasoning chains improves reliability and practicality in real-world chemical research and development scenarios. Developers can leverage this model for tasks such as molecular analysis, reaction prediction, and general chemical problem-solving where interpretability and accuracy are paramount. The project also open-sources the functional group identification toolkit, ChemFG-Tool, for community use.