openbmb/SciCore-Mol

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:32kPublished:Apr 29, 2026License:apache-2.0Architecture:Transformer0.0K Open Weights Warm

SciCore-Mol is an 8-billion parameter large language model developed by Tsinghua University and Peking University, augmented with pluggable external cognitive modules for molecular understanding. It integrates a GVP encoder, a diffusion generator, and a numerical-sensitive Reaction Transformer to process topological and geometric molecular data. This architecture enables the LLM to deeply understand molecular information, making it specialized for scientific domains involving chemistry and molecular reasoning.

Loading preview...

SciCore-Mol: LLM Augmented for Molecular Cognition

SciCore-Mol is an 8B parameter model developed by Tsinghua University and Peking University, designed to overcome the limitations of traditional LLMs in processing complex scientific data, particularly molecular structures. Unlike standard LLMs that struggle with the topological and geometric nature of molecules when represented as linear text, SciCore-Mol integrates specialized external cognitive modules.

Key Capabilities & Architecture

  • Pluggable Modules: Augments the base LLM with a GVP encoder for molecular structure encoding, a diffusion generator for molecular generation, and a numerical-sensitive Reaction Transformer for tasks like yield prediction.
  • Two-Stage Alignment: Utilizes a two-stage alignment mechanism where external modules are invoked via special tokens and fused at the hidden-state level, preserving the LLM's general reasoning while adding specialized molecular perception.
  • Three-Stage Training Pipeline: Involves independent component pre-training, cross-modal alignment training (SFT), and task-specific fine-tuning.

Specialized Use Cases

  • Molecular Reasoning: Excels in tasks requiring deep understanding of molecular information, such as product prediction, retrosynthesis, and yield prediction.
  • Drug Discovery: Applicable to drug optimization tasks, including ADMET scoring.
  • Scientific Research: Evaluated across various chemistry benchmarks like ChemBench4K, MMLU Chemistry subsets, ORD Reaction Prediction, and SMolInstruct molecular tasks.