lamm-mit/Graph-Preflexor-8b_12292025

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:32kPublished:Dec 29, 2025License:apache-2.0Architecture:Transformer0.0K Open Weights Warm

Graph-Preflexor-8b_12292025 by lamm-mit is an 8 billion parameter language model, based on Qwen3-8B, specifically fine-tuned for scientific reasoning and knowledge discovery. It generates structured intermediate representations, including machine-readable knowledge graphs, to provide transparent and inspectable reasoning traces. This model excels at graph-native reasoning and AI-for-science applications where explicit structure and interpretability are crucial.

Loading preview...

Graph-Preflexor-8b_12292025: Structured Scientific Reasoning

This 8 billion parameter model, developed by lamm-mit, is engineered for advanced scientific reasoning by generating explicit, structured intermediate representations. Built upon Qwen3-8B, it underwent a two-stage fine-tuning process:

Key Capabilities

  • Structured Reasoning Traces: Emits reasoning in distinct "sentinel" blocks like <brainstorm>, <graph>, <graph_json>, <patterns>, and <synthesis>, making the thought process inspectable.
  • Machine-Readable Knowledge Graphs: Generates canonical JSON-formatted knowledge graphs (<graph_json>) with nodes and edges, enabling programmatic extraction and downstream tooling.
  • Two-Stage Fine-Tuning: Initially aligned using ORPO (Offline Reinforcement Preference Optimization) for structured output, then further optimized with GRPO (Generative Reinforcement Preference Optimization) using an external LLM judge (grok-4-1-fast-non-reasoning).
  • Multi-component Reward System: GRPO training incorporated a weighted reward function considering answer correctness, format compliance, graph utility, graph validity (NetworkX), graph diversity, and graph structure quality.

Good For

  • AI-for-science: Ideal for applications requiring transparent and structured scientific inquiry.
  • Graph-native Reasoning: Excels in tasks where reasoning can be explicitly represented and analyzed as knowledge graphs.
  • Knowledge Discovery Workflows: Supports workflows where interpretability and the ability to extract structured knowledge are as important as accuracy.
  • Interpretability: Provides segmented reasoning by function (explore, formalize, abstract, explain), enhancing understanding of the model's decision-making.