invincible-jha/SynLogic-32B

Text Generation · Concurrency Cost: 2 · Model Size: 32.8B · Quant: FP8 · Ctx Length: 32k · Published: Apr 16, 2026 · License: MIT · Architecture: Transformer · Open Weights · Cold

SynLogic-32B is a 32.8 billion parameter reasoning model developed by invincible-jha, built upon the Qwen2.5-32B-Base architecture. Trained using reinforcement learning on the SynLogic dataset, it excels at complex logical reasoning tasks, including Sudoku and Game of 24. This model demonstrates strong generalization capabilities to mathematical problem-solving, achieving state-of-the-art performance on the BBEH benchmark among open-source logical reasoning models.

SynLogic-32B: Advanced Logical Reasoning Model

SynLogic-32B, developed by invincible-jha, is a 32.8 billion parameter model based on Qwen2.5-32B-Base, specifically fine-tuned for advanced logical reasoning. It leverages a novel reinforcement learning approach on the comprehensive SynLogic dataset, which includes 35 diverse logical reasoning tasks such as Sudoku, Game of 24, Cipher, and Arrow Maze. A key innovation is the verifiability of all training data, enabling highly effective reinforcement learning through binary rewards based on format adherence and correctness.

Key Capabilities

  • Comprehensive Logical Reasoning: Proficient in a wide array of logical puzzles and challenges.
  • Strong Generalization: Demonstrates the ability to transfer learned logical reasoning skills to mathematical problem-solving without explicit mathematical training.
  • Verifiable Training: Utilizes a unique dataset where all training samples can be automatically verified, enhancing model reliability and performance.
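Because every SynLogic sample can be checked automatically, the binary reward described above reduces to two tests: format adherence and answer correctness. The sketch below illustrates the idea for a Game of 24 instance; the tag format and function names are illustrative assumptions, not SynLogic's actual implementation.

```python
import re

def check_format(response: str) -> bool:
    """Format adherence: reasoning wrapped in <think> tags, final answer in
    <answer> tags. (Tag names are an assumption for illustration only.)"""
    return bool(re.fullmatch(r"\s*<think>.*</think>\s*<answer>.*</answer>\s*",
                             response, re.DOTALL))

def check_game_of_24(answer_expr: str, numbers: list[int]) -> bool:
    """Correctness: the expression must use each given number exactly once
    and evaluate to 24."""
    used = sorted(int(n) for n in re.findall(r"\d+", answer_expr))
    if used != sorted(numbers):
        return False
    try:
        # eval is acceptable here only because the verifier controls the input
        return abs(eval(answer_expr) - 24) < 1e-6
    except (SyntaxError, ZeroDivisionError):
        return False

def binary_reward(response: str, numbers: list[int]) -> float:
    """Reward is 1.0 only when the response is well-formatted AND correct."""
    if not check_format(response):
        return 0.0
    answer = re.search(r"<answer>(.*?)</answer>", response, re.DOTALL).group(1)
    return 1.0 if check_game_of_24(answer, numbers) else 0.0

resp = "<think>8-2=6 and 6*4=24, keep the 1</think><answer>(8-2)*4*1</answer>"
print(binary_reward(resp, [8, 2, 4, 1]))  # 1.0 for a valid solution
```

An all-or-nothing reward like this gives the RL loop an unambiguous training signal, which is what makes the dataset's verifiability so useful.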

Performance Highlights

SynLogic-32B achieves a notable +6 point improvement over DeepSeek-R1-Distill-Qwen-32B on the challenging BBEH benchmark, establishing it as a leading open-source model for logical reasoning. While excelling in BBEH, it also maintains competitive performance on KOR-Bench and BBH. The model was trained using the GRPO (Group Relative Policy Optimization) algorithm on 33,000 SynLogic-Hard samples with controlled difficulty.
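GRPO's core idea is to score each sampled response against the mean reward of its own sampling group, replacing a learned value baseline. A minimal sketch of that group-relative advantage computation, as a generic illustration rather than the actual SynLogic training code:

```python
from statistics import mean, pstdev

def grpo_advantages(rewards: list[float], eps: float = 1e-8) -> list[float]:
    """Group Relative Policy Optimization baseline: each response's advantage
    is its reward normalized by the group's mean and standard deviation,
    so no separate value network is needed."""
    mu = mean(rewards)
    sigma = pstdev(rewards)
    return [(r - mu) / (sigma + eps) for r in rewards]

# With binary rewards, the correct samples in a group get positive advantage
# and the incorrect ones negative, pushing the policy toward verified answers.
group_rewards = [1.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0]
advantages = grpo_advantages(group_rewards)
```

Pairing this with the binary verifiable rewards above means groups with mixed outcomes carry the learning signal, while all-correct or all-wrong groups contribute near-zero advantages.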

Good for

  • Applications requiring robust logical deduction and problem-solving.
  • Tasks involving complex puzzles and reasoning challenges.
  • Research into advanced reasoning capabilities and generalization in LLMs.