peakji/steiner-32b-preview
TEXT GENERATIONConcurrency Cost:2Model Size:32.8BQuant:FP8Ctx Length:32kPublished:Oct 18, 2024License:apache-2.0Architecture:Transformer0.1K Open Weights Cold

Steiner-preview is a 32.8 billion parameter reasoning model developed by Yichao 'Peak' Ji, inspired by OpenAI o1. It is trained on synthetic data using reinforcement learning to explore multiple reasoning paths autoregressively, enabling autonomous verification and backtracking. This model is designed for complex reasoning tasks, though it is a work-in-progress and does not yet fully replicate o1's inference-time scaling capabilities. It features a 131072 token context length and is compatible with standard inference services like vLLM.

Loading preview...