DJLougen/Nemotron-Research-GooseReason-4B-Instruct-MLX-16bit
TEXT GENERATIONConcurrency Cost:1Model Size:4BQuant:BF16Ctx Length:32kPublished:Mar 5, 2026License:cc-by-nc-4.0Architecture:Transformer Open Weights Warm

DJLougen/Nemotron-Research-GooseReason-4B-Instruct-MLX-16bit is an MLX-optimized, 16-bit full-precision version of NVIDIA's 4.4 billion parameter Nemotron-Research-GooseReason-4B-Instruct model, built on the Qwen3-4B-Instruct-2507 architecture. This model, trained with Reinforcement Learning with Verifiable Rewards (RLVR), excels in mathematical, coding, and STEM reasoning tasks. It features a maximum sequence length of 32,768 tokens and is specifically designed for complex reasoning, utilizing an extended thinking mode.

Loading preview...