hamishivi/Nemotron-Research-Reasoning-Qwen-1.5B-v2-RLVE
TEXT GENERATIONConcurrency Cost:1Model Size:1.5BQuant:BF16Ctx Length:32kPublished:Nov 9, 2025License:cc-by-nc-4.0Architecture:Transformer0.0K Open Weights Cold

The hamishivi/Nemotron-Research-Reasoning-Qwen-1.5B-v2-RLVE model is a 1.5 billion parameter language model, fine-tuned from Nemotron-Research-Reasoning-Qwen-1.5B using the RLVE (Reinforcement Learning with Verifiable Environments) method. It demonstrates enhanced performance across various reasoning and problem-solving benchmarks, including AIME, OMEGA-500, OlympiadBench, BBEH, and LiveCodeBench-v6. This model is specifically optimized for complex reasoning tasks, making it suitable for applications requiring advanced analytical capabilities.

Loading preview...