NovaSky-AI/Sky-T1-32B-Flash

Available on Hugging Face

  • Task: Text Generation
  • Model Size: 32.8B parameters
  • Quantization: FP8
  • Context Length: 32k tokens
  • Concurrency Cost: 2
  • Published: Jan 23, 2025
  • License: apache-2.0
  • Architecture: Transformer
  • Open Weights

NovaSky-AI/Sky-T1-32B-Flash is a 32.8 billion parameter reasoning model developed by the NovaSky Team at Sky Computing Lab, UC Berkeley. It is preference-optimized to significantly reduce generation lengths while maintaining accuracy, achieving up to a 57% reduction in output length compared to its preview version. This model excels in math and coding tasks, offering performance on par with other leading models but with more concise outputs.


Model Overview

NovaSky-AI/Sky-T1-32B-Flash is a 32.8 billion parameter reasoning model developed by the NovaSky Team at Sky Computing Lab, UC Berkeley. It is an optimized version of Sky-T1-32B-Preview, specifically engineered to reduce the length of generated responses without compromising accuracy, particularly in math and coding domains. This optimization results in up to a 57% reduction in generation lengths on hard coding tasks.
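
A minimal usage sketch with Hugging Face transformers is shown below. It assumes a recent transformers release and enough GPU memory (or a multi-GPU or quantized setup) to hold the 32.8B-parameter weights; the prompt is only an example.

```python
# Minimal sketch: loading Sky-T1-32B-Flash with Hugging Face transformers.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "NovaSky-AI/Sky-T1-32B-Flash"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype="auto",   # use the checkpoint's native dtype
    device_map="auto",    # spread layers across available GPUs
)

messages = [{"role": "user", "content": "Prove that the sum of two even integers is even."}]
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

output_ids = model.generate(input_ids, max_new_tokens=1024)
print(tokenizer.decode(output_ids[0][input_ids.shape[-1]:], skip_special_tokens=True))
```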

Key Capabilities & Optimizations

  • Concise Reasoning: Significantly reduces output length (e.g., 33% on Math500, 57% on LCB Hard) compared to its predecessor, Sky-T1-32B-Preview, while maintaining comparable accuracy.
  • Strong Performance: Achieves performance in math and coding tasks on par with models like o1-preview, as demonstrated by evaluations on benchmarks such as Math500, AIME24, and LCB (Easy, Medium, Hard).
  • Preference Optimization: Trained with Simple Preference Optimization (SimPO) on 10K preference pairs in the math and coding domains generated from Sky-T1-32B-Preview (a sketch of the objective follows this list).
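
For orientation, the sketch below shows the general shape of the SimPO objective: a reference-free preference loss with length-normalized rewards, which is what pushes the model toward shorter responses. It is an illustrative reconstruction of the published SimPO formulation, not the NovaSky training code, and the beta/gamma hyperparameters are placeholders.

```python
# Illustrative SimPO loss sketch (not the NovaSky training code).
import torch.nn.functional as F

def simpo_loss(chosen_logps, rejected_logps, chosen_lens, rejected_lens,
               beta=2.0, gamma=0.5):  # placeholder hyperparameters
    """SimPO loss over a batch of preference pairs.

    chosen_logps / rejected_logps: summed policy log-probabilities of the
    preferred (concise, correct) and dispreferred (verbose) responses.
    chosen_lens / rejected_lens: response lengths in tokens, used for the
    length normalization that penalizes needlessly long generations.
    """
    # Length-normalized implicit rewards; no reference model is needed.
    chosen_reward = beta * chosen_logps / chosen_lens
    rejected_reward = beta * rejected_logps / rejected_lens
    # Bradley-Terry style objective with a target reward margin gamma.
    return -F.logsigmoid(chosen_reward - rejected_reward - gamma).mean()
```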

When to Use This Model

This model is ideal for applications where efficient and concise reasoning outputs are critical, especially in:

  • Mathematical Problem Solving: For tasks requiring accurate yet shorter explanations or solutions.
  • Code Generation and Analysis: When developers need precise code-related outputs without excessive verbosity.
  • Resource-Constrained Environments: Its reduced output length can lead to lower inference costs and faster processing, making it suitable for scenarios where token usage is a concern.
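
A minimal inference sketch follows. It assumes an OpenAI-compatible chat completions endpoint (Featherless exposes one; the base URL and API key below are placeholders to replace with your own) and uses max_tokens to keep per-request token costs bounded.

```python
# Minimal sketch: calling Sky-T1-32B-Flash through an OpenAI-compatible API.
from openai import OpenAI

client = OpenAI(
    base_url="https://api.featherless.ai/v1",  # assumed endpoint; substitute your own
    api_key="YOUR_API_KEY",                    # placeholder
)

response = client.chat.completions.create(
    model="NovaSky-AI/Sky-T1-32B-Flash",
    messages=[{"role": "user", "content": "Write a Python function that merges two sorted lists."}],
    max_tokens=512,  # cap output length to keep token costs predictable
)
print(response.choices[0].message.content)
```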

Popular Sampler Settings

The most popular parameter combinations used by Featherless users for this model cover the following samplers: temperature, top_p, top_k, frequency_penalty, presence_penalty, repetition_penalty, and min_p.
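
These samplers map directly onto a standard chat completions request. The sketch below reuses the client from the earlier example; the values are placeholders rather than the user-popular configs summarized above, and the samplers outside the OpenAI spec (top_k, repetition_penalty, min_p) are passed through the OpenAI client's extra_body field.

```python
# Passing sampler settings in a chat completions request (placeholder values).
response = client.chat.completions.create(
    model="NovaSky-AI/Sky-T1-32B-Flash",
    messages=[{"role": "user", "content": "What is the sum of the first 100 positive integers?"}],
    temperature=0.7,
    top_p=0.95,
    frequency_penalty=0.0,
    presence_penalty=0.0,
    max_tokens=512,
    extra_body={               # samplers outside the OpenAI spec go here
        "top_k": 40,
        "repetition_penalty": 1.05,
        "min_p": 0.05,
    },
)
print(response.choices[0].message.content)
```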