Overview
Light-IF-4B is a 4-billion-parameter language model from Qihoo360, engineered to improve instruction following and generalizable reasoning. It tackles "lazy reasoning" with a framework that encourages rigorous thought through previewing and self-checking before generating a response; the approach is detailed in the accompanying technical report.
Key Capabilities
- Enhanced Instruction Following: Designed to adhere to complex instructions, including prompts that combine multiple constraints.
- Rigorous Reasoning: Utilizes a framework that promotes planning and verification of outputs, leading to more consistent and accurate reasoning.
- High-Quality Training: Trained on a small but high-quality dataset of complex instruction data, filtered for optimal difficulty, using Entropy-preserving Supervised Fine-Tuning (Entropy-SFT) and Token-wise Entropy-Adaptive Reinforcement Learning (TEA-RL).
- Competitive Performance: On instruction-following benchmarks such as SuperClue and IFBench, Light-IF-4B posts strong results, outperforming several larger open-source models and even some closed-source models on specific metrics.
Good For
- Applications requiring precise adherence to detailed instructions.
- Tasks where complex reasoning and planning are critical.
- Developers looking for a compact model (4B parameters) with strong instruction-following capabilities.
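For developers who want to try the model, a minimal quickstart could look like the sketch below. It assumes the model is published on the Hugging Face Hub under the repo id `qihoo360/Light-IF-4B` (an assumption, not confirmed by this card) and that it works with the standard `transformers` chat-template API; the `build_messages` helper is a hypothetical convenience function introduced here for illustration.

```python
# Hypothetical quickstart for Light-IF-4B via Hugging Face transformers.
# The repo id "qihoo360/Light-IF-4B" is an assumption; adjust to the
# actual published model id before use.

def build_messages(prompt: str) -> list[dict]:
    """Wrap a user prompt in the chat format expected by apply_chat_template."""
    return [{"role": "user", "content": prompt}]

def generate(prompt: str, max_new_tokens: int = 512) -> str:
    # Heavy imports kept local so the helper above stays dependency-free.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    model_id = "qihoo360/Light-IF-4B"  # assumed repo id
    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(
        model_id, torch_dtype=torch.bfloat16, device_map="auto"
    )

    # Render the chat prompt with the model's own template.
    text = tokenizer.apply_chat_template(
        build_messages(prompt), tokenize=False, add_generation_prompt=True
    )
    inputs = tokenizer(text, return_tensors="pt").to(model.device)
    out = model.generate(**inputs, max_new_tokens=max_new_tokens)

    # Strip the prompt tokens before decoding the completion.
    new_tokens = out[0][inputs["input_ids"].shape[-1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)

if __name__ == "__main__":
    print(generate("List three colors, one per line, in alphabetical order."))
```

Instruction-heavy prompts like the one above (ordering and formatting constraints in a single request) are the kind of task the model is positioned for.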