bambisheng/UltraIF-8B-SFT

TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:32kPublished:Apr 3, 2025License:apache-2.0Architecture:Transformer0.0K Open Weights Cold

UltraIF-8B-SFT is an 8 billion parameter instruction-tuned causal language model developed by bambisheng, fine-tuned from Llama-3.1-8B. This model specializes in advanced instruction following by decomposing complex user instructions into simplified ones and constraints, utilizing a 'Generate-then-Evaluate' framework. It is designed to synthesize instructions with diverse constraints and ensure response correctness, making it suitable for applications requiring precise adherence to detailed prompts.

Loading preview...

UltraIF-8B-SFT: Advanced Instruction Following

UltraIF-8B-SFT is an 8 billion parameter language model developed by bambisheng, fine-tuned from the Llama-3.1-8B architecture. It leverages a unique approach to enhance instruction following capabilities, particularly for complex and constrained prompts. The model was trained using 175,000 data points from the UltraIF SFT Data.

Key Capabilities

  • Instruction Decomposition: The model employs an "UltraComposer" to break down intricate user instructions into simpler components and identify associated constraints.
  • Constraint Synthesis: It facilitates the generation of responses that adhere to complex and diverse constraints specified in the instructions.
  • Generate-then-Evaluate Framework: This process integrates the UltraComposer to incorporate constraints, followed by an evaluation mechanism that assesses response quality against corresponding evaluation questions.
  • Enhanced Reliability: The framework is designed to ensure the correctness and reliability of generated outputs by systematically evaluating adherence to instructions and constraints.

Good For

  • Applications requiring precise and reliable adherence to detailed instructions.
  • Tasks involving complex prompts with multiple constraints.
  • Scenarios where the correctness of generated responses based on specific rules is critical.

For more technical details, refer to the UltraIF Paper.