Snowflake/Arctic-Text2SQL-R1-7B

Warm
Public
7.6B
FP8
32768
1
May 22, 2025
License: apache-2.0
Hugging Face
Overview

Arctic-Text2SQL-R1-7B: Specialized Text-to-SQL Model

Arctic-Text2SQL-R1-7B is a 7.6 billion parameter model developed by Snowflake, specifically designed for converting natural language questions into executable SQL queries. It leverages a lightweight Reinforcement Learning formulation, Group Relative Policy Optimization (GRPO), using only execution correctness and syntax validity as reward signals. This approach allows it to achieve strong performance in Text-to-SQL tasks with a significantly smaller parameter count compared to many 70B+ models.

Key Capabilities

  • High Accuracy: Achieves 68.9% execution accuracy on BIRD-dev and 68.5% on BIRD-test, with an average of 57.2% across six diverse benchmarks including Spider and EHRSQL.
  • Efficiency: Delivers state-of-the-art Text-to-SQL performance with only 7.6 billion parameters, making it more efficient than many larger alternatives.
  • Focused Task: Optimized exclusively for Text-to-SQL generation, ensuring high precision in converting natural language to database queries.

Good For

  • Natural Language Database Interfaces: Creating interactive systems where users can query databases using plain language.
  • Data Analytics Tools: Empowering non-technical users to perform data analysis by generating SQL queries from their questions.

Limitations

  • Not intended for general-purpose text generation or free-form natural language tasks.
  • Requires validation for production use, especially with sensitive data, to prevent potential data leakage or unauthorized access.