jastorj/snowflake_arctic_text2sql_r1_7b-nl2sqlpp-16bit-v5.7.5_phase_1-cw-12K

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:7.6BQuant:FP8Ctx Length:32kPublished:May 6, 2026License:apache-2.0Architecture:Transformer Open Weights Warm

The jastorj/snowflake_arctic_text2sql_r1_7b-nl2sqlpp-16bit-v5.7.5_phase_1-cw-12K model is a 7.6 billion parameter text-to-SQL generation model, fine-tuned from Snowflake/Arctic-Text2SQL-R1-7B. It specializes in converting natural language queries into SQL++ queries, leveraging the NL2SQL++ v8 dataset with code-with-thought reasoning. This model is optimized for accurate SQL++ query generation, particularly for complex database schemas and nuanced natural language instructions, and has a context length of 32768 tokens.

Loading preview...

Snowflake Arctic Text2SQL R1 7B Fine-tuned for NL2SQL++ v8

This model is a specialized 7.6 billion parameter language model, fine-tuned from the original Snowflake/Arctic-Text2SQL-R1-7B. Its core function is Text-to-SQL generation, specifically for the SQL++ query language.

Key Capabilities

  • SQL++ Query Generation: Translates natural language questions into precise SQL++ queries.
  • Code-with-Thought Reasoning: Trained on the NL2SQL++ v8 dataset, which incorporates detailed reasoning steps to improve query accuracy and understanding of complex schema interactions.
  • Schema Awareness: Designed to generate queries based on provided database schemas, including handling specific SQL++ syntax requirements like backtick-quoting reserved keywords and using TO_NUMBER() for numeric aggregations on string fields.
  • Context Handling: Supports a substantial context length of 32768 tokens, allowing for detailed schema definitions and complex natural language queries.
  • Fine-tuning Method: Utilizes LoRA (Low-Rank Adaptation) with Unsloth for efficient fine-tuning, resulting in 16-bit merged weights.

Good For

  • Automating SQL++ Query Creation: Ideal for applications requiring programmatic generation of SQL++ queries from user input.
  • Complex Data Exploration: Facilitates querying complex, nested data structures common in NoSQL databases that use SQL++.
  • Developer Tools: Can be integrated into developer environments to assist in writing accurate and optimized SQL++ queries, especially when dealing with intricate join conditions, aggregations, and schema-specific nuances.