cycloneboy/CscSQL-Grpo-XiYanSQL-QwenCoder-7B-2502
The cycloneboy/CscSQL-Grpo-XiYanSQL-QwenCoder-7B-2502 is a 7.6 billion parameter model developed by CycloneBoy, specifically fine-tuned for text-to-SQL generation. This model utilizes a novel Corrective Self-Consistency (CSC-SQL) method, integrating Self-Consistency and Self-Correction, and is further enhanced by Group Relative Policy Optimization (GRPO) via reinforcement learning. It excels at translating natural language questions into accurate SQL queries, achieving 71.72% execution accuracy on the BIRD private test set.
Loading preview...
Overview
This model, CscSQL-Grpo-XiYanSQL-QwenCoder-7B-2502, is a 7.6 billion parameter language model developed by CycloneBoy, specifically designed for advanced text-to-SQL generation. It is part of the CSC-SQL framework, which introduces a novel approach combining Self-Consistency and Self-Correction to improve SQL query accuracy. The model is fine-tuned using Group Relative Policy Optimization (GRPO) via reinforcement learning, significantly enhancing its ability to generate correct and robust SQL queries from natural language.
Key Capabilities
- Corrective Self-Consistency (CSC-SQL): Integrates Self-Consistency and Self-Correction by selecting the two most frequent outputs from parallel sampling and feeding them into a merge revision model for correction.
- Reinforcement Learning Fine-tuning: Employs the GRPO algorithm to fine-tune both the SQL generation and revision models, leading to enhanced output quality.
- High Accuracy: Achieves 71.72% execution accuracy on the BIRD private test set with its 7B variant, demonstrating strong performance in complex text-to-SQL tasks.
- Robust Error Handling: Addresses both syntactic errors (via Self-Correction) and suboptimal output selection (via Self-Consistency) more effectively than traditional methods.
Good For
- Text-to-SQL Applications: Ideal for systems requiring accurate translation of natural language questions into SQL queries for relational databases.
- Database Interaction: Useful for developers building interfaces that allow users to query databases using natural language.
- Research in LLM-based SQL Generation: Provides a strong baseline and methodology for further research in improving the reliability and accuracy of LLMs for database interactions.