Name: cycloneboy/CscSQL-Grpo-XiYanSQL-QwenCoder-7B-2502 API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: cycloneboy

Overview

This model, CscSQL-Grpo-XiYanSQL-QwenCoder-7B-2502, is a 7.6 billion parameter language model developed by CycloneBoy, specifically designed for advanced text-to-SQL generation. It is part of the CSC-SQL framework, which introduces a novel approach combining Self-Consistency and Self-Correction to improve SQL query accuracy. The model is fine-tuned using Group Relative Policy Optimization (GRPO) via reinforcement learning, significantly enhancing its ability to generate correct and robust SQL queries from natural language.

Key Capabilities

Corrective Self-Consistency (CSC-SQL): Integrates Self-Consistency and Self-Correction by selecting the two most frequent outputs from parallel sampling and feeding them into a merge revision model for correction.
Reinforcement Learning Fine-tuning: Employs the GRPO algorithm to fine-tune both the SQL generation and revision models, leading to enhanced output quality.
High Accuracy: Achieves 71.72% execution accuracy on the BIRD private test set with its 7B variant, demonstrating strong performance in complex text-to-SQL tasks.
Robust Error Handling: Addresses both syntactic errors (via Self-Correction) and suboptimal output selection (via Self-Consistency) more effectively than traditional methods.

Good For

Text-to-SQL Applications: Ideal for systems requiring accurate translation of natural language questions into SQL queries for relational databases.
Database Interaction: Useful for developers building interfaces that allow users to query databases using natural language.
Research in LLM-based SQL Generation: Provides a strong baseline and methodology for further research in improving the reliability and accuracy of LLMs for database interactions.

Overview

Overview

Key Capabilities

Good For

Full Model Card (README)