XGenerationLab/XiYanSQL-QwenCoder-32B-2412

Warm
Public
32B
FP8
32768
License: apache-2.0
Hugging Face
Overview

Overview

The XiYanSQL-QwenCoder series, developed by XGenerationLab, focuses on advancing large language models for text-to-SQL applications. This 32 billion parameter model is part of a family that includes 3B, 7B, and 14B versions, catering to diverse developer needs. It is engineered to directly perform text-to-SQL tasks or serve as a robust base for further fine-tuning of SQL models.

Key Capabilities

  • High-Performance SQL Generation: The XiYanSQL-QwenCoder-32B achieves a 69.03% EX score on the BIRD TEST set, establishing a new state-of-the-art for single fine-tuned models in this domain.
  • Multi-Dialect Support: The model supports major SQL dialects such as SQLite, PostgreSQL, and MySQL, enhancing its applicability across various database environments.
  • Benchmark Excellence: Across the series, models maintain leading positions in their respective size categories, demonstrating strong capabilities on both BIRD and Spider benchmarks, outperforming models like Codellama-34b, Deepseek-coder-33b, and even some larger commercial models in specific text-to-SQL metrics.

Good For

  • Direct Text-to-SQL Conversion: Ideal for applications requiring accurate SQL query generation from natural language inputs.
  • SQL Model Fine-tuning: Provides an excellent starting point for developers looking to fine-tune SQL models for specific use cases or proprietary databases.
  • Multi-Database Environments: Suitable for projects that interact with different SQL database types due to its dialect support.