Vedant3907/Text-to-Sql-llama3.1-8B

Warm
Public
8B
FP8
32768
License: apache-2.0
Hugging Face
Overview

Model Overview

Vedant3907/Text-to-Sql-llama3.1-8B is an 8.03 billion parameter causal language model, fine-tuned by Vedant Rajpurohit from the unsloth/Meta-Llama-3.1-8B base model. This model specializes in Text-to-SQL generation, converting natural language instructions into SQLite queries.

Key Capabilities

  • Text-to-SQL Generation: Translates English instructions into executable SQLite queries based on provided table schemas.
  • Parameter-Efficient Fine-tuning: Utilizes the Unsloth library with LoRA for efficient adaptation of the base Llama 3.1-8B model.
  • Specific Domain Focus: Optimized for SQL generation tasks, making it a targeted solution for database query needs.

Training Details

The model was fine-tuned on the first 5000 rows of the Clinton/Text-to-sql-v1 dataset over 250 steps. Training was conducted on a Google Colab T4 GPU, using a learning rate of 1e-4 and an adamw_8bit optimizer.

Limitations

  • The model's training was limited to 5000 rows and 250 steps, which may affect its generalization to broader or more complex SQL generation tasks.
  • It may produce incorrect or ambiguous SQL queries for instructions that are unclear or fall outside its training distribution.

Good For

  • Developers building applications that require converting natural language into SQL queries for SQLite databases.
  • Experimenting with fine-tuned Llama 3.1 models for specific domain tasks like Text-to-SQL.
  • Use cases where a smaller, specialized model for SQL generation is preferred over larger, general-purpose LLMs.