maximuslee07/code-llama-7b-text-to-sql

TEXT GENERATIONConcurrency Cost:1Model Size:7BQuant:FP8Ctx Length:4kPublished:Jan 23, 2024License:llama2Architecture:Transformer Open Weights Cold

The maximuslee07/code-llama-7b-text-to-sql model is a 7 billion parameter Code Llama-based language model, fine-tuned from codellama/CodeLlama-7b-hf. This model is specifically optimized for text-to-SQL generation tasks, leveraging its 4096-token context length to accurately convert natural language queries into SQL commands. It is designed for applications requiring robust and precise SQL code generation from user input.

Loading preview...

Model Overview

This model, maximuslee07/code-llama-7b-text-to-sql, is a specialized language model fine-tuned for the task of converting natural language into SQL queries. It is built upon the CodeLlama-7b-hf architecture, a 7 billion parameter model developed by Code Llama, and has been adapted for enhanced performance in text-to-SQL generation.

Key Capabilities

  • Text-to-SQL Generation: The primary function of this model is to translate natural language descriptions or questions into executable SQL code.
  • CodeLlama Foundation: Benefits from the robust code understanding and generation capabilities inherent in the Code Llama base model.
  • Fine-tuned Performance: Optimized through specific training on a generator dataset to improve accuracy and relevance for SQL generation tasks.

Training Details

The model was trained with the following key hyperparameters:

  • Learning Rate: 0.0002
  • Batch Size: 3 (train), 8 (eval)
  • Epochs: 3
  • Optimizer: Adam with betas=(0.9, 0.999) and epsilon=1e-08

Intended Use Cases

This model is particularly well-suited for applications where users need to interact with databases using natural language, such as:

  • Building natural language interfaces for databases.
  • Automating SQL query generation from user requests.
  • Assisting developers in writing complex SQL queries more efficiently.