WeatherGPT: Natural Language to JSON Conversion
BobaZooba/WGPT is a specialized model built on Mistral-7B-v0.1, designed to accurately transform free-form weather descriptions into valid JSON. This model addresses a common challenge in LLM engineering by providing a robust solution for structured data extraction from unstructured text, particularly for weather-related information.
Key Capabilities
- Precise JSON Generation: Converts diverse weather descriptions into a consistent JSON schema, ensuring all outputs are 100% parseable.
- Efficient Fine-tuning: Utilizes advanced techniques like QLoRA, DeepSpeed Stage 2, and 4-bit quantization for optimized training on a Mistral-7B backbone.
- Synthetic Data Generation: Demonstrates a scalable approach to dataset creation using ChatGPT for few-shot examples, significantly reducing data collection costs and time.
- Targeted Loss Calculation: Focuses loss calculation exclusively on the JSON output portion during training, enhancing accuracy for the specific task.
Good for
- Structured Data Extraction: Ideal for applications requiring the conversion of natural language weather reports into machine-readable JSON.
- LLM Engineering Assessments: Serves as a practical example for common LLM engineering tasks involving data parsing and structured output generation.
- Rapid Prototyping: Provides a ready-to-use solution for integrating weather data parsing into various systems or services.
- Cost-Effective Data Generation: Showcases a methodology for creating high-quality, diverse datasets using large language models, applicable to other domains.