tablegpt/TableGPT2-7B
Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:7.6BQuant:FP8Ctx Length:32kPublished:Nov 1, 2024License:apache-2.0Architecture:Transformer0.2K Open Weights Warm

TableGPT2-7B is a 7.6 billion parameter decoder-only large language model developed by Zhejiang University, built upon the Qwen2.5 architecture. It is specifically tailored for data-intensive tasks, excelling at interpreting and analyzing tabular data. Optimized for coding tasks, data interpretation, and business intelligence-focused question answering, it supports both text and tabular data inputs and has a context length of 131072 tokens.

Loading preview...

TableGPT2-7B: Specialized for Tabular Data

TableGPT2-7B is a 7.6 billion parameter large language model developed by Zhejiang University, designed to bridge the gap between general LLM capabilities and the specific demands of tabular/structured data tasks. Built on the Qwen2.5 architecture, it features specialized encoding for tabular data, including a unique semantic encoder to interpret rows, columns, and entire tables.

Key Capabilities

  • Tabular Data Processing: Accepts tabular data structured as df.head() results, alongside text inputs.
  • Optimized Output: Generates text-based outputs specifically for coding tasks, data interpretation, and BI-focused question answering.
  • Enhanced Performance: Achieves significant performance increases (35.20% over comparable models on standard benchmarks and 49.32% on BI-focused assessments) in tabular comprehension, code generation, and structured data reasoning.
  • Multilingual Support: Primarily emphasizes Chinese corpora, with limited support for other languages.
  • Agent Integration: Recommended for use with the tablegpt-agent GitHub repository for complex usage scenarios and enhanced performance.

Good For

  • Business intelligence (BI) applications.
  • Automated data-driven analysis.
  • Tasks involving databases or data warehouses.
  • Natural language to SQL conversion.
  • Table understanding and question answering over structured data.