Langboat/Mengzi3-8B-Chat

Warm
Public
8B
FP8
8192
License: apache-2.0
Hugging Face
Overview

Mengzi3-8B-Chat Overview

Langboat's Mengzi3-8B-Chat is an 8 billion parameter conversational large language model. It is designed to serve as a helpful assistant, capable of engaging in general chat interactions. The model is released under the Apache 2.0 license, allowing for both academic research and free commercial applications.

Key Capabilities & Performance

Mengzi3-8B-Chat demonstrates solid performance across a range of common LLM benchmarks:

  • MMLU (5-shot): 63.9
  • GSM8k (4-shot): 75.4
  • MATH (4-shot): 24.5
  • HumanEval: 62.2
  • MT-Bench: 8.19
  • AlignBench: 6.96

These scores indicate its proficiency in multi-task language understanding, mathematical reasoning, and coding tasks, alongside its conversational abilities.

Usage and Licensing

The model can be easily integrated using the Hugging Face transformers library for inference. Detailed code examples for both inference and fine-tuning are available on the Langboat Github repository. Langboat provides Mengzi3-8B-Chat "as is" and disclaims responsibility for any issues arising from its use, urging users to adhere to ethical and legal guidelines. Commercial use is permitted, with options to contact Langboat for specific business licenses or collaborations.