baidu/ERNIE-4.5-0.3B-PT
ERNIE-4.5-0.3B-PT is a 0.36 billion parameter text-dense post-trained language model developed by Baidu. This model is part of the ERNIE 4.5 series, featuring a 131072 token context length and optimized for general-purpose language understanding and generation. It utilizes Transformer-style PyTorch weights and is designed for efficient inference across various hardware platforms.
Loading preview...
ERNIE-4.5-0.3B-PT Overview
ERNIE-4.5-0.3B-PT is a 0.36 billion parameter text-dense language model from Baidu's ERNIE 4.5 series, distinguished by its use of Transformer-style PyTorch weights. It boasts a substantial context length of 131072 tokens, making it suitable for processing extensive textual inputs. This model is a post-trained variant, specifically optimized for general-purpose language understanding and generation tasks.
Key Capabilities
- Text Modality Focus: Primarily designed for text-based applications, excelling in language understanding and generation.
- High Context Length: Supports a 131072 token context, allowing for comprehensive analysis and generation of long-form content.
- Efficient Inference: Benefits from the ERNIE 4.5 series' scaling-efficient infrastructure, including techniques like multi-expert parallel collaboration and convolutional code quantization for optimized performance.
- Post-Training Optimization: Utilizes Supervised Fine-tuning (SFT), Direct Preference Optimization (DPO), or Unified Preference Optimization (UPO) for enhanced performance in real-world applications.
Good For
- General Language Tasks: Ideal for applications requiring robust text understanding and generation capabilities.
- Long Document Processing: Its large context window makes it well-suited for tasks involving lengthy texts, such as summarization, question answering, or content creation from extensive sources.
- PyTorch Ecosystem Integration: Seamlessly integrates with the
transformerslibrary (version 4.54.0 or newer) and supports vLLM inference, catering to developers working within the PyTorch environment.