Name: shanchen/llama3-8B-slerp-med-chinese API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: shanchen

Overview

shanchen/llama3-8B-slerp-med-chinese is an 8 billion parameter language model developed by shanchen. It is a merged model, combining the capabilities of two specialized base models: winninghealth/WiNGPT2-Llama-3-8B-Base and johnsnowlabs/JSL-MedLlama-3-8B-v1.0. The merge was performed using a slerp (spherical linear interpolation) method, specifically configured to blend the self-attention and MLP layers of the constituent models.

Key Capabilities

Medical Domain Specialization: Inherits and combines the medical knowledge and language understanding from both WiNGPT2-Llama-3-8B-Base and JSL-MedLlama-3-8B-v1.0, making it highly effective for tasks within the healthcare and medical fields.
Llama 3 Architecture: Built upon the Llama 3 architecture, providing a robust foundation for language processing.
8192 Token Context Window: Supports a substantial context length, allowing for the processing of longer medical texts and complex queries.

Good For

Applications requiring advanced natural language understanding in medical contexts.
Tasks such as medical text summarization, question answering, and information extraction from clinical notes or research papers.
Developers looking for a specialized LLM with strong performance in the medical domain, leveraging the combined strengths of established medical language models.

Overview

Overview

Key Capabilities

Good For

Full Model Card (README)