shanchen/llama3-8B-slerp-med-chinese

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:8kPublished:Apr 30, 2024License:llama3Architecture:Transformer0.0K Warm

shanchen/llama3-8B-slerp-med-chinese is an 8 billion parameter language model created by shanchen, formed by merging WiNGPT2-Llama-3-8B-Base and JSL-MedLlama-3-8B-v1.0 using a slerp merge method. This model is specifically designed for medical applications, leveraging the strengths of its base models to provide enhanced performance in medical contexts. It processes inputs with a context length of 8192 tokens, making it suitable for detailed medical text analysis and generation.

Loading preview...

Overview

shanchen/llama3-8B-slerp-med-chinese is an 8 billion parameter language model developed by shanchen. It is a merged model, combining the capabilities of two specialized base models: winninghealth/WiNGPT2-Llama-3-8B-Base and johnsnowlabs/JSL-MedLlama-3-8B-v1.0. The merge was performed using a slerp (spherical linear interpolation) method, specifically configured to blend the self-attention and MLP layers of the constituent models.

Key Capabilities

  • Medical Domain Specialization: Inherits and combines the medical knowledge and language understanding from both WiNGPT2-Llama-3-8B-Base and JSL-MedLlama-3-8B-v1.0, making it highly effective for tasks within the healthcare and medical fields.
  • Llama 3 Architecture: Built upon the Llama 3 architecture, providing a robust foundation for language processing.
  • 8192 Token Context Window: Supports a substantial context length, allowing for the processing of longer medical texts and complex queries.

Good For

  • Applications requiring advanced natural language understanding in medical contexts.
  • Tasks such as medical text summarization, question answering, and information extraction from clinical notes or research papers.
  • Developers looking for a specialized LLM with strong performance in the medical domain, leveraging the combined strengths of established medical language models.

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p