ContactDoctor/Bio-Medical-Llama-3-2-1B-CoT-012025

Loading
Public
1B
BF16
32768
Jan 2, 2025
License: other
Hugging Face
Gated
Overview

Bio-Medical-Llama-3-2-1B-CoT-012025 Overview

ContactDoctor's Bio-Medical-Llama-3-2-1B-CoT-012025 is a 1 billion parameter language model, fine-tuned from the Llama-3.2-1B-Instruct base model. It is specifically developed for the Healthcare & Lifesciences (HLS) domain, utilizing a proprietary "BioMedData" dataset comprising 625,000 examples. A key differentiator is the inclusion of 25,000 chain-of-thought (CoT) instruction samples, which significantly enhance its reasoning capabilities and logical coherence for complex biomedical tasks.

Key Capabilities

  • Domain-Specific Content Generation: Creates relevant content within healthcare and biomedical fields.
  • Complex Question Answering: Addresses intricate questions requiring step-by-step reasoning through its CoT enhancement.
  • Reasoning and Interpretability: Provides improved logical coherence and interpretability in its responses.
  • Evaluated Performance: Shows consistent performance improvements over general-purpose models of similar size on biomedical benchmarks like medmcqa, medqa_4options, and various MMLU subtasks (anatomy, clinical knowledge, college biology, college medicine, medical genetics, professional medicine, pubmedqa).

Intended Use Cases

  • Research Support: Aids researchers in data extraction and reasoning from biomedical texts.
  • Clinical Decision Support: Offers evidence-based information to assist in clinical decision-making.
  • Educational Tool: Serves as a resource for learning and understanding complex biomedical concepts.