BioMistral/BioMistral-7B-TIES

TEXT GENERATIONConcurrency Cost:1Model Size:7BQuant:FP8Ctx Length:4kPublished:Feb 5, 2024License:apache-2.0Architecture:Transformer0.0K Open Weights Cold

BioMistral/BioMistral-7B-TIES is a 7 billion parameter language model developed by BioMistral, merged using the TIES method with Mistral-7B-Instruct-v0.1 as its base. This model is specifically pre-trained on PubMed Central data, making it highly specialized for biomedical and medical domains. It excels in medical question-answering tasks and offers competitive performance against other open-source and some proprietary medical models.

Loading preview...

BioMistral-7B-TIES: Specialized Medical LLM

BioMistral/BioMistral-7B-TIES is a 7 billion parameter language model, part of the BioMistral suite, specifically designed for medical domains. It is a merge of the original BioMistral-7B and Mistral-7B-Instruct-v0.1, created using the TIES merge method.

Key Capabilities & Features

  • Biomedical Specialization: Further pre-trained on extensive textual data from PubMed Central Open Access, enhancing its knowledge in healthcare and medicine.
  • Competitive Performance: Demonstrates superior performance on a benchmark of 10 established medical question-answering tasks in English compared to other open-source medical models like MedAlpaca, PMC-LLaMA, and MediTron.
  • Multilingual Evaluation: The broader BioMistral project includes the first large-scale multilingual evaluation of LLMs in the medical domain, with benchmarks translated into 7 languages.
  • Research Tool: Intended primarily as a research tool for exploring medical LLM capabilities.

When to Use This Model

  • Medical Research: Ideal for academic research in biomedical and healthcare natural language processing.
  • Medical QA Benchmarking: Suitable for evaluating and developing models for medical question-answering tasks.
  • Exploration of Merged Models: Useful for researchers interested in the TIES model merging strategy for domain adaptation.

Advisory: This model is not intended for deployment in production environments or for professional health and medical purposes without thorough alignment and testing in real-world clinical settings. It should be used strictly as a research tool due to potential risks, biases, and unassessed limitations.