johnsnowlabs/BioLing-7B-Dare
BioLing-7B-Dare is a 7 billion parameter language model developed by John Snow Labs, created by merging BioMistral/BioMistral-7B and Nexusflow/Starling-LM-7B-beta using the DARE TIES method. This model leverages a blend of specialized and general-purpose LLMs, featuring a context length of 8192 tokens. It is designed to combine the strengths of its base models, making it suitable for diverse natural language processing tasks.
BioLing-7B-Dare: A Merged Language Model
BioLing-7B-Dare is a 7 billion parameter language model developed by John Snow Labs. It is constructed with the DARE TIES merging technique, combining two distinct base models: BioMistral/BioMistral-7B and Nexusflow/Starling-LM-7B-beta. The merge configuration specifies a density of 0.53, with weights of 0.4 for BioMistral and 0.3 for Starling-LM, weighting the biomedical specialist slightly more heavily than the general-purpose chat model.
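The values above (method, density, per-model weights, int8 mask, bfloat16 dtype) map naturally onto a mergekit merge recipe. The following YAML is a hypothetical reconstruction, not the published config; in particular the `base_model` entry (the shared Mistral-7B ancestor that DARE TIES deltas are computed against) is an assumption:

```yaml
# Hypothetical mergekit config consistent with the values stated in this card.
models:
  - model: BioMistral/BioMistral-7B
    parameters:
      density: 0.53
      weight: 0.4
  - model: Nexusflow/Starling-LM-7B-beta
    parameters:
      density: 0.53
      weight: 0.3
merge_method: dare_ties
base_model: mistralai/Mistral-7B-v0.1  # assumed common ancestor of both models
parameters:
  int8_mask: true
dtype: bfloat16
```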
Key Characteristics
- Architecture: Merged model based on BioMistral/BioMistral-7B and Nexusflow/Starling-LM-7B-beta.
- Parameter Count: 7 billion parameters.
- Context Length: Supports an 8192-token context window.
- Merging Method: Utilizes the `dare_ties` method for model combination, with `int8_mask` enabled and `bfloat16` dtype.
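To give some intuition for the DARE step of `dare_ties` (Drop And REscale): each model's delta from the shared base is randomly sparsified, keeping each weight with probability equal to the density (0.53 here) and rescaling survivors by 1/density so the expected contribution is preserved. This is a minimal illustrative sketch, not mergekit's actual implementation:

```python
# Illustrative sketch of the DARE sparsification step, assuming scalar deltas.
# Real implementations operate on full tensors; mergekit's code differs in detail.
import random

def dare_sparsify(delta, density, seed=0):
    """Keep each delta weight with probability `density`; rescale kept
    weights by 1/density so the expected sum of deltas is unchanged."""
    rng = random.Random(seed)
    return [d / density if rng.random() < density else 0.0 for d in delta]

# Toy per-weight deltas (fine-tuned weights minus base-model weights).
deltas = [0.2, -0.1, 0.05, 0.4, -0.3]
sparse = dare_sparsify(deltas, density=0.53)
```

After sparsification, TIES-style sign resolution and the per-model weights (0.4 and 0.3) combine the surviving deltas into the merged model.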
Usage and Licensing
The model is available under a CC-BY-NC-ND license and adheres to John Snow Labs' Acceptable Use Policy; commercial use requires separate licensing. Developers can integrate the model for text generation tasks using the Hugging Face transformers library.
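A minimal usage sketch with transformers might look like the following; the prompt, generation settings, and `device_map`/dtype choices are illustrative rather than recommendations from the model authors:

```python
# Minimal text-generation sketch for BioLing-7B-Dare using transformers.
# Loading the model downloads ~14 GB of weights on first use.
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL_ID = "johnsnowlabs/BioLing-7B-Dare"

def generate(prompt: str, max_new_tokens: int = 256) -> str:
    """Load the merged model and return a sampled completion for `prompt`."""
    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(
        MODEL_ID, torch_dtype="auto", device_map="auto"
    )
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    outputs = model.generate(
        **inputs, max_new_tokens=max_new_tokens, do_sample=True, temperature=0.7
    )
    return tokenizer.decode(outputs[0], skip_special_tokens=True)

if __name__ == "__main__":
    print(generate("What are common symptoms of iron-deficiency anemia?"))
```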
Evaluation
Evaluation results for BioLing-7B-Dare are currently pending and will be released soon.