Technoculture/BioMistral-Hermes-Dare
TEXT GENERATIONConcurrency Cost:1Model Size:7BQuant:FP8Ctx Length:4kPublished:Feb 21, 2024License:apache-2.0Architecture:Transformer Open Weights Cold
Technoculture/BioMistral-Hermes-Dare is a 7 billion parameter language model, created by Technoculture, formed by merging BioMistral/BioMistral-7B-DARE and NousResearch/Nous-Hermes-2-Mistral-7B-DPO. This model is designed for general language tasks with a focus on biomedical and conversational applications, leveraging its 4096-token context length. Its architecture combines specialized biomedical knowledge with strong instruction-following capabilities, making it suitable for diverse text generation and understanding. The model aims to provide robust performance across various benchmarks, including medical and general reasoning tasks.
Loading preview...