allenai/scitulu-7b

TEXT GENERATIONConcurrency Cost:1Model Size:7BQuant:FP8Ctx Length:4kPublished:Jun 12, 2024License:apache-2.0Architecture:Transformer0.0K Open Weights Cold

SciTulu 7B is a 7 billion parameter instruction-following language model developed by AllenAI, built upon the Tulu v2 7B architecture. It is specifically fine-tuned for scientific literature understanding, leveraging a mix of science-specific demonstrations from the SciRIFF dataset and general-domain instructions. This model achieves a 28.1% average improvement over its base model on nine scientific literature understanding tasks, making it highly effective for processing and interpreting scientific texts.

Loading preview...

SciTulu 7B: Specialized for Scientific Literature Understanding

SciTulu 7B is an instruction-following language model developed by AllenAI, designed to excel in tasks related to scientific literature. Built upon the Tulu v2 7B model, SciTulu has undergone specialized training to enhance its performance in the scientific domain.

Key Capabilities and Training:

  • Scientific Domain Expertise: SciTulu 7B is fine-tuned on a unique blend of science-specific demonstrations from the SciRIFF dataset, alongside general-domain instructions from the Tulu v2 SFT mixture.
  • Enhanced Performance: The model demonstrates a significant 28.1% average improvement over its base model, Tulu v2 7B, across nine distinct scientific literature understanding tasks.
  • Instruction Following: It is optimized for accurately following instructions, particularly within the context of scientific texts.

Use Cases:

  • Scientific Literature Analysis: Ideal for tasks requiring the understanding, summarization, or extraction of information from scientific papers and articles.
  • Research Assistance: Can aid researchers in navigating and interpreting complex scientific documentation.

Further details on its development and evaluation can be found in the associated preprint: SciRIFF: A Resource to Enhance Language Model Instruction-Following over Scientific Literature. Training and evaluation code is available on the AllenAI GitHub repository.