Tulu 7B: An Instruction-Tuned LLaMA Model

The Tulu 7B model, developed by Allen AI, is a 7 billion parameter LLaMA-based language model specifically fine-tuned for instruction following. It leverages a comprehensive blend of instruction datasets, including FLAN V2, CoT, Dolly, Open Assistant 1, GPT4-Alpaca, Code-Alpaca, and ShareGPT, to enhance its ability to understand and execute diverse commands.

Key Capabilities

Instruction Following: Excels at responding to a wide range of instructions due to its extensive training on varied instruction datasets.
Reasoning: Demonstrates capabilities in reasoning tasks, as indicated by its performance on benchmarks like MMLU and Big-Bench Hard (BBH).
Code Generation: Shows proficiency in generating code, with notable scores on Codex-Eval.
Multitask Performance: Achieves a strong average performance across multiple benchmarks, including MMLU (47.0 5-shot), GSM CoT (27.0), BBH CoT (39.2), and Codex-Eval Pass@1 (17.5).

Usage and Integration

This model is distributed as a model diff and requires a base LLaMA model for recovery. The recommended prompt template for optimal performance is:

<|user|>
Your message here!
<|assistant|>

This model is suitable for applications requiring robust instruction adherence and general-purpose language understanding, making it a versatile choice for various AI tasks.

Overview

Tulu 7B: An Instruction-Tuned LLaMA Model

Key Capabilities

Usage and Integration

Full Model Card (README)