Overview
umd-zhou-lab/recycled-alpaca-7b-v2.0 is a 7-billion-parameter auto-regressive language model developed by the UMD Tianyi Zhou Lab. It is fine-tuned from the meta-llama/Llama-2-7b base model on "recycled" Alpaca data (V2), produced by the method described in the lab's paper "Reflection-Tuning: Data Recycling Improves LLM Instruction-Tuning". The approach aims to improve the model's instruction-following ability and overall response quality.
Key Capabilities & Performance
This model shows notable improvements over the original Alpaca 7B and other 7B instruction-tuned models, particularly in instruction-following and general language understanding:
- Enhanced Instruction Following: Achieves an AlpacaEval score of 79.58, significantly outperforming the baseline Alpaca 7B (26.46) and even WizardLM 7B (67.64).
- Improved General Benchmarks: Shows higher average scores across various benchmarks, including ARC, HellaSwag, MMLU, and TruthfulQA, compared to the original Alpaca 7B.
- Efficient Fine-tuning: The model was trained with a global batch size of 128 for 3 epochs, with a maximum sequence length of 2048 tokens.
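The reported training setup can be summarized in a small configuration sketch. Only the three hyperparameter values come from this model card; the dictionary keys and structure are illustrative assumptions, not the lab's actual training script:

```python
# Fine-tuning setup reported for recycled-alpaca-7b-v2.0.
# Values are from the model card; key names are illustrative assumptions.
training_config = {
    "base_model": "meta-llama/Llama-2-7b",
    "global_batch_size": 128,   # stated in the model card
    "num_epochs": 3,            # stated in the model card
    "max_seq_length": 2048,     # stated in the model card (tokens)
}

# Derived quantity for illustration: optimizer steps per 52K-example
# Alpaca-sized dataset (52K is the original Alpaca dataset size).
steps_per_epoch = 52_000 // training_config["global_batch_size"]
print(steps_per_epoch)
```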
Use Cases
The primary intended use for recycled-alpaca-7b-v2.0 is for research in large language models and chatbots. It is particularly well-suited for:
- Researchers exploring advanced instruction-tuning techniques.
- Hobbyists and developers working on natural language processing and AI applications requiring strong instruction-following.
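Because the model is tuned on Alpaca-style instruction data, it most likely expects an Alpaca-style prompt at inference time. The sketch below uses the widely known Stanford Alpaca template; whether this exact template matches this checkpoint's training format is an assumption:

```python
# Standard Stanford Alpaca prompt template. That this exact template is what
# recycled-alpaca-7b-v2.0 expects is an assumption based on its training data.
ALPACA_TEMPLATE = (
    "Below is an instruction that describes a task. "
    "Write a response that appropriately completes the request.\n\n"
    "### Instruction:\n{instruction}\n\n### Response:\n"
)

def build_prompt(instruction: str) -> str:
    """Format a user instruction in the Alpaca style."""
    return ALPACA_TEMPLATE.format(instruction=instruction)

prompt = build_prompt("Summarize the benefits of instruction tuning.")
print(prompt)
```

The resulting string can be passed to any Hugging Face causal-LM tokenizer and `generate` call; the model's continuation after `### Response:` is the answer.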
Limitations
As an auto-regressive language model, it shares the common limitations of transformer-based architectures. The V2 Recycled Alpaca and WizardLM datasets, along with the corresponding paper, are slated for future release.