jondurbin/bagel-dpo-1.1b-v0.3
jondurbin/bagel-dpo-1.1b-v0.3 is an experimental 1.1-billion-parameter language model developed by jondurbin, fine-tuned from TinyLlama with a 2048-token context length. This model explores the impact of diverse, multi-format instruction tuning using the 'bagel' framework, incorporating a wide array of datasets ranging from reasoning to roleplay. It is primarily an experiment in instruction tuning on a small base, with the developer noting it is "basically unusable" due to the base model's limitations.
Overview of jondurbin/bagel-dpo-1.1b-v0.3
This model, developed by jondurbin, is an experimental 1.1-billion-parameter language model fine-tuned from TinyLlama. It leverages the 'bagel' framework for instruction tuning, aiming to explore the effects of training on a highly diverse set of data sources and prompt formats. The developer explicitly states that the model is "basically unusable" due to the limitations of its TinyLlama base.
Key Characteristics & Training
- Diverse Data Sources: Trained on a wide array of datasets, including ai2_arc (reasoning), airoboros (synthetic instructions), apps (Python coding), belebele (multilingual reading comprehension), cinematika (RP-style data), lmsys_chat_1m (GPT-4 chats), mathinstruct, mmlu, and slimorca (GPT-4-verified chats). Only train splits were used, with decontamination via cosine similarity.
- Multi-Format Prompting: Each instruction was converted into four different prompt formats (Alpaca, Vicuna, ChatML-ish, Llama-2 chat) and used during training. This approach aimed to improve generalization across various instruction types.
- Context Length: Supports a context length of 2048 tokens.
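The multi-format prompting above can be sketched as follows. This is a minimal illustration, not the exact templates used in bagel's training: the template strings below are common community approximations of the Alpaca, Vicuna, ChatML, and Llama-2 chat formats, and the function names are hypothetical.

```python
# Hypothetical sketch: render one instruction in the four prompt formats
# named in the model card. Exact training templates may differ.

def alpaca(instruction: str) -> str:
    return (
        "Below is an instruction that describes a task. "
        "Write a response that appropriately completes the request.\n\n"
        f"### Instruction:\n{instruction}\n\n### Response:\n"
    )

def vicuna(instruction: str) -> str:
    return (
        "A chat between a curious user and an artificial intelligence assistant.\n"
        f"USER: {instruction}\nASSISTANT: "
    )

def chatml_ish(instruction: str) -> str:
    return f"<|im_start|>user\n{instruction}<|im_end|>\n<|im_start|>assistant\n"

def llama2_chat(instruction: str) -> str:
    return f"[INST] {instruction} [/INST] "

def all_formats(instruction: str) -> list[str]:
    # Each instruction yields four training examples, one per prompt style,
    # so the model sees the same task under different surface formats.
    return [f(instruction) for f in (alpaca, vicuna, chatml_ish, llama2_chat)]

prompts = all_formats("Summarize the plot of Hamlet in two sentences.")
```

Duplicating each example across formats is what lets a single fine-tune respond sensibly regardless of which chat template the downstream user applies.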
Limitations and Usage Considerations
- Experimental Nature: The model is primarily for experimental purposes, with the developer noting its limited practical usability.
- Licensing: While the base TinyLlama model is Apache-2.0, the fine-tuning data includes content generated by OpenAI's GPT-4. Users should exercise caution and seek legal advice regarding commercial viability, as the implications of OpenAI's ToS on derivative models are complex and not definitively settled.