koutch/short_paper_llama_0.json_train_dpo_v1_dev

Text Generation · Concurrency Cost: 1 · Model Size: 8B · Quant: FP8 · Ctx Length: 32k · Published: Jan 6, 2026 · License: apache-2.0 · Architecture: Transformer · Open Weights

The koutch/short_paper_llama_0.json_train_dpo_v1_dev model is an 8-billion-parameter, instruction-tuned causal language model based on Llama 3.1 and developed by koutch. It was fine-tuned using Unsloth together with Hugging Face's TRL library, enabling roughly 2x faster training. The model targets general language understanding and generation tasks, building on its Llama 3.1 foundation.
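As a quick reference, the checkpoint should load through the standard Transformers API. This is a minimal sketch using the repo id from this card; it assumes the checkpoint is publicly available and that your environment has a backend compatible with the FP8 weights (plus accelerate for `device_map`).

```python
# Minimal loading sketch; the repo id is taken from this card, everything
# else is standard Transformers usage.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "koutch/short_paper_llama_0.json_train_dpo_v1_dev"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype="auto",  # keep the checkpoint's stored dtype (FP8 per this card)
    device_map="auto",   # requires accelerate; shards weights across available GPUs
)
```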


Overview

This model is an 8-billion-parameter instruction-tuned variant of the Llama 3.1 architecture, fine-tuned from unsloth/meta-llama-3.1-8b-instruct-bnb-4bit.

Key Characteristics

  • Base Model: Llama 3.1-8B-Instruct
  • Training Efficiency: Fine-tuned using Unsloth and Hugging Face's TRL library, yielding roughly 2x faster training than a standard Transformers loop; see the training sketch after this list.
  • License: Released under the Apache-2.0 license.
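The card names the tools but does not publish the training script, so the following is a minimal sketch of what an Unsloth + TRL run might look like, assuming a DPO recipe (suggested by the "dpo" in the model name). The dataset file, LoRA settings, and hyperparameters are illustrative placeholders, not the author's actual configuration.

```python
# Hedged sketch of an Unsloth + TRL DPO fine-tune; all hyperparameters and
# the dataset below are assumptions for illustration.
from unsloth import FastLanguageModel  # import unsloth before trl/transformers
from trl import DPOConfig, DPOTrainer
from datasets import load_dataset

# Load the 4-bit base model named on this card via Unsloth's fast loader.
model, tokenizer = FastLanguageModel.from_pretrained(
    model_name="unsloth/meta-llama-3.1-8b-instruct-bnb-4bit",
    max_seq_length=32768,  # matches the 32k context on this card
    load_in_4bit=True,
)

# Attach LoRA adapters (rank/targets are placeholder values).
model = FastLanguageModel.get_peft_model(
    model,
    r=16,
    lora_alpha=16,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj",
                    "gate_proj", "up_proj", "down_proj"],
)

# A preference dataset with prompt/chosen/rejected columns is assumed.
dataset = load_dataset("json", data_files="preferences.json", split="train")

trainer = DPOTrainer(
    model=model,
    args=DPOConfig(output_dir="dpo_out", beta=0.1,
                   per_device_train_batch_size=2, num_train_epochs=1),
    train_dataset=dataset,
    processing_class=tokenizer,  # argument name in recent TRL releases
)
trainer.train()
```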

Use Cases

This model is suitable for natural language processing tasks that benefit from an instruction-tuned Llama 3.1 base, particularly where efficient training methods matter. Its 8 billion parameters and 32,768-token (32k) context length make it versatile for general text generation, summarization, and question answering, as in the example below.
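For example, a summarization or question-answering call might look like the following. The prompt and generation settings are illustrative, and a recent Transformers version with chat support in the text-generation pipeline is assumed.

```python
# Illustrative inference sketch using the text-generation pipeline.
from transformers import pipeline

generator = pipeline(
    "text-generation",
    model="koutch/short_paper_llama_0.json_train_dpo_v1_dev",
    device_map="auto",
)

messages = [{"role": "user",
             "content": "Summarize direct preference optimization in two sentences."}]
output = generator(messages, max_new_tokens=128)
print(output[0]["generated_text"][-1]["content"])  # assistant reply
```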