koutch/short_paper_llama_2.json_train_dpo_v1_train_no_think
Text Generation | Concurrency Cost: 1 | Model Size: 8B | Quant: FP8 | Ctx Length: 32k | Published: Jan 14, 2026 | License: apache-2.0 | Architecture: Transformer | Open Weights | Cold
The koutch/short_paper_llama_2.json_train_dpo_v1_train_no_think model is an 8-billion-parameter Llama 3.1 instruction-tuned language model developed by koutch. It was finetuned with Unsloth and Hugging Face's TRL library, a combination reported to train roughly 2x faster than a standard setup, and is intended for general language generation tasks.
Model Overview
This model, developed by koutch, is an 8-billion-parameter instruction-tuned language model based on the Llama 3.1 architecture. It was finetuned from unsloth/meta-llama-3.1-8b-instruct-bnb-4bit.
Key Characteristics
- Architecture: Llama 3.1
- Parameter Count: 8 billion
- Context Length: 32768 tokens
- Training Efficiency: Finetuned with Unsloth and Hugging Face's TRL library, reported to train roughly 2x faster than standard methods (a hypothetical training sketch follows this list).
- License: Apache-2.0
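To make the training setup concrete: the "dpo" in the model name suggests DPO preference finetuning, so the sketch below shows what an Unsloth + TRL DPO run over the named 4-bit base could look like. This is a hypothetical reconstruction, not the author's recipe; the toy dataset, LoRA rank, and hyperparameters are illustrative assumptions, and the DPOTrainer keyword for the tokenizer varies across TRL versions.

```python
# Hypothetical sketch of an Unsloth + TRL DPO finetune; NOT the author's
# actual recipe. Dataset and hyperparameters are illustrative only.
from unsloth import FastLanguageModel
from datasets import Dataset
from trl import DPOConfig, DPOTrainer

# Load the 4-bit base named on this card, with Unsloth's fast kernels.
model, tokenizer = FastLanguageModel.from_pretrained(
    model_name="unsloth/meta-llama-3.1-8b-instruct-bnb-4bit",
    max_seq_length=2048,
    load_in_4bit=True,
)

# Attach LoRA adapters so only a small set of weights is trained.
model = FastLanguageModel.get_peft_model(
    model,
    r=16,
    lora_alpha=16,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj",
                    "gate_proj", "up_proj", "down_proj"],
)

# DPO expects prompt/chosen/rejected triples; a toy example stands in here.
train_dataset = Dataset.from_list([{
    "prompt": "Summarize: LoRA adapts a frozen model with low-rank updates.",
    "chosen": "LoRA trains small low-rank matrices on top of frozen weights.",
    "rejected": "LoRA retrains every weight in the model from scratch.",
}])

trainer = DPOTrainer(
    model=model,
    args=DPOConfig(output_dir="dpo_out", per_device_train_batch_size=1,
                   max_steps=10, beta=0.1),
    train_dataset=train_dataset,
    processing_class=tokenizer,  # older TRL versions use tokenizer=
)
trainer.train()
```

Keeping the base in 4-bit and training only LoRA adapters is what lets an 8B model like this be finetuned on a single GPU; Unsloth's custom kernels supply the reported speedup on top of that.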
Potential Use Cases
This model is suitable for a variety of general-purpose language generation and instruction-following tasks, benefiting from its Llama 3.1 base and efficient finetuning.
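For instruction-following use, a minimal inference sketch with the standard transformers chat-template API is shown below. It assumes the checkpoint can be downloaded from the Hugging Face Hub under this card's model id, and it loads the weights in bf16 rather than the FP8 serving quantization listed above; the prompt is illustrative.

```python
# Minimal inference sketch; assumes the checkpoint is available on the
# Hugging Face Hub under the id shown on this card.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "koutch/short_paper_llama_2.json_train_dpo_v1_train_no_think"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,
    device_map="auto",
)

# Llama 3.1 instruct models are prompted through a chat template.
messages = [{"role": "user", "content": "Explain DPO finetuning in two sentences."}]
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

output = model.generate(input_ids, max_new_tokens=128, do_sample=False)
# Decode only the newly generated tokens, not the prompt.
print(tokenizer.decode(output[0][input_ids.shape[-1]:], skip_special_tokens=True))
```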