allenai/Llama-3.1-Tulu-3-8B-SFT
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:32kPublished:Nov 18, 2024License:llama3.1Architecture:Transformer0.0K Warm

The allenai/Llama-3.1-Tulu-3-8B-SFT is an 8 billion parameter instruction-following model developed by Allen Institute for AI, fine-tuned from Meta's Llama 3.1 base model. It is part of the Tülu3 family, which provides fully open-source data, code, and recipes for post-training techniques. This model is designed for strong performance across diverse tasks, including chat, mathematical reasoning (MATH, GSM8K), and instruction following (IFEval), with a context length of 32768 tokens.

Loading preview...

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p