refuelai/Llama-3-Refueled

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:8kPublished:May 3, 2024License:cc-by-nc-4.0Architecture:Transformer0.2K Open Weights Warm

Llama-3-Refueled is an 8 billion parameter Llama3-based instruction-tuned model developed by Refuel AI, optimized for a wide range of NLP tasks. It was fine-tuned on over 2750 datasets, excelling in classification, reading comprehension, structured attribute extraction, and entity resolution. With an 8192-token context length, it demonstrates strong performance across various labeling tasks, often outperforming larger models like Llama3-70B-Instruct and GPT-3.5-Turbo in specific benchmarks. This model is particularly suited for applications requiring robust text understanding and data extraction capabilities.

Loading preview...

Llama-3-Refueled: An Instruction-Tuned Llama3-8B Model

Refuel AI's Llama-3-Refueled is an 8 billion parameter language model built upon the Llama3-8B base architecture. This model has undergone extensive instruction tuning using a diverse corpus of over 2750 datasets, encompassing more than 4 billion tokens. The training data includes human-annotated sources like Flan, Task Source, and Aya, alongside synthetic datasets such as OpenOrca, OpenHermes, and WizardLM, as well as proprietary datasets from Refuel AI.

Key Capabilities

  • Broad NLP Task Proficiency: Excels across a wide array of natural language processing tasks.
  • Specialized Data Labeling: Demonstrates strong performance in classification, reading comprehension, structured attribute extraction, and entity resolution.
  • Competitive Benchmarking: Achieves an overall quality score of 79.67% on Refuel's labeling task benchmarks, outperforming Llama3-70B-Instruct (78.20%) and GPT-3.5-Turbo (68.13%).
  • Optimized Transformer Architecture: Utilizes an auto-regressive language model with an optimized transformer architecture.

Good For

  • Text Classification: Identifying categories or sentiments within text.
  • Reading Comprehension: Answering questions based on provided text passages.
  • Structured Data Extraction: Extracting specific attributes or information from unstructured text.
  • Entity Resolution: Identifying and linking mentions of the same real-world entity.
  • General Instruction Following: Responding to a variety of text-based prompts and instructions.

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p