tomg-group-umd/zephyr-llama3-8b-sft-refusal-n-contrast-multiple-tokens

Text generation · Concurrency cost: 1 · Model size: 8B · Quant: FP8 · Context length: 8K · Published: Jul 22, 2025 · Architecture: Transformer

The tomg-group-umd/zephyr-llama3-8b-sft-refusal-n-contrast-multiple-tokens model is an 8 billion parameter language model fine-tuned from Meta-Llama-3-8B. It is trained to handle refusals and contrastive responses, using distinct refusal tokens for each of five refusal categories. This design suits scenarios that require fine-grained control over model behavior, particularly refusal handling and contrastive generation tasks.


Overview

This model, developed by tomg-group-umd, is an 8 billion parameter language model built on the meta-llama/Meta-Llama-3-8B architecture. It has been fine-tuned on a combination of UltraChat SFT data paired with a respond token, CoCoNoT refusals paired with a refuse token, and CoCoNoT contrast data. Its key distinguishing feature is the use of multiple refusal tokens, one for each of five distinct refusal categories.

Key Capabilities

  • Refusal Handling: Incorporates specific refusal tokens to manage and categorize model refusals.
  • Contrastive Generation: Fine-tuned with contrast data to generate nuanced and contrasting responses.
  • Fine-tuned from Llama 3: Leverages the strong base capabilities of the Llama 3 8B model.

Usage Recommendations

For best results, refer to the code examples in the associated repository, particularly the coconot_eval folder. Following that guidance is essential for correctly using the model's refusal and contrast token mechanisms.
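The control-token scheme described above can be sketched with a small post-processing helper. This is an illustration only, not the repository's actual code: the token strings ([RESPOND], [REFUSE_1] through [REFUSE_5]) are hypothetical placeholders, since the real strings are defined in the coconot_eval folder, and the helper merely classifies already-generated text by its leading control token.

```python
# Hypothetical control tokens; the actual strings are defined in the
# repository's coconot_eval code, not here.
RESPOND_TOKEN = "[RESPOND]"
REFUSAL_TOKENS = {f"[REFUSE_{i}]": f"category_{i}" for i in range(1, 6)}

def classify_generation(text: str) -> tuple[str, str]:
    """Return (kind, cleaned_text) based on the leading control token.

    kind is "respond", a refusal category name, or "unknown" when no
    control token appears at the start of the generation.
    """
    stripped = text.lstrip()
    if stripped.startswith(RESPOND_TOKEN):
        return "respond", stripped[len(RESPOND_TOKEN):].strip()
    for token, category in REFUSAL_TOKENS.items():
        if stripped.startswith(token):
            return category, stripped[len(token):].strip()
    return "unknown", stripped

# Example: a generation beginning with a category-2 refusal token.
kind, body = classify_generation("[REFUSE_2] I can't help with that request.")
```

In practice the raw text would come from `model.generate` via the Hugging Face transformers API, with the model's control tokens already present in its tokenizer vocabulary.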