Name: WisdomShell/ADG-WizardLM-LLaMa3-8B API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: WisdomShell

Overview of ADG-WizardLM-LLaMa3-8B

WisdomShell/ADG-WizardLM-LLaMa3-8B is an 8 billion parameter LLaMa3-based model that utilizes the Answer Divergence-Guided Selection (ADG) method for instruction data selection. Developed by Bo Li, Mingda Wang, Shikun Zhang, and Wei Ye, this approach focuses on improving instruction tuning quality by selecting the most impactful examples under a fixed data budget. Unlike traditional methods that rely on a single reference response, ADG scores instructions by analyzing the geometric structure of multiple answers sampled from a base model using stochastic decoding.

Key Capabilities & Methodology

Geometry-Aware Scoring: ADG samples multiple answers for each instruction, maps them into a representation space, and computes scores based on their dispersion magnitude and shape anisotropy.
Bin-wise Selection: It performs proportional selection within semantic bins to ensure broad semantic coverage.
Improved Instruction Tuning: The method consistently enhances instruction tuning performance across various benchmarks, including reasoning, knowledge, and coding tasks.
Practical Pipeline: The repository provides a complete pipeline for multi-sample answer generation, instruction embedding and clustering, ADG scoring and subset selection, model training, and benchmark evaluation.

Good for

Researchers and Developers interested in advanced instruction data selection techniques.
Improving LLM performance on reasoning, knowledge, and coding tasks with limited data budgets.
Understanding and implementing a novel approach to data curation for instruction tuning.

Overview

Overview of ADG-WizardLM-LLaMa3-8B

Key Capabilities & Methodology

Good for

Full Model Card (README)