Name: WisdomShell/ADG-CoT-LLaMa3-8B API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: WisdomShell

Overview of WisdomShell/ADG-CoT-LLaMa3-8B

WisdomShell/ADG-CoT-LLaMa3-8B is an 8 billion parameter LLaMa3-based model that leverages the Answer Divergence-Guided Selection (ADG) method for instruction data selection. Developed by WisdomShell, this approach significantly enhances instruction tuning quality by evaluating the geometric structure of multiple sampled answers for each instruction, rather than relying on a single reference response. This method was presented at ACL 2026.

Key Capabilities & Features

Geometry-Aware Data Selection: ADG scores instructions based on the dispersion magnitude and shape anisotropy of multiple high-temperature sampled answers, providing a more robust quality assessment.
Improved Instruction Tuning: Consistently improves model performance across reasoning, knowledge, and coding benchmarks under a fixed 10K data budget.
Semantic Coverage: Employs bin-wise selection to ensure semantic diversity and coverage within the selected instruction subset.
Practical Pipeline: The repository provides a complete pipeline for multi-sample answer generation, instruction embedding and clustering, ADG scoring, subset selection, model training, and benchmark evaluation.

Ideal Use Cases

Optimizing Instruction Datasets: Developers looking to select the most impactful instruction examples from large pools for fine-tuning.
Enhancing Reasoning Tasks: Applications requiring strong performance in complex reasoning, mathematical, and logical problem-solving.
Improving Code Generation: Scenarios where high-quality code generation and understanding are critical.
Knowledge-Intensive Applications: Use cases demanding accurate and comprehensive knowledge recall and application.

Overview

Overview of WisdomShell/ADG-CoT-LLaMa3-8B

Key Capabilities & Features

Ideal Use Cases

Full Model Card (README)