SakanaAI/Llama-3-8B-Instruct-Coding-Expert

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:8kPublished:Jul 29, 2024License:llama3Architecture:Transformer0.0K Warm

SakanaAI/Llama-3-8B-Instruct-Coding-Expert is an 8 billion parameter instruction-tuned language model developed by Sakana AI, based on the Llama-3-8B-Instruct architecture. This model is specifically fine-tuned for coding tasks, leveraging datasets like Magicoder-Evol-Instruct-110K and Magicoder-OSS-Instruct-75K. It serves as a specialized component within the broader Llama-3-CycleQD agentic LLM collection, designed to excel in code generation and understanding.

Loading preview...

SakanaAI/Llama-3-8B-Instruct-Coding-Expert Overview

This model is an 8 billion parameter instruction-tuned language model developed by Sakana AI, built upon the Meta Llama 3 architecture. It is a specialized component within the larger Llama-3-CycleQD collection of agentic LLMs, which also includes DB and OS expert models. The model's development utilized the CycleQD method, as detailed in the associated paper.

Key Capabilities

  • Specialized Coding Expertise: Fine-tuned specifically for coding tasks, leveraging datasets such as Agent-FLAN, Magicoder-Evol-Instruct-110K, and Magicoder-OSS-Instruct-75K.
  • Llama 3 Foundation: Benefits from the robust base capabilities of the Llama-3-8B-Instruct model.
  • Research Prototype: Intended for research and development purposes, serving as an experimental prototype.

Good For

  • Code Generation: Generating code snippets or functions based on instructions.
  • Code Understanding: Assisting with interpreting or debugging code.
  • Agentic Workflows: Integration into multi-agent systems where a dedicated coding expert is required.
  • Research & Experimentation: Exploring advanced fine-tuning techniques for domain-specific LLMs, particularly in the coding domain.

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p