Menlo/llama3-s-instruct-v0.2
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:32kPublished:Aug 20, 2024License:apache-2.0Architecture:Transformer0.0K Open Weights Cold

Menlo/llama3-s-instruct-v0.2 is an 8 billion parameter Llama-3 architecture model developed by Homebrew Research, featuring a 32768 token context length. This model is uniquely designed for multimodal input, natively understanding both audio and text. It expands on semantic token experiments using WhisperVQ as an audio tokenizer, making it specialized for sound understanding capabilities.

Loading preview...