nnethercott/llava-v1.5-7b-hf-vicuna
TEXT GENERATIONConcurrency Cost:1Model Size:7BQuant:FP8Ctx Length:4kPublished:Feb 23, 2024License:llama2Architecture:Transformer Open Weights Cold

nnethercott/llava-v1.5-7b-hf-vicuna is a 7 billion parameter vision-language model, fine-tuned from LLaMA/Vicuna, designed for multimodal instruction-following tasks. This model integrates visual understanding with language generation, making it capable of processing and responding to queries based on both text and images. It is specifically intended for LLM benchmarking, offering a robust foundation for evaluating multimodal AI performance.

Loading preview...