hem007/Llama-2-7b-chat-finetune
TEXT GENERATIONConcurrency Cost:1Model Size:7BQuant:FP8Ctx Length:4kArchitecture:Transformer Cold

The hem007/Llama-2-7b-chat-finetune model is a 7 billion parameter language model based on the Llama-2 architecture, fine-tuned for chat-based interactions. It supports a context length of 4096 tokens. This model is designed for generating conversational responses and can be used for various interactive text generation tasks.

Loading preview...