tokyotech-llm/Swallow-7b-NVE-hf
Text Generation | Concurrency Cost: 1 | Model Size: 7B | Quant: FP8 | Ctx Length: 4k | Published: Nov 30, 2023 | License: llama2 | Architecture: Transformer | Open Weights
Swallow-7b-NVE-hf, from TokyoTech-LLM, is a 7-billion-parameter language model continually pre-trained from Llama 2 on Japanese language data. 'NVE' stands for No Vocabulary Expansion: this variant improves Japanese capability while leaving the original Llama 2 tokenizer vocabulary unchanged. It performs strongly on a range of Japanese NLP tasks, often outperforming its Llama 2 base, while remaining competitive on English tasks.
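As a rough illustration of how a model like this is typically used, the sketch below loads it with the Hugging Face `transformers` library and continues a plain-text Japanese prompt. It assumes that the weights are available on the Hugging Face Hub under the model ID above, that `torch` and `transformers` are installed, and that there is enough memory for a 7B model in bfloat16. The `generate` helper, the example prompt, and the generation settings are hypothetical choices for illustration, not the publisher's recommended configuration, and the sketch does not reproduce the FP8 quantization listed above.

```python
MODEL_ID = "tokyotech-llm/Swallow-7b-NVE-hf"


def generate(prompt: str, max_new_tokens: int = 64) -> str:
    """Continue `prompt` with the base model (hypothetical helper)."""
    # Imported lazily so the module can be imported without the heavy
    # dependencies; they are only needed once generation is requested.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    # NVE keeps the original Llama 2 vocabulary, so the tokenizer shipped
    # with the model is loaded as-is.
    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    # Assumption: bfloat16 weights, chosen here only to reduce memory use.
    model = AutoModelForCausalLM.from_pretrained(
        MODEL_ID, torch_dtype=torch.bfloat16
    )
    model.eval()

    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    with torch.no_grad():
        output_ids = model.generate(
            **inputs,
            max_new_tokens=max_new_tokens,
            do_sample=False,  # greedy decoding, for repeatable output
        )
    # The decoded string contains the prompt followed by the continuation.
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)


if __name__ == "__main__":
    # This is a base (non-instruction-tuned) model, so the prompt is plain
    # text to be continued rather than a chat-style instruction.
    print(generate("東京工業大学の主なキャンパスは、"))
```

Because the prompt and its continuation must fit in the 4k context length listed above, long inputs need to be truncated or split before calling the helper.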