SakanaAI/TinySwallow-1.5B-Instruct
Task: Text generation
Concurrency cost: 1
Model size: 1.5B
Quantization: BF16
Context length: 32K
Published: Jan 7, 2025
License: apache-2.0
Architecture: Transformer
Availability: Open weights

TinySwallow-1.5B-Instruct is a 1.5-billion-parameter instruction-tuned causal language model developed by Sakana AI and optimized specifically for Japanese language tasks. It was created with Temporally Adaptive Interpolated Distillation (TAID), using Qwen2.5-32B-Instruct as the teacher model and Qwen2.5-1.5B-Instruct as the student. The model excels at following instructions and holding conversations in Japanese, leveraging its 131,072-token maximum context length (served at 32K here, per the listing above) for complex interactions.
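As a minimal sketch of how the model might be run locally, the snippet below loads it with Hugging Face transformers and applies its chat template. The Japanese prompt and the generation settings are illustrative assumptions, not values taken from this page.

```python
# Minimal sketch: running TinySwallow-1.5B-Instruct with Hugging Face transformers.
# The prompt text and generation settings below are illustrative assumptions.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "SakanaAI/TinySwallow-1.5B-Instruct"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # matches the BF16 quantization listed above
    device_map="auto",
)

# Build a chat prompt; the model is instruction-tuned for Japanese.
messages = [
    {"role": "user", "content": "日本の四季について簡単に説明してください。"}
]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

outputs = model.generate(inputs, max_new_tokens=256, do_sample=True, temperature=0.7)
# Decode only the newly generated tokens, skipping the prompt.
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```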


Popular Sampler Settings

The three most popular sampler configurations among Featherless users for this model tune the following parameters:

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p
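
As a hedged sketch of where these parameters plug in, the snippet below sends a request through an OpenAI-compatible client. The base URL, API key placeholder, and every parameter value are assumptions, not the actual top configurations; temperature, top_p, frequency_penalty, and presence_penalty are standard OpenAI fields, while top_k, repetition_penalty, and min_p are vendor extensions that many OpenAI-compatible servers accept via extra_body.

```python
# Sketch: passing sampler settings to an OpenAI-compatible endpoint.
# Assumptions: the base URL and all parameter values are placeholders;
# top_k / repetition_penalty / min_p are non-standard fields sent via extra_body.
from openai import OpenAI

client = OpenAI(
    base_url="https://api.featherless.ai/v1",  # assumed OpenAI-compatible endpoint
    api_key="YOUR_API_KEY",
)

response = client.chat.completions.create(
    model="SakanaAI/TinySwallow-1.5B-Instruct",
    messages=[{"role": "user", "content": "こんにちは!自己紹介してください。"}],
    # Standard OpenAI sampler fields:
    temperature=0.7,
    top_p=0.9,
    frequency_penalty=0.0,
    presence_penalty=0.0,
    # Extension fields, forwarded verbatim in the request body:
    extra_body={"top_k": 40, "repetition_penalty": 1.05, "min_p": 0.05},
)
print(response.choices[0].message.content)
```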