KaraKaraWitch/oiiaioiiai-B
Hugging Face
TEXT GENERATIONConcurrency Cost:4Model Size:70BQuant:FP8Ctx Length:32kArchitecture:Transformer0.0K Warm

KaraKaraWitch/oiiaioiiai-B is a 70 billion parameter merged language model created by KaraKaraWitch, utilizing the TIES merge method with ReadyArt/The-Omega-Directive-L-70B-v1.0 as its base. This model is specifically noted for its proficiency in Japanese to English translation and its ability to generate dialogue, incorporating elements from various merged models including AISG's SEA Lion for enhanced Southeast Asian language support. It tends to produce Wikipedia-like content for idea crafting and is characterized by a 'weeb' aesthetic in its writing style.

Loading preview...

Model Overview

KaraKaraWitch/oiiaioiiai-B is a 70 billion parameter merged language model developed by KaraKaraWitch, built upon the ReadyArt/The-Omega-Directive-L-70B-v1.0 base using the TIES merge method. This model is the result of combining numerous other models, including aisingapore/Llama-SEA-LION-v3-70B-IT, nbeerbower/Llama3-Asobi-70B, and LatitudeGames/Wayfarer-Large-70B-Llama-3.3, among others. The merging process aimed to consolidate diverse capabilities, particularly focusing on improving translation and creative writing aspects.

Key Capabilities

  • Japanese to English Translation: The model demonstrates notable proficiency in translating Japanese text to English, a key differentiator from previous models by the creator.
  • Multilingual Support: Incorporates AISG's SEA Lion models, theoretically enhancing performance across Burmese, Chinese, English, Filipino, Indonesian, Javanese, Khmer, Lao, Malay, Sundanese, Tamil, Thai, and Vietnamese.
  • Dialogue Generation: Capable of generating dialogue, likely influenced by models like Wayfarer.
  • Content Generation Style: Tends to produce Wikipedia-like content when prompted for idea crafting and exhibits a 'weeb' aesthetic, often stretching out expressions seen in visual novels.
  • Prompt Responsiveness: Designed to be more 'direct' and attentive to prompt instructions.

Good For

  • Users requiring Japanese to English translation capabilities.
  • Applications involving dialogue generation or creative writing with a specific stylistic flair.
  • Scenarios where multilingual support for Southeast Asian languages is beneficial.
  • Experimentation with merged models, particularly for those interested in the TIES merge method and its outcomes.
Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p