RudraChakrin/Llama-3.1-8B-Instruct-TTS-Phonetic-Denglish

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:8kPublished:May 15, 2026License:llama3.1Architecture:Transformer Warm

RudraChakrin/Llama-3.1-8B-Instruct-TTS-Phonetic-Denglish is an 8 billion parameter Llama 3.1-based instruction-tuned model developed by Sven Rein (RudraChakrin). This model functions as a highly specialized phonetic preprocessor for German Text-to-Speech (TTS) systems, specifically optimized for end-to-end architectures like Piper using the Thorsten-Voice. It excels at intelligently code-switching and phonetically transcribing English and "Denglish" terms within German text, ensuring smooth pronunciation for TTS engines. The model also handles context-based expansion of numbers, units, and abbreviations, making it ideal for improving the naturalness of German speech synthesis in mixed-language contexts.

Loading preview...

Overview

This Llama 3.1-based 8 billion parameter model, developed by Sven Rein (RudraChakrin), is a highly specialized phonetic preprocessor for German Text-to-Speech (TTS) systems. It's specifically optimized for architectures like Piper using the Thorsten-Voice, aiming to resolve pronunciation issues caused by English words, abbreviations, and numbers in modern "Denglish" texts. It is not a chatbot and will only perform phonetic transcription.

Key Capabilities

  • Intelligent Code-Switching: Preserves standard German text while phonetically translating English and "Denglish" terms (e.g., download to daunlohd).
  • Context-Based Unit Expansion: Expands abbreviations and symbols (e.g., 12,5cm becomes 12 Komma 5 Zenti Meter in German context, or Twällw-Päunt-Feihw ßänti Mieters in English).
  • Decimal Point Splitting: Separates decimal places for individual digit pronunciation by TTS engines.
  • Zero-Shot Resilience: Safely passes through complex alphanumeric strings like IPv4/IPv6 addresses without transcription.

Good for

  • Improving German TTS Quality: Ensures natural pronunciation of mixed German and English content.
  • Developers using Piper/Thorsten-Voice: Provides pre-processed text optimized for these specific TTS systems.
  • Handling Technical & "Denglish" Content: Ideal for texts containing English loanwords, technical jargon, and units within a German context.

Limitations

Known issues include occasional mispronunciations, challenges with the German "ß" at word beginnings, the English "TH" sound, heteronyms, and some unit confusion, especially in quantized versions. It does not support source code, complex mathematical formulas, or French loanwords in this alpha release. For best precision with numbers and units, the unquantized 16-bit model (BF16) is recommended.