tabularisai/Faust-1
Faust-1 by tabularisai is a 1.6 billion parameter German-first decoder-only causal language model, trained from scratch on a predominantly German corpus. It features a custom tokenizer optimized for German morphology, resulting in higher token efficiency for German text. Designed for local and cost-efficient deployment, Faust-1 excels in German conversational tasks and research, running effectively on consumer-grade hardware.
Loading preview...
tabularisai/Faust-1: A German-First LLM for Efficient Local Deployment
Faust-1 is a 1.6 billion parameter decoder-only causal language model developed by tabularisai, trained entirely from scratch with a German-dominant corpus (approximately 90% German). This model prioritizes German syntax, morphology, and reasoning patterns, making it a specialized foundation model for the German language.
Key Capabilities & Features
- German-First Design: Trained on a predominantly German corpus, capturing specific linguistic regularities.
- Optimized Tokenizer: Utilizes a custom tokenizer specifically designed for German morphology and compounding, leading to improved token efficiency and lower inference costs for German text.
- Synthetic Data Training: Incorporates a substantial portion of verified synthetic data, ensuring broad coverage of instruction-following and reasoning patterns with quality control.
- Instruction Tuning: Undergoes supervised post-training and Direct Preference Optimization (DPO) to enhance conversational and task-oriented performance, stability, and alignment with human expectations.
- Resource Efficient: Deliberately sized and optimized to run on consumer-grade hardware, making it suitable for local and cost-efficient deployment without requiring expensive data-center GPUs.
Ideal Use Cases
- German Conversational Assistants: Designed for natural and effective interaction in German.
- German NLP Research: A valuable tool for research and benchmarking on German natural language processing tasks.
- Local & Privacy-Sensitive Deployments: Its efficiency allows for on-device or edge experimentation where data privacy and cost are critical.