LLäMmlein 7B: A German-Centric LLaMa Model
LLäMmlein 7B is a 7 billion parameter language model developed by LSX-UniWue, built on an adapted TinyLlama codebase. Unlike many general-purpose LLMs, it was trained from scratch exclusively on the German portion of the RedPajama V2 dataset. To ensure high data quality, the training data underwent rigorous deduplication at the paragraph level and was further filtered using a token-to-word ratio heuristic.
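The preprocessing described above can be illustrated with a minimal sketch. The exact deduplication method, tokenizer, and ratio threshold used for LLäMmlein are not specified here; the `max_ratio` value and helper names below are illustrative assumptions.

```python
# Sketch: paragraph-level deduplication plus a token-to-word ratio filter.
# Threshold and tokenizer choice are assumptions, not the actual pipeline.

def dedup_paragraphs(paragraphs):
    """Keep only the first occurrence of each exact paragraph."""
    seen = set()
    out = []
    for p in paragraphs:
        key = p.strip()
        if key and key not in seen:
            seen.add(key)
            out.append(p)
    return out

def token_to_word_ratio(text, tokenize):
    """Ratio of tokens to whitespace-separated words; unusually high
    values often indicate noisy or non-natural-language text."""
    words = text.split()
    if not words:
        return float("inf")
    return len(tokenize(text)) / len(words)

def keep(text, tokenize, max_ratio=3.0):  # illustrative threshold
    return token_to_word_ratio(text, tokenize) <= max_ratio
```

With a subword tokenizer, ordinary German prose yields a low ratio and is kept, while boilerplate or code-like fragments that shatter into many tokens per word are dropped.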
Key Capabilities
- German Language Specialization: Optimized for understanding and generating text in German, making it suitable for German-specific NLP tasks.
- Data Quality Focus: Training on a carefully curated and filtered German dataset aims to improve linguistic accuracy and relevance for German content.
- Intermediate Checkpoints: Provides access to intermediate training checkpoints, allowing researchers to analyze the model's learning progression and experiment with different stages of training.
Good For
- German NLP Applications: Ideal for use cases requiring robust performance in German, such as content generation, translation, summarization, or chatbots.
- Research and Development: The availability of intermediate checkpoints and the detailed data logging (including data_id and file_id for each iteration) make it valuable for researchers studying model training dynamics and data influence.
- Comparative Studies: Can serve as a strong baseline or comparison point for other German language models due to its focused training methodology.
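For applying the model, loading typically goes through the Hugging Face transformers library. The repo id below and the idea that intermediate checkpoints are published as revisions on the same repo are assumptions; check the LSX-UniWue page on the Hugging Face Hub for the actual identifiers.

```python
# Sketch of loading LLäMmlein 7B with transformers.
# "LSX-UniWue/LLaMmlein_7B" is an assumed repo id, not verified here.
from transformers import AutoModelForCausalLM, AutoTokenizer

def load_llammlein(repo_id="LSX-UniWue/LLaMmlein_7B", revision=None):
    """Load tokenizer and model; pass `revision` to select an
    intermediate training checkpoint (assumed to be exposed as a
    branch or tag on the repo)."""
    tokenizer = AutoTokenizer.from_pretrained(repo_id, revision=revision)
    model = AutoModelForCausalLM.from_pretrained(repo_id, revision=revision)
    return tokenizer, model
```

Passing different `revision` values would let researchers compare the model at different stages of training, which is the main draw of the published intermediate checkpoints.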