Name: xaviergillard/digita API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: xaviergillard

Model Overview

xaviergillard/digita is an 8 billion parameter conversational language model, fine-tuned from unsloth/Meta-Llama-3.1-8B-Instruct-bnb-4bit using QLoRA. Developed by Xavier Gillard and funded by BELSPO under the Arkey project, its primary purpose is to prototype a chatbot that can interact and respond to user requests in a manner consistent with the Belgian State Archive's customer support personnel. The model was trained on a proprietary dataset derived from the "digit" mailbox of the DiVa section, focusing on emulating human-like responses in an archival context.

Key Capabilities

Specialized Conversational AI: Fine-tuned to mimic the communication style of the Belgian State Archive's customer support.
Multilingual Support: Capable of processing and generating responses in Dutch, French, German, and English.
Proof-of-Concept: Demonstrates the feasibility of creating domain-specific chatbots for public institutions using fine-tuning techniques.

Limitations and Recommendations

Digita's training data spans the 2022-2024 period, meaning some information may be outdated, particularly regarding the AGATHA platform. The 8B parameter size is noted as a potential limitation for broader applications. The developer recommends re-finetuning with a larger model and a more recent dataset for serious deployment within the Belgian State Archive, highlighting this model as a strong proof-of-concept for future development. All training scripts and notebooks (excluding the dataset) are available on GitHub.

Overview

Model Overview

Key Capabilities

Limitations and Recommendations

Full Model Card (README)