Name: Havoc999/tiny-chatbot API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: Havoc999

Havoc999/tiny-chatbot: A LoRA Fine-Tuned Conversational Assistant

This model is a 1.1 billion parameter conversational assistant developed by Havoc999. It was created by applying LoRA (rank 16) fine-tuning to the TinyLlama-1.1B-Chat-v1.0 base model using the tatsu-lab/alpaca instruction dataset, which comprises 52,000 English instruction-response pairs. The fine-tuning was performed on a Kaggle Dual T4 GPU environment.

Key Capabilities & Performance

Instruction Following: Designed to respond to English instructions, leveraging the Alpaca dataset's diverse prompts.
Commonsense Reasoning: Achieves a normalized accuracy of 56.00% on the HellaSwag benchmark, indicating better-than-random general language understanding.
Physical Intuition: Scores 74.00% normalized accuracy on PIQA, demonstrating solid everyday procedural knowledge for its size.
Lightweight: At 1.1 billion parameters, it offers a compact solution for conversational AI.

Limitations & Intended Use

English Only: Primarily focused on English; performance in other languages is not guaranteed.
Limited Reasoning: Due to its small scale, multi-step logical and mathematical reasoning (e.g., MMLU Elementary Math: 30.00%, ARC Challenge: 35.00%) is unreliable.
No RLHF Safety Alignment: Lacks reinforcement learning from human feedback, inheriting only the base model's alignment. Users should be aware of potential inappropriate responses.
Short Context: Trained with a maximum sequence length of 512 tokens, limiting very long conversations.
Research & Learning: Intended as a learning artifact and research baseline rather than a production-ready consumer product.

This model is ideal for developers and researchers looking for a small, efficient, and openly licensed (Apache 2.0) instruction-tuned model to experiment with conversational AI or as a foundation for further fine-tuning.

Overview

Havoc999/tiny-chatbot: A LoRA Fine-Tuned Conversational Assistant

Key Capabilities & Performance

Limitations & Intended Use

Full Model Card (README)