QKing-Official/EndAI-Small

Text Generation | Concurrency Cost: 1 | Model Size: 1.1B | Quant: BF16 | Ctx Length: 2k | Published: Apr 23, 2026 | License: MIT | Architecture: Transformer | Open Weights | Cold

EndAI-Small by QKing-Official is a 1.1-billion-parameter language model built on the TinyLlama architecture. It was trained on a subset of the HuggingFaceH4/ultrachat_200k dataset and optimized for efficient operation on both CPU and GPU. The model targets rapid inference in resource-constrained environments, making it suitable for applications that need a compact, fast AI solution.
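The card's headline numbers (1.1B parameters, BF16 weights) imply a rough lower bound on memory just for the weights: 2 bytes per parameter. A back-of-the-envelope sketch (weights only; activations and the KV cache add more on top):

```python
def weight_memory_gib(n_params: float, bytes_per_param: int = 2) -> float:
    """Weight-only memory estimate in GiB; BF16 stores 2 bytes per parameter."""
    return n_params * bytes_per_param / 1024**3

# 1.1B parameters in BF16: roughly 2.05 GiB of weights alone
print(round(weight_memory_gib(1.1e9), 2))  # → 2.05
```

This is why the model is plausible on consumer hardware: the weights fit comfortably in a few gigabytes of RAM or VRAM.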


EndAI-Small: A Compact and Efficient LLM

EndAI-Small, developed by QKing-Official, is a 1.1-billion-parameter language model based on the TinyLlama architecture. It prioritizes efficiency and speed, making it a strong candidate for applications where computational resources are limited.

Key Capabilities

  • Lightweight Design: Built on TinyLlama, ensuring a small footprint.
  • Optimized for Speed: Engineered for rapid inference on both CPUs and GPUs.
  • Instruction-Tuned: Trained on 3% of the HuggingFaceH4/ultrachat_200k dataset, providing basic instruction-following capabilities.
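Since the model is instruction-tuned, the usual way to run it would be the standard Hugging Face `transformers` chat pattern. The sketch below is an assumption, not official usage from this card: only the repo id `QKing-Official/EndAI-Small` comes from the page, and the code assumes the repo ships a tokenizer with a chat template (as TinyLlama-derived chat models typically do).

```python
def generate(prompt: str, max_new_tokens: int = 128) -> str:
    """Hypothetical usage sketch: load EndAI-Small and answer one prompt.

    Assumes the standard transformers API and a repo-provided chat template.
    """
    from transformers import AutoModelForCausalLM, AutoTokenizer  # lazy import

    model_id = "QKing-Official/EndAI-Small"  # repo id from this card
    tok = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id)  # BF16 weights per the card

    messages = [{"role": "user", "content": prompt}]
    inputs = tok.apply_chat_template(
        messages, return_tensors="pt", add_generation_prompt=True
    )
    out = model.generate(inputs, max_new_tokens=max_new_tokens)
    # Decode only the newly generated tokens, not the echoed prompt
    return tok.decode(out[0][inputs.shape[-1]:], skip_special_tokens=True)

if __name__ == "__main__":
    print(generate("Summarize what a transformer model is in one sentence."))
```

On CPU-only machines this should still run, just more slowly; the small parameter count is what makes that practical.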

Good For

  • Edge Devices: Ideal for deployment on hardware with limited memory and processing power.
  • Local Inference: Enables quick AI processing directly on user devices without requiring powerful cloud infrastructure.
  • Rapid Prototyping: Its small size allows for fast experimentation and integration into projects.
  • CPU-Bound Applications: Specifically designed to perform well even on CPU-only setups, broadening its accessibility.
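One practical constraint for the deployment scenarios above is the 2k-token context window from the card's metadata. A minimal sketch of budgeting prompts against that limit, using a crude characters-per-token heuristic (the ~4 chars/token figure and the helper itself are assumptions; use the model's real tokenizer for exact counts):

```python
def truncate_to_context(text: str, ctx_tokens: int = 2048, reserve: int = 256,
                        chars_per_token: float = 4.0) -> str:
    """Trim `text` so the prompt plus `reserve` tokens of generation budget
    fit inside a ctx_tokens-token context window.

    Keeps the tail of the text (most recent content), which is usually what
    matters for chat-style prompts. chars_per_token ~ 4 is a rough English
    heuristic, not a property of this model's tokenizer.
    """
    budget_chars = int((ctx_tokens - reserve) * chars_per_token)
    return text if len(text) <= budget_chars else text[-budget_chars:]

# Short inputs pass through unchanged; long ones are clipped to the budget
print(len(truncate_to_context("x" * 10_000)))  # → 7168 chars ≈ 1792 tokens
```

Budgeting like this up front avoids silent truncation or out-of-context errors once the prompt plus the reply would exceed the 2k window.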