LFM2.5-1.2B-Instruct: On-Device AI Powerhouse

LFM2.5-1.2B-Instruct is a 1.2 billion parameter instruction-tuned model developed by Liquid AI, part of the LFM2.5 family of hybrid models. It is specifically engineered for on-device deployment, offering high performance in a compact footprint. The model builds on the LFM2 architecture with significantly extended pre-training (28 trillion tokens) and advanced multi-stage reinforcement learning.

Key Capabilities & Features

Best-in-class performance for its size: Rivals much larger models, enabling high-quality AI on resource-constrained devices.
Fast edge inference: Achieves 239 tok/s decode on AMD CPU and 82 tok/s on mobile NPU, running under 1GB of memory. Supports llama.cpp, MLX, and vLLM from day one.
Long context window: Features a 32,768-token context length.
Multilingual support: Handles English, Arabic, Chinese, French, German, Japanese, Korean, and Spanish.
Tool Use: Supports function calling with a flexible template for integrating external tools, outputting Pythonic or JSON function calls.
Optimized formats: Available in native, GGUF, ONNX, and MLX formats for diverse deployment scenarios, including Apple Silicon.

Ideal Use Cases

Agentic tasks
Data extraction
Retrieval Augmented Generation (RAG)
On-device applications across mobile, IoT, vehicles, and embedded systems.

It is important to note that this model is not recommended for knowledge-intensive tasks or programming.

Overview

LFM2.5-1.2B-Instruct: On-Device AI Powerhouse

Key Capabilities & Features

Ideal Use Cases

Full Model Card (README)