LFM2-700M: A Hybrid Model for Edge AI

LFM2-700M is a 0.7 billion parameter model from Liquid AI's new generation of hybrid models, specifically engineered for efficient edge AI and on-device deployment. It offers a 32,768 token context length and is built on a novel architecture featuring multiplicative gates and short convolutions (10 conv + 6 attn layers).

Key Capabilities & Features

Optimized Performance: Achieves 3x faster training and 2x faster decode/prefill speeds on CPU compared to previous generations and Qwen3, respectively.
Benchmark Outperformance: Surpasses similarly-sized models in knowledge, mathematics, instruction following, and multilingual benchmarks.
Flexible Deployment: Designed for efficient operation across CPU, GPU, and NPU hardware.
Multilingual Support: Supports English, Arabic, Chinese, French, German, Japanese, Korean, and Spanish.
Tool Use: Incorporates a structured tool-use mechanism with JSON function definitions and Pythonic function calls.
Training: Trained on 10 trillion tokens using knowledge distillation from LFM1-7B, large-scale SFT, custom DPO, and iterative model merging.

Recommended Use Cases

LFM2-700M is particularly suited for fine-tuning on narrow applications to maximize performance. It is recommended for:

Agentic tasks
Data extraction
Retrieval Augmented Generation (RAG)
Creative writing
Multi-turn conversations

However, it is not recommended for knowledge-intensive tasks or those requiring strong programming skills.

Overview

LFM2-700M: A Hybrid Model for Edge AI

Key Capabilities & Features

Recommended Use Cases

Full Model Card (README)