LFM2-350M: A Hybrid Model for Edge AI

LFM2-350M is a 0.35 billion parameter model from Liquid AI's new generation of hybrid models, specifically engineered for edge AI and on-device deployment. It introduces a novel architecture combining multiplicative gates and short convolutions, resulting in significant performance gains.

Key Capabilities & Performance

Optimized Speed: Achieves 3x faster training and 2x faster decode and prefill speeds on CPU compared to Qwen3, making it highly efficient for resource-constrained environments.
Superior Performance: Outperforms other models of similar size across multiple benchmarks, including knowledge, mathematics, instruction following, and multilingual tasks.
Flexible Deployment: Designed to run efficiently on CPU, GPU, and NPU hardware, enabling deployment on diverse devices like smartphones, laptops, and vehicles.
Multilingual Support: Supports English, Arabic, Chinese, French, German, Japanese, Korean, and Spanish.
Tool Use: Features robust tool use capabilities, allowing for complex agentic tasks through JSON function definitions and Pythonic function calls.

Recommended Use Cases

LFM2-350M is particularly suited for fine-tuning on narrow use cases to maximize performance. It is recommended for:

Agentic tasks
Data extraction
RAG (Retrieval Augmented Generation)
Creative writing
Multi-turn conversations

Due to its small size, it is not recommended for knowledge-intensive tasks or those requiring extensive programming skills without specific fine-tuning.

Overview

LFM2-350M: A Hybrid Model for Edge AI

Key Capabilities & Performance

Recommended Use Cases

Full Model Card (README)