ermiaazarkhalili/LFM2.5-350M-Function-Calling-xLAM-Unsloth
TEXT GENERATIONConcurrency Cost:1Model Size:0.35BQuant:BF16Ctx Length:32kPublished:Apr 24, 2026License:apache-2.0Architecture:Transformer Open Weights Cold
The ermiaazarkhalili/LFM2.5-350M-Function-Calling-xLAM-Unsloth is a 350 million parameter language model, fine-tuned by ermiaazarkhalili from the LiquidAI LFM2.5-350M base model. Optimized for function calling, it was trained using Unsloth for efficiency and the Salesforce/xlam-function-calling-60k dataset. This model excels at interpreting natural language queries to generate structured function calls, making it suitable for tool-use applications.
Loading preview...
Overview of LFM2.5-350M-Function-Calling-xLAM-Unsloth
This model, developed by ermiaazarkhalili, is a specialized fine-tune of the LiquidAI LFM2.5-350M base model, specifically optimized for function calling. It leverages the Unsloth framework, which enabled 2x faster training and 60% less VRAM usage during its development.
Key Capabilities & Features
- Function Calling Expertise: Fine-tuned on the extensive Salesforce/xlam-function-calling-60k dataset, comprising 60,000 examples of queries, tool definitions, and structured answers.
- Efficient Training: Utilizes Unsloth and QLoRA (4-bit) for efficient SFT, achieving a final training loss of 0.6507 with a peak VRAM of only 5.73 GB on an NVIDIA H100 GPU.
- Compact Size: At 350 million parameters, it offers a lightweight solution for integrating function calling capabilities into applications.
- Flexible Deployment: Available in standard Transformers format, and also provides GGUF versions for CPU and edge inference, compatible with tools like Ollama and llama.cpp.
- English Language Focus: Primarily trained on English data, making it best suited for English-language function calling tasks.
Ideal Use Cases
- Tool Use & Agent Systems: Excellent for applications requiring the model to interpret user requests and generate appropriate function calls to external tools or APIs.
- Resource-Constrained Environments: Its small size and efficient training/inference (especially with Unsloth and GGUF) make it suitable for deployment where computational resources are limited.
- Prototyping & Development: A strong candidate for developers looking to quickly integrate function calling without the overhead of larger models.