LLM-OS-Models/LFM2.5-1.2B-Terminal-SFT-1Epoch-LiquidCLI-TemplateHoldout
LLM-OS-Models/LFM2.5-1.2B-Terminal-SFT-1Epoch-LiquidCLI-TemplateHoldout is a 1.2 billion parameter model based on LiquidAI/LFM2.5-1.2B-Instruct, specifically fine-tuned for terminal automation tasks. It generates subsequent terminal commands in JSON format based on user input and previous terminal states. This model is optimized for cost-effective and fast inference in specific acceleration paths, demonstrating a conservative approach to command generation.
Loading preview...
Overview
This model, LLM-OS-Models/LFM2.5-1.2B-Terminal-SFT-1Epoch-LiquidCLI-TemplateHoldout, is a 1.2 billion parameter model derived from LiquidAI/LFM2.5-1.2B-Instruct. It is specialized for terminal automation, designed to generate the next command in JSON format given a task description and the current terminal state. The model was trained for 1 epoch using Liquid-CLI style preprocessing and training, with a chat-template aligned holdout split.
Key Capabilities & Characteristics
- Terminal Command Generation: Generates JSON-formatted commands for terminal automation.
- JSON Output: Recommends a specific JSON output format including
analysis,plan,commands(withkeystrokesandduration), andtask_complete. - Conservative Command Generation: Tends to issue correct commands conservatively rather than many incorrect ones.
- Efficiency: Offers fast inference at
0.085sec/step, making it suitable for iterative evaluation and RL experiments. - Base Model: Built upon the LFM series, which shows significant SFT performance gains relative to its base score.
Performance & Limitations
Evaluated on the corrected TB2-lite replay set, the model achieved a score of 28.10 (Command F1) and a rank of 36 / 56. Key metrics include:
- Command F1: 0.2810
- Valid JSON: 50.5%
- First command exact: 29.4%
Limitations include:
- Lower Recall: May omit some necessary commands.
- JSON Format Failures: Requires parsing validation and retries before execution due to a 50.5% valid JSON rate.
- Specialized Use: This model is specifically for automated terminal operation assistance and does not guarantee general conversation or reasoning performance.
- Safety: Generated commands require safety measures like sandboxing, allowlisting, or human review before execution.
Use Cases
- Automated Terminal Operations: Ideal for scenarios requiring automated command execution in a terminal environment.
- Cost-Effective Inference: Suitable for applications where fast inference and cost efficiency are critical in specific acceleration paths.
- RL Experimentation: Its efficiency makes it a good candidate for reinforcement learning experiments focused on improving terminal interaction.