atenareply/lfm2.5-1.2b-noval-agentic
The atenareply/lfm2.5-1.2b-noval-agentic is a 1.2 billion parameter language model specifically fine-tuned for agentic behavior in a narrow, verifiable maintenance assistant domain. It excels at telemetry anomaly triage, native tool-calling, and autonomous maintenance loops, including raw-log reading. This model is optimized for integration into tool-execution loops, acting as a decision brain for tasks like classifying alerts and orchestrating maintenance actions within a fictional Orbital Mining Corporation (OMC) context.
Loading preview...
Model Overview
The atenareply/lfm2.5-1.2b-noval-agentic is a 1.2 billion parameter model developed by atenareply, specialized as an agentic maintenance assistant. It is the result of a Supervised Fine-Tuning (SFT) process using LoRA on the ParamΔ model, focusing on grounded tool-use trajectories. The model's primary domain is fictional Orbital Mining Corporation (OMC) technical documentation and Mars Express telemetry.
Key Capabilities
- Telemetry Anomaly Triage: Classifies alerts as routine or genuine with high precision (1.00) for genuine anomalies.
- Native Tool-Calling: Supports 8 specific tools, including
get_channel_stats,query_telemetry,classify_alert, andschedule_maintenance, using a native LFM2 tool format. - Autonomous Maintenance Loop: Capable of executing a full maintenance loop for genuine anomalies, involving consulting SOPs, issuing commands, scheduling maintenance, and opening incidents.
- Raw-Log Reading: Can process raw telemetry rows for complex tasks, demonstrating a difficulty-graded agentic curriculum.
- High Performance in Domain: Achieves a pass@1 of 0.782 for full trajectories and an action-loop accuracy of 0.782, significantly outperforming its ParamΔ baseline.
Good For
- Automated Operations: Ideal for use within a tool-execution loop where the model's outputs (tool calls) are executed, and results are fed back.
- Decision-Making Agents: Functions as the 'decision brain' for complex, multi-step maintenance and triage processes.
- Verifiable Agentic Systems: Benefits from innovations like a deterministic verifier for evaluation and reward, ensuring high reliability in its specific domain.
Limitations
- Requires Tool Execution Loop: The model must be used within a multi-turn loop that executes tools and feeds results back; it will hallucinate or stall in plain chat without tool execution.
- Fictional Domain: Primarily trained on fictional data, meaning real-world applicability outside the OMC/Mars-Express context may vary.
- Recall Rate: Has a recall of 0.63 for genuine anomalies, indicating approximately 37% of genuine issues might be missed, though with perfect precision (no false alarms).