astom-M/matsuo-llm-advanced-phase-e3ab

TEXT GENERATIONConcurrency Cost:1Model Size:7.6BQuant:FP8Ctx Length:32kPublished:Feb 19, 2026License:apache-2.0Architecture:Transformer Open Weights Cold

The astom-M/matsuo-llm-advanced-phase-e3ab is a 7.6 billion parameter language model fine-tuned from Qwen/Qwen2.5-7B-Instruct with a 32K context length. This model is specifically optimized for agent tasks, demonstrating proficiency in environments like DBBench and ALFWorld. Its training incorporates diverse datasets including synthetic SFT for DBBench, xlangai/spider, and birdsql/bird_mini_dev, making it suitable for complex database interaction and interactive environment reasoning.

Loading preview...

Overview

This model, astom-M/matsuo-llm-advanced-phase-e3ab, is a 7.6 billion parameter language model built upon the Qwen/Qwen2.5-7B-Instruct architecture. It features a substantial 32,768 token context length, making it capable of processing extensive inputs for complex tasks. The primary focus of its fine-tuning has been on enhancing performance in agent-based tasks, specifically targeting environments like DBBench and ALFWorld.

Key Capabilities

  • Agent Task Optimization: Fine-tuned for robust performance in interactive and decision-making agent scenarios.
  • Database Interaction: Utilizes datasets like dbbench_sft_dataset_react_v4, xlangai/spider, and birdsql/bird_mini_dev to excel in tasks involving database querying and interaction.
  • Environmental Reasoning: Training on the official ALFWorld v5 dataset equips it for reasoning and action within simulated interactive environments.
  • Compliance: Adheres to strict data integrity protocols, ensuring evaluation data was not used in training and no LLM-based data filtering was applied.

Good For

  • Developing AI Agents: Ideal for creating agents that need to interact with structured data (like databases) or navigate complex, interactive environments.
  • Database Query Generation: Suitable for applications requiring the generation of SQL queries or understanding database schemas.
  • Automated Reasoning: Can be applied to tasks demanding sequential decision-making and problem-solving in defined environments.

Technical Details

This model is compatible with vLLM v0.13.0 and newer versions, facilitating efficient inference and deployment.