Name: astom-M/matsuo-llm-advanced-phase-e3ab API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: astom-M

Overview

This model, astom-M/matsuo-llm-advanced-phase-e3ab, is a 7.6 billion parameter language model built upon the Qwen/Qwen2.5-7B-Instruct architecture. It features a substantial 32,768 token context length, making it capable of processing extensive inputs for complex tasks. The primary focus of its fine-tuning has been on enhancing performance in agent-based tasks, specifically targeting environments like DBBench and ALFWorld.

Key Capabilities

Agent Task Optimization: Fine-tuned for robust performance in interactive and decision-making agent scenarios.
Database Interaction: Utilizes datasets like dbbench_sft_dataset_react_v4, xlangai/spider, and birdsql/bird_mini_dev to excel in tasks involving database querying and interaction.
Environmental Reasoning: Training on the official ALFWorld v5 dataset equips it for reasoning and action within simulated interactive environments.
Compliance: Adheres to strict data integrity protocols, ensuring evaluation data was not used in training and no LLM-based data filtering was applied.

Good For

Developing AI Agents: Ideal for creating agents that need to interact with structured data (like databases) or navigate complex, interactive environments.
Database Query Generation: Suitable for applications requiring the generation of SQL queries or understanding database schemas.
Automated Reasoning: Can be applied to tasks demanding sequential decision-making and problem-solving in defined environments.

Technical Details

This model is compatible with vLLM v0.13.0 and newer versions, facilitating efficient inference and deployment.

Overview

Overview

Key Capabilities

Good For

Technical Details

Full Model Card (README)