Name: langfeng01/GiGPO-Qwen2.5-7B-Instruct-ALFWorld API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: langfeng01

Model Overview

langfeng01/GiGPO-Qwen2.5-7B-Instruct-ALFWorld is a 7.6 billion parameter instruction-tuned language model built upon the Qwen2.5 architecture. This model is uniquely specialized for embodied AI tasks, particularly within the ALFRED Embodied Environment. It leverages a substantial context length of 131,072 tokens, enabling it to process extensive observational histories and task descriptions for complex sequential decision-making.

Key Capabilities

Embodied AI Task Execution: Specifically trained to operate as an expert agent in the ALFRED Embodied Environment, handling tasks that require understanding observations and generating appropriate actions.
GiGPO Training: Utilizes the GiGPO (Generative Imitation Guided Policy Optimization) method, as detailed in the associated arXiv paper, to enhance its performance in interactive environments.
Structured Reasoning and Action: Designed to perform step-by-step reasoning within <think> tags before selecting an admissible action, presented within <action> tags, following a specific prompt template.
Contextual Understanding: Benefits from its large context window to maintain a detailed history of observations and actions, crucial for navigating and completing multi-step tasks.

Good For

Research in Embodied AI: Ideal for researchers and developers working on agents for simulated environments like ALFRED.
Sequential Decision-Making: Applications requiring an agent to reason and act based on a series of observations and a history of actions.
Developing Intelligent Agents: Useful for building agents that can interpret complex instructions and execute tasks in interactive settings.

Overview

Model Overview

Key Capabilities

Good For

Full Model Card (README)