mssfj/Qwen2.5-7B-Instruct_grpo_alfworld_trajectory_dataset

TEXT GENERATIONConcurrency Cost:1Model Size:7.6BQuant:FP8Ctx Length:32kPublished:Feb 27, 2026Architecture:Transformer Cold

The mssfj/Qwen2.5-7B-Instruct_grpo_alfworld_trajectory_dataset is a 7.6 billion parameter instruction-tuned model based on the Qwen2.5 architecture. This model is specifically designed for tasks related to the Alfworld environment, likely focusing on trajectory generation or understanding within interactive text-based games. Its instruction-tuned nature suggests optimization for following commands and generating relevant responses in such structured environments.

Loading preview...

Overview

This model, mssfj/Qwen2.5-7B-Instruct_grpo_alfworld_trajectory_dataset, is an instruction-tuned variant of the Qwen2.5 architecture, featuring 7.6 billion parameters and a context length of 32768 tokens. While specific training details and performance metrics are not provided in the available model card, its naming convention strongly indicates a specialization in tasks related to the Alfworld environment, particularly concerning trajectory datasets.

Key Characteristics

  • Architecture: Qwen2.5 base model.
  • Parameter Count: 7.6 billion parameters.
  • Context Length: Supports a substantial context window of 32768 tokens.
  • Instruction-Tuned: Optimized for understanding and executing instructions.
  • Specialization: Implied focus on Alfworld-related tasks, likely involving understanding or generating action trajectories within interactive text-based game environments.

Potential Use Cases

  • Alfworld Research: Ideal for researchers working on agents for the Alfworld environment.
  • Trajectory Generation: Potentially useful for generating sequences of actions or plans in text-based interactive settings.
  • Instruction Following: Applicable in scenarios requiring a model to follow complex instructions within a defined environment.