mssfj/Qwen2.5-7B-Instruct_grpo_alfworld_trajectory_dataset
TEXT GENERATIONConcurrency Cost:1Model Size:7.6BQuant:FP8Ctx Length:32kPublished:Feb 27, 2026Architecture:Transformer Cold
The mssfj/Qwen2.5-7B-Instruct_grpo_alfworld_trajectory_dataset is a 7.6 billion parameter instruction-tuned model based on the Qwen2.5 architecture. This model is specifically designed for tasks related to the Alfworld environment, likely focusing on trajectory generation or understanding within interactive text-based games. Its instruction-tuned nature suggests optimization for following commands and generating relevant responses in such structured environments.
Loading preview...