Name: kazuyamaa/alfworld-lambda-grpo-v004 API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: kazuyamaa

Model Overview

The kazuyamaa/alfworld-lambda-grpo-v004 is a 4 billion parameter Qwen3 model developed by kazuyamaa. It is a finetuned version of the kazuyamaa/alfworld-lambda-grpo-v002-hull model, specifically optimized for efficiency during training.

Key Capabilities

Efficient Finetuning: This model was trained significantly faster using Unsloth and Huggingface's TRL library, indicating a focus on rapid iteration and development.
ALFWorld Specialization: As indicated by its lineage and naming convention, the model is likely specialized for tasks within the ALFWorld environment, which involves interactive text-based games requiring reasoning and action generation.

Good For

ALFWorld Research: Ideal for researchers and developers working on agents for the ALFWorld environment, particularly those interested in models finetuned for this specific domain.
Efficient Model Development: Demonstrates the application of tools like Unsloth for accelerating the finetuning process of large language models.

Overview

Model Overview

Key Capabilities

Good For

Full Model Card (README)