kazuyamaa/alfworld-lambda-grpo-v002-hull
TEXT GENERATIONConcurrency Cost:1Model Size:4BQuant:BF16Ctx Length:32kPublished:Mar 1, 2026License:apache-2.0Architecture:Transformer Open Weights Warm
The kazuyamaa/alfworld-lambda-grpo-v002-hull is a 4 billion parameter Qwen3 model developed by kazuyamaa, fine-tuned from kazuyamaa/Qwen3-afworld-v001. This model was trained using Unsloth and Huggingface's TRL library, achieving 2x faster training speeds. It is designed for specific applications within the ALFWorld environment, leveraging its optimized training for efficient performance.
Loading preview...