nics-efc/MARSHAL-Kuhn-Poker-Qwen3-4B
Text Generation · Concurrency Cost: 1 · Model Size: 4B · Quant: BF16 · Ctx Length: 32k · Published: Nov 28, 2025 · License: apache-2.0 · Architecture: Transformer · Open Weights · Cold
nics-efc/MARSHAL-Kuhn-Poker-Qwen3-4B is a 4-billion-parameter model, initialized from Qwen3-4B and trained as a Kuhn Poker specialist within the MARSHAL framework. Developed by Huining Yuan et al. as part of the MARSHAL project, the model is trained through self-play, using a turn-level advantage estimator and agent-specific advantage normalization to assign credit at fine granularity in multi-agent, multi-turn strategic games. It is aimed at competitive imperfect-information games such as Kuhn Poker, and its training is reported to generalize beyond the game: it improves performance on reasoning benchmarks when integrated into multi-agent systems.
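To make the idea of agent-specific advantage normalization more concrete, here is a minimal sketch of one plausible reading: each turn's advantage is standardized using statistics computed only over turns played by the same agent. This is an illustration under stated assumptions, not the MARSHAL implementation; the function name `normalize_per_agent`, the `eps` parameter, and the sample numbers are all hypothetical.

```python
from collections import defaultdict
from statistics import mean, pstdev


def normalize_per_agent(advantages, agent_ids, eps=1e-8):
    """Standardize each advantage with the mean/std of its own agent's turns.

    Assumption: "agent-specific" means per-agent statistics; the actual
    MARSHAL estimator may differ in its details.
    """
    # Group turn-level advantages by the agent that acted on that turn.
    groups = defaultdict(list)
    for adv, agent in zip(advantages, agent_ids):
        groups[agent].append(adv)

    # Per-agent mean and population standard deviation.
    stats = {agent: (mean(vals), pstdev(vals)) for agent, vals in groups.items()}

    # Normalize each advantage against its own agent's statistics.
    return [
        (adv - stats[agent][0]) / (stats[agent][1] + eps)
        for adv, agent in zip(advantages, agent_ids)
    ]


# Hypothetical turn-level advantages from a self-play batch with two agents.
advantages = [1.0, -1.0, 2.0, -2.0, 3.0, 1.0]
agent_ids = [0, 1, 0, 1, 0, 1]
normalized = normalize_per_agent(advantages, agent_ids)
```

Normalizing per agent, rather than over the whole batch, keeps one player's reward scale from dominating the other's in a game where the two players' returns mirror each other.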