IAAR-Shanghai/MemReader-4B-thinking
TEXT GENERATIONConcurrency Cost:1Model Size:4BQuant:BF16Ctx Length:32kPublished:Apr 7, 2026License:apache-2.0Architecture:Transformer0.0K Open Weights Cold

IAAR-Shanghai/MemReader-4B-thinking is a 4 billion parameter language model built on Qwen3-4B, specifically designed for active long-term agent memory management. It formulates memory construction as a reasoning-and-action process, enabling explicit evaluation and selection of memory operations like adding, searching, buffering, or ignoring information. This model excels in long-horizon dialogue systems, personalized assistants, and agent frameworks requiring low-noise, updatable, and retrievable long-term memory, supporting a 32768 token context length.

Loading preview...