Overview
LMIS-ORG/MemAgent_Slime_Agentic_Qwen2.5_7B is a 7.6-billion-parameter model from LMIS-ORG, built on the Qwen2.5 base. It implements the core ideas of the MemAgent architecture: efficient processing of extremely long documents by maintaining a fixed-size recurrent memory.
Key Capabilities
- Arbitrarily Long Document Processing: Compresses extensive documents into a fixed-size recurrent memory, chunk by chunk, allowing it to handle contexts far exceeding typical LLM limits.
- Recurrent Memory Update: Uses an LLM-driven update loop that continually refines the memory as new document chunks arrive, then answers questions from this compressed memory.
- Reinforcement Learning (RL) Optimization: Employs GRPO with a Multi-Conversation training objective, applying RL to all memory-update turns to learn effective retention of critical information across chunks.
- Superior Long-Context Performance: Consistently outperforms larger baseline models, including 14B- and 32B-parameter models, on the RULER-HQA benchmark at context lengths ranging from 7K to 448K tokens.
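The chunk-by-chunk loop behind these capabilities can be sketched as follows. This is a minimal illustration, not the released implementation: `llm_update_memory` is a hypothetical stand-in for the model's memory-update turn (a prompted generation in MemAgent), and the memory/chunk sizes are assumed values chosen for the example; word counting stands in for real tokenization.

```python
# Sketch of MemAgent-style processing: a document is consumed chunk by
# chunk, and a fixed-size recurrent memory is rewritten after each chunk.

MEMORY_WORDS = 1024   # fixed memory budget (assumed value, words not tokens)
CHUNK_WORDS = 4096    # document chunk size (assumed value)

def llm_update_memory(memory: str, chunk: str) -> str:
    """Hypothetical stand-in for the LLM memory-update turn.

    In MemAgent this is a prompted generation that decides what to keep;
    here we simply truncate to the most recent words so the loop is
    runnable and the memory stays bounded.
    """
    combined = (memory + " " + chunk).split()
    return " ".join(combined[-MEMORY_WORDS:])

def answer_from_document(document: str, question: str) -> str:
    """Compress the whole document into the recurrent memory, then answer."""
    words = document.split()
    memory = ""
    # Recurrent loop: memory size is constant regardless of document length,
    # so arbitrarily long inputs never exceed the model's context window.
    for start in range(0, len(words), CHUNK_WORDS):
        chunk = " ".join(words[start:start + CHUNK_WORDS])
        memory = llm_update_memory(memory, chunk)
    # A final turn would answer the question from the compressed memory;
    # here we just report what the answering step would see.
    return f"answer to {question!r} from a {len(memory.split())}-word memory"
```

Note the key property: per-step cost depends only on `MEMORY_WORDS + CHUNK_WORDS`, not on total document length, which is what lets the architecture scale to hundreds of thousands of tokens.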
Good For
- Question Answering on Very Long Documents: Ideal for applications requiring accurate information retrieval and synthesis from extremely lengthy texts, such as legal documents, research papers, or extensive reports.
- Memory-Efficient Long-Context Understanding: Suitable for scenarios where processing vast amounts of text is necessary but computational resources or memory constraints prevent loading the entire document at once.
- Research and Development in Agentic LLMs: Provides a strong baseline and architecture for further exploration into agentic models that manage and update internal states or memories for complex tasks.