Lambent/Qwen3-4B-Base-Continued-GRPO-Style-Karcher
TEXT GENERATIONConcurrency Cost:1Model Size:4BQuant:BF16Ctx Length:32kPublished:Feb 5, 2026License:apache-2.0Architecture:Transformer0.0K Open Weights Warm

Lambent/Qwen3-4B-Base-Continued-GRPO-Style-Karcher is a 4 billion parameter language model based on the Qwen3-4B-Base architecture, fine-tuned using a Karcher Mean merge of adapters. This model demonstrates improved perplexity on tasks like lambada_openai and enhanced diversity metrics, particularly in distinct-1 and pairwise diversity for domains such as ao3_english and bbc_news. It is optimized for generating more varied and less repetitive text outputs across diverse domains.

Loading preview...