allura-org/remnant-glm4-32b
TEXT GENERATIONConcurrency Cost:2Model Size:32BQuant:FP8Ctx Length:32kPublished:May 13, 2025License:apache-2.0Architecture:Transformer0.0K Warm

The allura-org/remnant-glm4-32b is a 32 billion parameter language model based on the GLM-4 architecture, fine-tuned by allura-org. This model is specifically optimized for SFW and NSFW roleplaying and conversational tasks. It leverages a 32768 token context length, making it suitable for extended and nuanced interactions. The model's primary strength lies in generating engaging and contextually rich dialogue for character-based applications.

Loading preview...

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p