iaa01/CIA-1.7B
TEXT GENERATIONConcurrency Cost:1Model Size:2BQuant:BF16Ctx Length:32kPublished:Jan 11, 2026License:mitArchitecture:Transformer0.0K Open Weights Warm

The iaa01/CIA-1.7B model is a reinforcement learning (RL) post-trained language model, optimized using the novel ∆Belief reward signal. This approach rewards the model for reducing its own belief uncertainty, providing dense feedback for long-horizon, open-ended information-seeking tasks. Trained in a Twenty Questions environment, it learns general information-seeking strategies that generalize effectively.

Loading preview...

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p