iaa01/CIA-1.7B
TEXT GENERATIONConcurrency Cost:1Model Size:2BQuant:BF16Ctx Length:32kPublished:Jan 11, 2026License:mitArchitecture:Transformer0.0K Open Weights Warm
The iaa01/CIA-1.7B model is a reinforcement learning (RL) post-trained language model, optimized using the novel ∆Belief reward signal. This approach rewards the model for reducing its own belief uncertainty, providing dense feedback for long-horizon, open-ended information-seeking tasks. Trained in a Twenty Questions environment, it learns general information-seeking strategies that generalize effectively.
Loading preview...
Popular Sampler Settings
Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.
temperature
–
top_p
–
top_k
–
frequency_penalty
–
presence_penalty
–
repetition_penalty
–
min_p
–