szymonrucinski/Curie-7B-v1
Task: Text Generation
Concurrency Cost: 1
Model Size: 7B
Quant: FP8
Ctx Length: 8k
Published: Jan 11, 2024
License: apache-2.0
Architecture: Transformer
Availability: Open Weights
Curie-7B-v1 is a 7 billion parameter decoder-based large language model developed by Szymon Ruciński and fine-tuned specifically for Polish text generation. It was adapted with Language Adaptive Pre-training (LAPT) on a high-quality Polish dataset, reaching a perplexity of 3.02 and rivaling top Polish encoder-decoder models on the KLEJ benchmark. The model excels at generating Polish text and can be adapted efficiently to other NLP tasks, including classification and regression.
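Because the weights are openly published on Hugging Face under the model ID above, the model can be run locally with the transformers library. The snippet below is a minimal sketch, assuming a standard causal-LM setup on a single GPU; the prompt and sampling values are illustrative placeholders, not tuned recommendations.

```python
# Minimal sketch (not an official example) of loading Curie-7B-v1 with
# Hugging Face transformers and generating Polish text locally.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "szymonrucinski/Curie-7B-v1"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.float16,  # assumption: half precision to fit a single GPU
    device_map="auto",
)

prompt = "Najważniejszym celem nauki jest"  # "The most important goal of science is"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)

output = model.generate(
    **inputs,
    max_new_tokens=128,
    do_sample=True,       # sample rather than greedy-decode
    temperature=0.7,      # placeholder value, not a recommended setting
    top_p=0.9,
)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```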
Popular Sampler Settings
The top three parameter combinations used by Featherless users for this model. No values are currently reported:
temperature: –
top_p: –
top_k: –
frequency_penalty: –
presence_penalty: –
repetition_penalty: –
min_p: –
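As an illustration of how sampler settings like these are typically supplied to a hosted model, here is a minimal sketch against an OpenAI-compatible completions endpoint. The base URL, the API-key environment variable, and the extra_body pass-through for samplers outside the OpenAI schema (top_k, repetition_penalty, min_p) are assumptions about the serving API rather than documented guarantees, and the specific values are placeholders, not the user configurations referenced above.

```python
# Hedged sketch: sending sampler parameters to an assumed OpenAI-compatible endpoint.
import os
from openai import OpenAI

client = OpenAI(
    base_url="https://api.featherless.ai/v1",   # assumption: OpenAI-compatible endpoint
    api_key=os.environ["FEATHERLESS_API_KEY"],  # assumption: key supplied via this env var
)

response = client.completions.create(
    model="szymonrucinski/Curie-7B-v1",
    prompt="Opowiedz krótko o Marii Skłodowskiej-Curie.",
    max_tokens=200,
    # Standard OpenAI-style sampler parameters:
    temperature=0.7,
    top_p=0.9,
    frequency_penalty=0.0,
    presence_penalty=0.0,
    # Samplers outside the OpenAI schema, forwarded only if the backend accepts them:
    extra_body={"top_k": 40, "repetition_penalty": 1.1, "min_p": 0.05},
)
print(response.choices[0].text)
```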