Zachary1150/merge_lenfmt_MRL4096_ROLLOUT4_LR5e-7_w0.1_linear
Text Generation | Concurrency Cost: 1 | Model Size: 1.5B | Quant: BF16 | Ctx Length: 32k | Published: Dec 20, 2025 | Architecture: Transformer | Warm
Zachary1150/merge_lenfmt_MRL4096_ROLLOUT4_LR5e-7_w0.1_linear is a 1.5-billion-parameter language model created by Zachary1150 through a linear merge of two pre-trained models. Its description reports a context length of 131,072 tokens (the listing above shows 32k), which makes it a candidate for tasks that need a large amount of context. Its main distinguishing feature is the merge itself: the weights of two specific base models are combined, potentially improving performance in areas related to length and accuracy formatting. It is intended for applications that process and generate long sequences of text.
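As a rough illustration of what a linear merge does, the sketch below averages two sets of parameters element by element. It is a minimal sketch under stated assumptions, not the author's actual recipe: the function name `linear_merge`, the toy parameter dictionaries, and the weight of 0.1 (guessed from the `w0.1` in the model name) are all hypothetical, and the real merge tooling and source models are not stated on this page.

```python
def linear_merge(state_a, state_b, weight_b=0.1):
    """Return the element-wise weighted average of two parameter dicts.

    Each dict maps a parameter name to a flat list of floats.
    weight_b is the share given to state_b; state_a gets 1 - weight_b.
    The default of 0.1 is an assumption based on the model name only.
    """
    if state_a.keys() != state_b.keys():
        raise ValueError("models must share the same parameter names")

    merged = {}
    for name, values_a in state_a.items():
        values_b = state_b[name]
        if len(values_a) != len(values_b):
            raise ValueError(f"size mismatch for parameter {name!r}")
        # Linear interpolation between the two models' weights.
        merged[name] = [
            (1.0 - weight_b) * a + weight_b * b
            for a, b in zip(values_a, values_b)
        ]
    return merged


# Toy stand-ins for two checkpoints with identical parameter layouts.
model_a = {"layer.weight": [1.0, 2.0], "layer.bias": [0.0]}
model_b = {"layer.weight": [3.0, 4.0], "layer.bias": [1.0]}

merged = linear_merge(model_a, model_b, weight_b=0.1)
print(merged)
```

A linear merge only makes sense when both models share the same architecture and parameter names, which is why the sketch rejects mismatched inputs instead of merging them silently.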