JetBrains/Mellum-4b-base
Text Generation | Concurrency Cost: 1 | Model Size: 4B | Quant: BF16 | Ctx Length: 32k | Published: Apr 28, 2025 | License: apache-2.0 | Architecture: Transformer | 0.4K | Open Weights | Warm

Mellum-4b-base from JetBrains is a 4-billion-parameter, LLaMA-style causal language model optimized for code-related tasks. It was trained on over 4 trillion tokens with an 8192-token context window and is tuned for code completion across multiple programming languages. As a base model, it is designed for efficient deployment in developer tooling and AI-powered coding assistants, and it serves as a foundation for fine-tuning.
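Because the model works within a fixed context window, a caller typically has to make sure the prompt and the requested completion fit together. The following is a minimal sketch of that arithmetic, not an official client: `completion_budget` and `build_request` are hypothetical helper names, the request body follows the common OpenAI-style completions shape (`model`, `prompt`, `max_tokens`) as an assumption about the serving endpoint, and the prompt's token count is assumed to come from the caller's own tokenizer. The 8192-token figure is taken from the description above; the listing header shows a 32k context length, so the window is left as a parameter.

```python
import json

MODEL_ID = "JetBrains/Mellum-4b-base"
CONTEXT_WINDOW = 8192  # tokens, per the model description; adjust if the host differs


def completion_budget(prompt_tokens: int, requested: int,
                      context_window: int = CONTEXT_WINDOW) -> int:
    """Clamp the completion length so prompt + completion fits in the window."""
    if prompt_tokens >= context_window:
        raise ValueError("prompt alone fills or exceeds the context window")
    return min(requested, context_window - prompt_tokens)


def build_request(prompt: str, prompt_tokens: int, requested: int = 256) -> dict:
    """Assemble an OpenAI-style completions body (hypothetical helper, nothing is sent)."""
    return {
        "model": MODEL_ID,
        "prompt": prompt,
        "max_tokens": completion_budget(prompt_tokens, requested),
    }


# A long prompt (8000 tokens, counted elsewhere) leaves little room for output.
body = build_request("def fibonacci(n):\n", prompt_tokens=8000)
print(json.dumps(body, indent=2))
```

Clamping `max_tokens` on the client side avoids a rejected request when an editor sends a large file prefix as the prompt.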

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p
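
To make the parameters listed above concrete, here is a simplified sketch of how such sampler settings commonly act on a model's next-token scores. It is an illustration under stated assumptions, not the serving stack's actual implementation: the function names are hypothetical, the penalty formulas follow the widely used conventions (subtractive frequency and presence penalties, multiplicative repetition penalty), and the three filters are all applied to the same distribution rather than chained with renormalization in between, as real samplers may do.

```python
import math
from collections import Counter


def apply_penalties(logits, generated_ids, frequency_penalty=0.0,
                    presence_penalty=0.0, repetition_penalty=1.0):
    """Lower the scores of tokens that were already generated."""
    counts = Counter(generated_ids)
    out = list(logits)
    for tok, count in counts.items():
        # Multiplicative penalty: shrink positive scores, push negative ones lower.
        if out[tok] > 0:
            out[tok] /= repetition_penalty
        else:
            out[tok] *= repetition_penalty
        # Subtractive penalties: one scales with the count, one is a flat cost.
        out[tok] -= count * frequency_penalty + presence_penalty
    return out


def softmax(logits, temperature=1.0):
    """Turn scores into probabilities; lower temperature sharpens the result."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def filter_probs(probs, top_k=0, top_p=1.0, min_p=0.0):
    """Zero out tokens rejected by top_k, top_p, or min_p, then renormalize."""
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    keep = set(order)
    if top_k > 0:  # keep only the k most likely tokens
        keep &= set(order[:top_k])
    if top_p < 1.0:  # keep the smallest prefix whose mass reaches top_p
        cumulative, nucleus = 0.0, set()
        for i in order:
            nucleus.add(i)
            cumulative += probs[i]
            if cumulative >= top_p:
                break
        keep &= nucleus
    if min_p > 0.0:  # drop tokens far less likely than the best token
        threshold = min_p * probs[order[0]]
        keep &= {i for i in order if probs[i] >= threshold}
    total = sum(probs[i] for i in keep)
    return [probs[i] / total if i in keep else 0.0 for i in range(len(probs))]


logits = apply_penalties([2.0, 1.0, 0.5, -1.0], generated_ids=[0, 0],
                         frequency_penalty=0.5)
probs = filter_probs(softmax(logits, temperature=0.7), top_k=3, min_p=0.05)
print(probs)
```

In this toy run, token 0 starts as the favorite but is penalized for having appeared twice, which is the kind of effect the frequency penalty is meant to have on repetitive completions.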