jpacifico/Chocolatine-2-4B-Instruct-DPO-v2.1
TEXT GENERATIONConcurrency Cost:1Model Size:4BQuant:BF16Ctx Length:32kPublished:Feb 1, 2026License:apache-2.0Architecture:Transformer0.0K Open Weights Warm
Chocolatine-2-4B-Instruct-DPO-v2.1 by jpacifico is a 4.0 billion parameter instruction-tuned language model based on Qwen3-4B-Instruct-2507, featuring a native context length of 262,144 tokens. It is specifically post-trained using DPO and model merging to enhance instruction-following and reasoning in French, while maintaining strong multilingual capabilities. This model excels in French language tasks, showing consistent benchmark improvements, and is optimized for direct generation efficiency, making it suitable for local inference with available MLX and GGUF variants.
Loading preview...
Popular Sampler Settings
Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.
temperature
–
top_p
–
top_k
–
frequency_penalty
–
presence_penalty
–
repetition_penalty
–
min_p
–