anthracite-org/magnum-v2.5-12b-kto
TEXT GENERATIONConcurrency Cost:1Model Size:12BQuant:FP8Ctx Length:32kPublished:Aug 12, 2024License:apache-2.0Architecture:Transformer0.1K Open Weights Warm

anthracite-org/magnum-v2.5-12b-kto is a 12 billion parameter experimental language model developed by Anthracite, fine-tuned on magnum-12b-v2. It utilizes a hybrid KTO + DPOP reinforcement learning strategy to enhance instruction following, aiming to replicate the prose quality of Claude 3 models. This model is optimized for generating high-quality, instruction-tuned text with a 32768 token context length.

Loading preview...

Popular Sampler Settings

Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.

temperature
top_p
top_k
frequency_penalty
presence_penalty
repetition_penalty
min_p