nbeerbower/Mistral-Nemo-Gutenberg-Encore-12B

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:12BQuant:FP8Ctx Length:32kPublished:Jun 4, 2025License:apache-2.0Architecture:Transformer0.0K Open Weights Warm

nbeerbower/Mistral-Nemo-Gutenberg-Encore-12B is a 12 billion parameter causal language model, fine-tuned by nbeerbower from mistralai/Mistral-Nemo-Instruct-2407. This model specializes in creative writing and narrative generation, particularly for fiction, through ORPO tuning on various DPO datasets focused on literary content. It offers enhanced prose style and thematic depth compared to its base model, making it suitable for generating imaginative and coherent stories.

Loading preview...

Model Overview

nbeerbower/Mistral-Nemo-Gutenberg-Encore-12B is a 12 billion parameter language model derived from the Mistral-Nemo-Instruct-2407 architecture. It has been specifically fine-tuned using the ORPO (Optimized Reward Policy Optimization) method to excel in creative writing tasks, particularly fiction generation.

Key Capabilities

  • Enhanced Narrative Quality: The model demonstrates improved story arcs, pacing, and coherent plot development compared to its base model.
  • Refined Prose Style: It generates text with more evocative imagery and thematic depth, moving beyond generic phrasing.
  • Thematic Exploration: Excels at handling complex themes like sacrifice, entropy, and memory, integrating them effectively into narratives.
  • Prompt Adherence: Highly capable of following detailed and imaginative writing prompts, incorporating specific elements and stylistic instructions.

Training Details

The model was ORPO tuned over 3 epochs using a single RTX A6000 GPU. The training leveraged several DPO datasets curated by nbeerbower, including gutenberg-dpo-v0.1, gutenberg2-dpo, gutenberg-moderne-dpo, synthetic-fiction-dpo, Arkhaios-DPO, Purpura-DPO, and Schule-DPO. This specialized dataset focus aims to imbue the model with a strong understanding of literary structures and styles.

Use Cases

This model is particularly well-suited for:

  • Generating creative fiction, short stories, and narrative content.
  • Developing imaginative scenarios and detailed world-building.
  • Assisting writers with plot development and character interactions.
  • Applications requiring nuanced and thematically rich text generation.