mergekit-community/nsfw-w-deepseek-r1-retry

8B
FP8
32768
Jan 21, 2025
Overview

nsfw-w-deepseek-r1-retry is a merged language model published by mergekit-community. It was produced with the Model Stock merge method, as described in the Model Stock paper, using mergekit-community/nsfw_merge_test_vFFS as the base model.
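
For readers unfamiliar with Model Stock, the following is a minimal sketch of the per-layer interpolation rule from the paper, written in plain Python over flat lists of floats. It is an illustration under simplifying assumptions, not the code used to build this model: the function name `model_stock_layer` is hypothetical, the angle is estimated as the average pairwise cosine between task vectors, and real implementations operate on tensors and may handle edge cases differently.

```python
import math


def model_stock_layer(base, finetuned):
    """Merge one layer's weights with the Model Stock interpolation rule.

    base:      flat list of floats (the base model's weights for this layer)
    finetuned: list of flat lists, one per fine-tuned model (at least two)
    """
    n = len(finetuned)
    if n < 2:
        raise ValueError("Model Stock needs at least two fine-tuned models")

    # Task vectors: offset of each fine-tuned model from the base weights.
    deltas = [[w - b for w, b in zip(ft, base)] for ft in finetuned]

    def cosine(u, v):
        dot = sum(a * b for a, b in zip(u, v))
        norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
        return dot / norm if norm > 0 else 0.0

    # Assumption: estimate cos(theta) as the mean over all pairs of task vectors.
    pairs = [(i, j) for i in range(n) for j in range(i + 1, n)]
    cos_theta = sum(cosine(deltas[i], deltas[j]) for i, j in pairs) / len(pairs)

    # Interpolation ratio: t = N*cos / (1 + (N - 1)*cos).
    t = n * cos_theta / (1 + (n - 1) * cos_theta)

    # Move from the base weights toward the average of the fine-tuned weights.
    avg = [sum(ft[k] for ft in finetuned) / n for k in range(len(base))]
    return [t * a + (1 - t) * b for a, b in zip(avg, base)]
```

When the task vectors agree (cosine near 1), the result approaches the plain average of the fine-tuned weights; when they are orthogonal (cosine near 0), the result stays at the base weights. The sketch does not guard against the degenerate case where the denominator is zero.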

Key Capabilities

The merge combines several specialized LoRAs with stepenZEN/DeepSeek-R1-Distill-Llama-8B-Abliterated. It targets two capabilities:

  • Long-form narrative generation: The inclusion of Azazelle/Llama-3-LongStory-LORA suggests an emphasis on extended text generation.
  • Instruction following: The inclusion of moetezsa/Llama3_instruct_on_wikibio and Azazelle/Llama-3-LimaRP-Instruct-LoRA-8B indicates a focus on instruction following and role-play scenarios.
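
The components listed above could be expressed as a mergekit-style configuration. The sketch below builds one as a Python dictionary. The exact layout is an assumption and not the published recipe: it assumes each LoRA is applied to the DeepSeek-R1 distill using mergekit's `model+lora` path notation, the `dtype` value is a guess, and `build_merge_config` is a hypothetical helper.

```python
import json

# Names taken from the model description.
BASE = "mergekit-community/nsfw_merge_test_vFFS"
BACKBONE = "stepenZEN/DeepSeek-R1-Distill-Llama-8B-Abliterated"
LORAS = [
    "Azazelle/Llama-3-LongStory-LORA",
    "moetezsa/Llama3_instruct_on_wikibio",
    "Azazelle/Llama-3-LimaRP-Instruct-LoRA-8B",
]


def build_merge_config(base, backbone, loras, dtype="bfloat16"):
    """Hypothetical helper: assemble a Model Stock merge config as a dict."""
    return {
        "merge_method": "model_stock",
        "base_model": base,
        # Assumption: one entry per LoRA, each applied on top of the backbone.
        "models": [{"model": f"{backbone}+{lora}"} for lora in loras],
        "dtype": dtype,  # assumed; not stated in the description
    }


config = build_merge_config(BASE, BACKBONE, LORAS)
print(json.dumps(config, indent=2))
```

Serializing the same dictionary to YAML would give a file in the format mergekit reads; the published model card should be treated as the authoritative source for the actual configuration.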

Good for

This model is particularly well-suited for applications requiring:

  • Generating detailed and coherent long-form text.
  • Following complex instructions in conversational or creative contexts.
  • Engaging in role-play or interactive narrative generation, drawing on the combined strengths of its Llama-3-based LoRA components.