18-Death/sq-rot13-walnut53-gsm8k

TEXT GENERATIONConcurrency Cost:1Model Size:3.1BQuant:BF16Ctx Length:32kTool Calling:SupportedPublished:Jun 16, 2026Architecture:Transformer Cold

The 18-Death/sq-rot13-walnut53-gsm8k is a 3.1 billion parameter language model fine-tuned using the TRL library. This model is based on an unspecified architecture and has a context length of 32768 tokens. It was trained using Supervised Fine-Tuning (SFT) and is suitable for general text generation tasks.

Loading preview...

Model Overview

The 18-Death/sq-rot13-walnut53-gsm8k is a 3.1 billion parameter language model that has been fine-tuned using the TRL (Transformers Reinforcement Learning) library. This model was developed by 18-Death and utilizes a context length of 32768 tokens, making it capable of processing relatively long inputs.

Training Details

The model underwent Supervised Fine-Tuning (SFT) as its primary training procedure. The development leveraged several key frameworks:

  • TRL: 1.3.0
  • Transformers: 5.6.2
  • Pytorch: 2.10.0
  • Datasets: 4.8.4
  • Tokenizers: 0.22.2

Key Capabilities

  • Text Generation: Capable of generating coherent and contextually relevant text based on user prompts.
  • Long Context Handling: Supports a substantial context window of 32768 tokens, allowing for more extensive conversational or document-based interactions.

Recommended Use Cases

This model is suitable for various text generation applications where a medium-sized model with a large context window is beneficial. It can be used for tasks such as:

  • Answering open-ended questions.
  • Creative writing prompts.
  • General conversational AI.
  • Content creation requiring longer input understanding.