18-Death/sq-bijection-rot13-aqua_rat

TEXT GENERATIONConcurrency Cost:1Model Size:3.1BQuant:BF16Ctx Length:32kTool Calling:SupportedPublished:Jun 16, 2026Architecture:Transformer Cold

The 18-Death/sq-bijection-rot13-aqua_rat is a 3.1 billion parameter language model fine-tuned using the TRL framework. This model is designed for general text generation tasks, leveraging its 32768 token context length to process and generate longer sequences. It is suitable for applications requiring instruction-following capabilities based on its training methodology.

Loading preview...

Model Overview

The 18-Death/sq-bijection-rot13-aqua_rat is a 3.1 billion parameter language model, fine-tuned using the TRL (Transformers Reinforcement Learning) library. This model is built for text generation, capable of processing inputs and generating responses based on its instruction-tuned training.

Key Capabilities

  • Text Generation: Designed to generate coherent and contextually relevant text based on user prompts.
  • Instruction Following: Trained with SFT (Supervised Fine-Tuning) to understand and respond to instructions.
  • Extended Context Window: Features a 32768 token context length, allowing for more extensive input processing and longer generated outputs.

Training Details

The model was fine-tuned using the TRL framework, specifically employing Supervised Fine-Tuning (SFT). The training utilized specific versions of key libraries:

  • TRL: 1.3.0
  • Transformers: 5.6.2
  • Pytorch: 2.10.0
  • Datasets: 4.8.4
  • Tokenizers: 0.22.2

Use Cases

This model is suitable for various text generation applications where a moderately sized model with a large context window is beneficial. It can be used for tasks such as:

  • Answering open-ended questions.
  • Generating creative content.
  • Assisting with conversational AI where context retention is important.