18-Death/sq-atbash-base64-strategyqa

TEXT GENERATIONConcurrency Cost:1Model Size:3.1BQuant:BF16Ctx Length:32kTool Calling:SupportedPublished:Jun 16, 2026Architecture:Transformer Cold

The 18-Death/sq-atbash-base64-strategyqa model is a 3.1 billion parameter language model fine-tuned by 18-Death. This model was trained using SFT with the TRL framework. It features a context length of 32768 tokens, making it suitable for tasks requiring extensive contextual understanding. Its fine-tuning process suggests an optimization for specific question-answering or strategic reasoning tasks.

Loading preview...

Model Overview

The 18-Death/sq-atbash-base64-strategyqa is a 3.1 billion parameter language model developed by 18-Death. It has been fine-tuned using Supervised Fine-Tuning (SFT) with the TRL (Transformers Reinforcement Learning) framework. This model is designed to handle tasks that benefit from its substantial 32768-token context length.

Key Capabilities

  • Extensive Context Handling: With a 32768-token context window, the model can process and generate responses based on large amounts of input text, which is beneficial for complex reasoning or long-form content generation.
  • Fine-tuned Performance: The SFT training indicates a specialization for particular tasks, likely in question-answering or strategic reasoning, as suggested by its name.
  • TRL Framework: Built upon the TRL framework, it leverages advanced techniques for training transformer models.

Training Details

The model was trained using SFT, with specific framework versions:

  • TRL: 1.3.0
  • Transformers: 5.6.2
  • Pytorch: 2.10.0
  • Datasets: 4.8.4
  • Tokenizers: 0.22.2

Good For

  • Applications requiring processing and understanding of long documents or conversations.
  • Tasks that benefit from a model fine-tuned for specific strategic question-answering or reasoning patterns.
  • Developers looking for a moderately sized model with a large context window for specialized NLP tasks.