18-Death/sq-atbash-walnut53-gsm8k

TEXT GENERATIONConcurrency Cost:1Model Size:3.1BQuant:BF16Ctx Length:32kTool Calling:SupportedPublished:Jun 16, 2026Architecture:Transformer Cold

The 18-Death/sq-atbash-walnut53-gsm8k is a 3.1 billion parameter language model fine-tuned by 18-Death using TRL. This model is designed for text generation tasks, leveraging its fine-tuned architecture to produce coherent and contextually relevant responses. With a context length of 32768 tokens, it is suitable for applications requiring processing of moderately long inputs.

Loading preview...

Model Overview

The 18-Death/sq-atbash-walnut53-gsm8k is a 3.1 billion parameter language model developed by 18-Death. It has been fine-tuned using the TRL library, which specializes in Transformer Reinforcement Learning. The training procedure involved Supervised Fine-Tuning (SFT).

Key Capabilities

  • Text Generation: The model is primarily designed for generating human-like text based on given prompts.
  • Context Handling: It supports a substantial context length of 32768 tokens, allowing it to process and generate text for moderately long inputs.

Training Details

The model was trained using the following framework versions:

  • TRL: 1.3.0
  • Transformers: 5.6.2
  • Pytorch: 2.10.0
  • Datasets: 4.8.4
  • Tokenizers: 0.22.2

Usage

This model can be easily integrated into applications using the Hugging Face transformers library, as demonstrated by the quick start example provided in its model card. It is suitable for various text-based tasks where a fine-tuned language model with a decent context window is beneficial.