18-Death/sq-walnut53-atbash-gsm8k

TEXT GENERATIONConcurrency Cost:1Model Size:3.1BQuant:BF16Ctx Length:32kTool Calling:SupportedPublished:Jun 16, 2026Architecture:Transformer Cold

The 18-Death/sq-walnut53-atbash-gsm8k is a 3.1 billion parameter language model fine-tuned by 18-Death. This model was trained using SFT with a context length of 32768 tokens. It is designed for general text generation tasks, demonstrating capabilities in conversational responses and creative text completion.

Loading preview...

Overview

The 18-Death/sq-walnut53-atbash-gsm8k is a 3.1 billion parameter language model developed by 18-Death. It has been fine-tuned using the TRL library and supports a substantial context length of 32768 tokens. This model is suitable for various text generation applications.

Key Capabilities

  • Text Generation: Capable of generating coherent and contextually relevant text based on user prompts.
  • Conversational AI: Can be used to generate responses in a conversational format, as demonstrated by the quick start example.
  • Fine-tuned Performance: Benefits from a fine-tuning process (SFT) to enhance its performance on general language tasks.

Good For

  • Prototyping: Quickly generate text for various applications.
  • Creative Writing: Assist in generating creative content or story elements.
  • Question Answering: Provide detailed answers to open-ended questions, as shown in the example.

Training Details

The model underwent a Supervised Fine-Tuning (SFT) process. The training leveraged specific versions of popular ML frameworks, including TRL 1.3.0, Transformers 5.6.2, Pytorch 2.10.0, Datasets 4.8.4, and Tokenizers 0.22.2.