18-Death/sq-bijection-atbash-sciq

TEXT GENERATIONConcurrency Cost:1Model Size:3.1BQuant:BF16Ctx Length:32kTool Calling:SupportedPublished:Jun 16, 2026Architecture:Transformer Cold

The 18-Death/sq-bijection-atbash-sciq model is a 3.1 billion parameter language model fine-tuned using the TRL framework. It is designed for text generation tasks, particularly conversational responses, with a notable context length of 32768 tokens. This model specializes in generating creative and thoughtful answers to open-ended questions.

Loading preview...

Model Overview

The 18-Death/sq-bijection-atbash-sciq is a 3.1 billion parameter language model, fine-tuned for text generation. It leverages the TRL (Transformers Reinforcement Learning) framework for its training process, building upon an unspecified base model.

Key Capabilities

  • Text Generation: Excels at generating coherent and contextually relevant text based on user prompts.
  • Conversational AI: Particularly suited for generating responses to open-ended, thought-provoking questions, as demonstrated by its quick start example.
  • Extended Context Window: Features a substantial context length of 32768 tokens, allowing it to process and generate longer sequences of text while maintaining coherence.

Training Details

The model was trained using Supervised Fine-Tuning (SFT) techniques. The development utilized specific versions of key frameworks:

  • TRL: 1.3.0
  • Transformers: 5.6.2
  • Pytorch: 2.10.0
  • Datasets: 4.8.4
  • Tokenizers: 0.22.2

When to Use This Model

This model is ideal for applications requiring creative text generation, especially for generating nuanced and detailed answers to complex or philosophical questions. Its large context window makes it suitable for tasks where understanding and maintaining context over long inputs is crucial.