18-Death/sq-base64-rot13-sciq

TEXT GENERATIONConcurrency Cost:1Model Size:3.1BQuant:BF16Ctx Length:32kTool Calling:SupportedPublished:Jun 16, 2026Architecture:Transformer Cold

The 18-Death/sq-base64-rot13-sciq is a 3.1 billion parameter causal language model with a 32768 token context length, fine-tuned using the TRL framework. This model is designed for text generation tasks, having been trained with Supervised Fine-Tuning (SFT). Its primary application is generating responses to user prompts, as demonstrated by its text-generation pipeline.

Loading preview...

Model Overview

The 18-Death/sq-base64-rot13-sciq is a 3.1 billion parameter language model, fine-tuned for text generation. It leverages a substantial context window of 32768 tokens, allowing it to process and generate longer, more coherent sequences of text. The model was developed using the TRL (Transformers Reinforcement Learning) framework, indicating a focus on supervised fine-tuning (SFT) as its training methodology.

Key Capabilities

  • Text Generation: The model is primarily designed for generating human-like text based on given prompts.
  • Long Context Understanding: With a 32768 token context length, it can handle extensive input and maintain context over longer conversations or documents.
  • SFT Training: Trained with Supervised Fine-Tuning, suggesting a focus on learning from high-quality, labeled data to improve response quality.

Training Details

The model's training utilized the following framework versions:

  • TRL: 1.3.0
  • Transformers: 5.6.2
  • Pytorch: 2.10.0
  • Datasets: 4.8.4
  • Tokenizers: 0.22.2

Good For

  • Applications requiring conversational AI or prompt-based text completion.
  • Scenarios where maintaining context over long interactions is crucial.
  • Developers looking for a fine-tuned model for general text generation tasks.