18-Death/sq-vigenere-base64-gsm8k

TEXT GENERATIONConcurrency Cost:1Model Size:3.1BQuant:BF16Ctx Length:32kTool Calling:SupportedPublished:Jun 16, 2026Architecture:Transformer Cold

The 18-Death/sq-vigenere-base64-gsm8k model is a 3.1 billion parameter language model fine-tuned by 18-Death using TRL. It features a context length of 32768 tokens. This model is designed for general text generation tasks, leveraging its fine-tuned architecture to produce coherent and contextually relevant responses.

Loading preview...

Model Overview

18-Death/sq-vigenere-base64-gsm8k is a 3.1 billion parameter language model developed by 18-Death. It has been fine-tuned using the TRL (Transformers Reinforcement Learning) library, indicating a focus on optimizing its performance through supervised fine-tuning (SFT) methods.

Key Capabilities

  • Text Generation: Capable of generating human-like text based on given prompts.
  • Extended Context: Supports a substantial context length of 32768 tokens, allowing for processing and generating longer sequences of text while maintaining coherence.
  • Fine-tuned Performance: Benefits from supervised fine-tuning, which typically enhances its ability to follow instructions and generate relevant outputs for various text-based tasks.

Training Details

The model was trained using the SFT (Supervised Fine-Tuning) method, a common approach for adapting pre-trained language models to specific tasks or improving their general conversational abilities. The training utilized specific versions of key frameworks:

  • TRL: 1.3.0
  • Transformers: 5.6.2
  • Pytorch: 2.10.0
  • Datasets: 4.8.4
  • Tokenizers: 0.22.2

Good For

  • General conversational AI applications.
  • Generating creative content or responses.
  • Tasks requiring understanding and generation over long contexts.