18-Death/sq-base64-walnut53-aqua_rat

TEXT GENERATIONConcurrency Cost:1Model Size:3.1BQuant:BF16Ctx Length:32kTool Calling:SupportedPublished:Jun 16, 2026Architecture:Transformer Cold

The 18-Death/sq-base64-walnut53-aqua_rat is a 3.1 billion parameter causal language model, fine-tuned using the TRL framework. This model is designed for text generation tasks, leveraging its 32768-token context length to process and generate coherent responses. Its training methodology focuses on supervised fine-tuning (SFT) to enhance its conversational and generative capabilities.

Loading preview...

Model Overview

The 18-Death/sq-base64-walnut53-aqua_rat is a 3.1 billion parameter language model, fine-tuned for text generation. It was developed by 18-Death and trained using the TRL (Transformers Reinforcement Learning) framework, specifically employing Supervised Fine-Tuning (SFT).

Key Characteristics

  • Parameter Count: 3.1 billion parameters, offering a balance between performance and computational efficiency.
  • Context Length: Features a substantial 32768-token context window, enabling it to handle longer inputs and generate more contextually relevant outputs.
  • Training Method: Utilizes Supervised Fine-Tuning (SFT) for its training, which typically results in models well-suited for instruction-following and conversational tasks.

Intended Use Cases

This model is suitable for various text generation applications, particularly those requiring coherent and context-aware responses. Its fine-tuning approach suggests proficiency in tasks such as:

  • Answering open-ended questions.
  • Generating creative text formats.
  • Engaging in conversational AI scenarios.

Technical Details

The model was developed with specific framework versions:

  • TRL: 1.3.0
  • Transformers: 5.6.2
  • Pytorch: 2.10.0
  • Datasets: 4.8.4
  • Tokenizers: 0.22.2