18-Death/sq-bijection-base64-aqua_rat

TEXT GENERATIONConcurrency Cost:1Model Size:3.1BQuant:BF16Ctx Length:32kTool Calling:SupportedPublished:Jun 16, 2026Architecture:Transformer Cold

The 18-Death/sq-bijection-base64-aqua_rat is a 3.1 billion parameter fine-tuned causal language model, developed by 18-Death. This model was trained using the TRL library, indicating a focus on reinforcement learning from human feedback or similar fine-tuning methods. With a context length of 32768 tokens, it is designed for generating coherent and extended text responses. Its primary strength lies in its fine-tuned nature for conversational or instruction-following tasks.

Loading preview...

Model Overview

The 18-Death/sq-bijection-base64-aqua_rat is a 3.1 billion parameter language model, fine-tuned by 18-Death. It leverages the TRL (Transformers Reinforcement Learning) library for its training procedure, suggesting an optimization for instruction-following or conversational capabilities through methods like SFT (Supervised Fine-tuning).

Key Capabilities

  • Instruction Following: Fine-tuned using SFT, making it suitable for responding to specific prompts and instructions.
  • Extended Context: Supports a context length of 32768 tokens, allowing for processing and generating longer sequences of text.
  • Text Generation: Capable of generating human-like text based on given prompts, as demonstrated by the quick start example.

Training Details

The model was trained using Supervised Fine-tuning (SFT) with the following framework versions:

  • TRL: 1.3.0
  • Transformers: 5.6.2
  • Pytorch: 2.10.0
  • Datasets: 4.8.4
  • Tokenizers: 0.22.2

Use Cases

This model is well-suited for applications requiring:

  • Conversational AI: Engaging in dialogue and responding to user queries.
  • Content Generation: Creating various forms of text content based on prompts.
  • Instruction-based Tasks: Executing tasks where clear instructions are provided.