laion/glm46-stackexchange-tezos-maxeps-131k

TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:32kPublished:Dec 26, 2025License:apache-2.0Architecture:Transformer Open Weights Cold

The laion/glm46-stackexchange-tezos-maxeps-131k is an 8 billion parameter language model, fine-tuned from Qwen/Qwen3-8B. This model is specifically adapted using the DCAgent2/glm46-stackexchange-tezos-maxeps-131k dataset, suggesting a specialization in content related to Tezos and Stack Exchange data. With a context length of 32768 tokens, it is designed for applications requiring deep understanding and generation within its specialized domain.

Loading preview...

Model Overview

The laion/glm46-stackexchange-tezos-maxeps-131k is an 8 billion parameter language model, fine-tuned from the robust Qwen/Qwen3-8B architecture. This specialization is achieved through training on the DCAgent2/glm46-stackexchange-tezos-maxeps-131k dataset, indicating a focus on content derived from Stack Exchange discussions, particularly those related to the Tezos blockchain.

Key Characteristics

  • Base Model: Fine-tuned from Qwen/Qwen3-8B.
  • Parameter Count: 8 billion parameters.
  • Context Length: Supports a substantial context window of 32768 tokens.
  • Specialized Training: Adapted using a dataset focused on Tezos-related Stack Exchange data, suggesting enhanced performance for queries and generation within this specific technical domain.

Training Details

The model was trained with a learning rate of 4e-05, a total batch size of 16 (achieved with train_batch_size: 1 and gradient_accumulation_steps: 2 across 8 GPUs), and utilized the AdamW optimizer with a cosine learning rate scheduler over 7 epochs. This configuration aims to optimize its performance on the specialized dataset.

Potential Use Cases

This model is likely well-suited for applications requiring detailed understanding, summarization, or generation of text related to:

  • Tezos Blockchain: Answering questions, explaining concepts, or generating content about the Tezos ecosystem.
  • Technical Q&A: Processing and generating responses based on Stack Exchange-style technical discussions, particularly within its trained domain.
  • Domain-Specific Content Creation: Assisting developers or researchers working with Tezos-related information.