laion/glm-4_6-freelancer-32ep-131k-torch

Text Generation · Concurrency Cost: 1 · Model Size: 8B · Quant: FP8 · Ctx Length: 32k · License: apache-2.0 · Architecture: Transformer · Open Weights

The laion/glm-4_6-freelancer-32ep-131k-torch model is an 8 billion parameter language model fine-tuned from Qwen/Qwen3-8B. It was trained on the penfever/glm-4.6-freelancer-32ep-131k-torch dataset for 7 epochs with a 32768-token context length and is intended for general language generation tasks, inheriting the broad applicability of the Qwen3 architecture.


Model Overview

laion/glm-4_6-freelancer-32ep-131k-torch is an 8 billion parameter language model fine-tuned by laion from Qwen/Qwen3-8B on the penfever/glm-4.6-freelancer-32ep-131k-torch dataset.
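
The model can be loaded with the standard transformers auto classes. The snippet below is a minimal sketch: the repository id comes from this card, while the dtype and device-mapping choices are illustrative assumptions (`device_map="auto"` additionally requires the accelerate package).

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "laion/glm-4_6-freelancer-32ep-131k-torch"

# Load the tokenizer and model weights from the Hub.
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype="auto",   # use the checkpoint's native precision
    device_map="auto",    # requires `accelerate`; places layers on available devices
)
```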

Training Details

The model was trained for 7 epochs with a learning rate of 4e-05 and an effective batch size of 16 (train_batch_size of 1 × gradient_accumulation_steps of 2 × 8 GPUs). The optimizer was ADAMW_TORCH_FUSED (beta and epsilon values are not listed here), paired with a cosine learning rate scheduler and a 0.1 warmup ratio. Training used Transformers 4.57.3, PyTorch 2.9.0+cu128, Datasets 4.4.1, and Tokenizers 0.22.1.
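
For reference, these hyperparameters map onto the standard transformers TrainingArguments roughly as follows. This is a reconstruction from the reported values, not the actual training script; the output directory name is a placeholder.

```python
from transformers import TrainingArguments

args = TrainingArguments(
    output_dir="glm-4_6-freelancer-32ep-131k-torch",  # placeholder
    num_train_epochs=7,
    learning_rate=4e-5,
    per_device_train_batch_size=1,   # train_batch_size
    gradient_accumulation_steps=2,   # 1 x 2 x 8 GPUs = 16 effective batch size
    lr_scheduler_type="cosine",
    warmup_ratio=0.1,
    optim="adamw_torch_fused",       # ADAMW_TORCH_FUSED
)
```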

Intended Use

Specific intended uses and limitations are not documented for this checkpoint. As a fine-tune of Qwen3-8B, it should be broadly suitable for general natural language processing tasks; developers should weigh the base model's capabilities and the characteristics of the fine-tuning dataset when evaluating it for a specific application. A typical generation call is sketched below.
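
Assuming the checkpoint inherits Qwen3's chat template (plausible for a Qwen3-8B fine-tune, but not confirmed by this card), generation can follow the usual chat-template flow; the prompt and decoding settings below are illustrative.

```python
# Continues from the loading snippet above.
messages = [{"role": "user", "content": "Summarize the trade-offs of fine-tuning an 8B model."}]

# apply_chat_template formats the conversation the way the model expects.
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

outputs = model.generate(inputs, max_new_tokens=256)

# Decode only the newly generated tokens.
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```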