Model Overview
laion/kimi-k2t-freelancer-32ep-32k is an 8-billion-parameter language model fine-tuned from Qwen/Qwen3-8B. It was trained on the penfever/kimi-k2t-freelancer-32ep-32k dataset, suggesting a specialization for tasks that match that dataset's domain and characteristics.
Key Characteristics
- Base Model: Fine-tuned from Qwen/Qwen3-8B.
- Parameter Count: 8 billion parameters.
- Context Length: Supports a context window of 32,768 tokens, enabling processing and generation of long text sequences.
- Training Data: Specialized training on the penfever/kimi-k2t-freelancer-32ep-32k dataset.
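A minimal loading sketch with the Hugging Face Transformers library is shown below. It is illustrative only: the repo id is taken from this card, and the dtype, device placement, and example prompt are assumptions rather than published usage instructions.

```python
# Minimal sketch: load the model and run a short generation with Transformers.
# The repo id comes from this card; bf16 and device_map="auto" are assumptions.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "laion/kimi-k2t-freelancer-32ep-32k"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # assumption: bf16 fits the 8B weights on one modern GPU
    device_map="auto",
)

prompt = "Draft a short project proposal for a freelance data-cleaning job."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)

# max_new_tokens is illustrative; prompt and output together must stay
# within the 32,768-token context window.
outputs = model.generate(**inputs, max_new_tokens=256)
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:], skip_special_tokens=True))
```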
Training Details
The model was trained with a learning rate of 4e-05, using a cosine learning-rate scheduler with a warmup ratio of 0.1. Training ran for 7 epochs with a total batch size of 16 (a train_batch_size of 1 and gradient_accumulation_steps of 2 across 8 devices) and the fused AdamW optimizer (adamw_torch_fused). The training environment used Transformers 4.56.1, PyTorch 2.9.1+cu128, Datasets 4.4.1, and Tokenizers 0.22.1.
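For reference, the reported hyperparameters map onto a Transformers TrainingArguments configuration roughly as sketched below. This is a reconstruction from the card, not the original training script; the output directory and mixed-precision setting are assumptions, and dataset and trainer setup are omitted.

```python
# Reconstruction of the reported hyperparameters as a TrainingArguments config.
# Not the original training script; output_dir and bf16 are assumptions.
from transformers import TrainingArguments

args = TrainingArguments(
    output_dir="kimi-k2t-freelancer-32ep-32k",  # hypothetical
    learning_rate=4e-05,
    lr_scheduler_type="cosine",
    warmup_ratio=0.1,
    num_train_epochs=7,
    per_device_train_batch_size=1,  # 1 per device...
    gradient_accumulation_steps=2,  # ...x2 accumulation, x8 devices = total batch size 16
    optim="adamw_torch_fused",      # the fused AdamW optimizer named on this card
    bf16=True,                      # assumption: precision is not stated on the card
)
```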
Potential Use Cases
Given its fine-tuning on a specific dataset, this model is likely best suited for applications that match the domain of the penfever/kimi-k2t-freelancer-32ep-32k data. Its 32,768-token context window also makes it a fit for tasks that require comprehending or generating long-form content in a single pass, as sketched below.
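As a sketch of that long-form use case, the snippet below checks that an entire document fits in the 32,768-token window, leaving room for the answer, before a single-pass generation. The file name and token budget are hypothetical.

```python
# Illustrative long-form workflow: verify a full document fits the context
# window alongside a generation budget, instead of chunking it.
from transformers import AutoTokenizer

MAX_CONTEXT = 32_768
GEN_BUDGET = 1_024  # hypothetical number of tokens reserved for the answer

tokenizer = AutoTokenizer.from_pretrained("laion/kimi-k2t-freelancer-32ep-32k")

with open("long_report.txt") as f:  # hypothetical input document
    document = f.read()

n_tokens = len(tokenizer(document)["input_ids"])
if n_tokens > MAX_CONTEXT - GEN_BUDGET:
    raise ValueError(f"{n_tokens} prompt tokens leave no room for a {GEN_BUDGET}-token answer")
```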