lllqaq/Qwen3-8B-fim-v2v3pt

TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:32kPublished:Mar 29, 2026License:otherArchitecture:Transformer Cold

The lllqaq/Qwen3-8B-fim-v2v3pt is an 8 billion parameter language model, fine-tuned from the Qwen/Qwen3-8B architecture. This model specializes in fill-in-the-middle (FIM) tasks, having been trained on the fim_midtrain_v2, fim_midtrain_v3_pairs, and fim_midtrain_v3_triples datasets. With a 32768 token context length, it is designed for applications requiring code completion or text infilling capabilities.

Loading preview...

Overview

The lllqaq/Qwen3-8B-fim-v2v3pt is an 8 billion parameter language model derived from the Qwen3-8B base architecture. Its primary distinction lies in its specialized fine-tuning for fill-in-the-middle (FIM) tasks, making it particularly adept at predicting missing code or text segments within a given context.

Key Capabilities

  • Fill-in-the-Middle (FIM): Specifically trained on FIM datasets (fim_midtrain_v2, fim_midtrain_v3_pairs, fim_midtrain_v3_triples) to excel at code completion and text infilling.
  • Base Model: Built upon the robust Qwen3-8B foundation, suggesting strong general language understanding prior to FIM specialization.
  • Context Length: Supports a substantial context window of 32768 tokens, allowing for processing and generating FIM completions within large codebases or extensive text documents.

Training Details

The model was trained with a learning rate of 1e-05, using a total batch size of 96 (achieved with 6 GPUs and 16 gradient accumulation steps) over 1 epoch. The optimizer used was ADAMW_TORCH with cosine learning rate scheduling and a 0.1 warmup ratio.

When to Use This Model

This model is ideal for scenarios requiring:

  • Code Completion: Assisting developers by suggesting missing code snippets or entire functions.
  • Text Infilling: Completing sentences or paragraphs where parts have been omitted.
  • Contextual Generation: Generating text that seamlessly fits into existing content, leveraging its FIM capabilities.