L1nus/qwen3-4B-default-pubmed-labeled-5000-seq-2048

Text Generation · Model Size: 4B · Quant: BF16 · Context Length: 32k · Published: Mar 24, 2026 · License: apache-2.0 · Architecture: Transformer · Open Weights

L1nus/qwen3-4B-default-pubmed-labeled-5000-seq-2048 is a 4 billion parameter Qwen3 model developed by L1nus, fine-tuned from unsloth/qwen3-4b-instruct-2507-unsloth-bnb-4bit. It was trained with Unsloth and Hugging Face's TRL library, which reportedly enables 2x faster training, and is designed for tasks involving PubMed-labeled data, with a 32,768-token context length for processing long sequences.


Model Overview

L1nus/qwen3-4B-default-pubmed-labeled-5000-seq-2048 is a 4 billion parameter Qwen3 model developed by L1nus. It is fine-tuned from unsloth/qwen3-4b-instruct-2507-unsloth-bnb-4bit, a 4-bit quantized Unsloth build of the instruction-tuned Qwen3-4B-Instruct-2507, so the model starts from an instruction-following foundation. Training used Unsloth together with Hugging Face's TRL library, a combination that reportedly speeds up fine-tuning by about 2x. The repository name suggests fine-tuning on roughly 5,000 PubMed-labeled examples at a sequence length of 2,048 tokens, though the card itself does not document the dataset.
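To make the training setup concrete, here is a minimal sketch of an Unsloth + TRL SFT run of the kind the card describes. The dataset file, LoRA hyperparameters, and text field are hypothetical, and exact argument names vary across trl releases; treat this as an illustration under those assumptions, not the author's actual recipe.

```python
# Sketch of an Unsloth + TRL supervised fine-tune from the 4-bit instruct base.
# Dataset path, LoRA settings, and batch size are hypothetical placeholders.
from unsloth import FastLanguageModel
from trl import SFTConfig, SFTTrainer
from datasets import load_dataset

model, tokenizer = FastLanguageModel.from_pretrained(
    model_name="unsloth/qwen3-4b-instruct-2507-unsloth-bnb-4bit",
    max_seq_length=2048,   # matches the "seq-2048" suffix in the repo name
    load_in_4bit=True,
)

# Attach LoRA adapters; Unsloth patches the model for faster training.
model = FastLanguageModel.get_peft_model(
    model,
    r=16,
    lora_alpha=16,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj",
                    "gate_proj", "up_proj", "down_proj"],
)

# Hypothetical local file; the actual training data is not published.
dataset = load_dataset("json", data_files="pubmed_labeled_5000.jsonl", split="train")

trainer = SFTTrainer(
    model=model,
    tokenizer=tokenizer,  # "processing_class" in newer trl releases
    train_dataset=dataset,
    args=SFTConfig(
        dataset_text_field="text",  # assumes a pre-formatted "text" column
        max_seq_length=2048,
        per_device_train_batch_size=2,
        num_train_epochs=1,
        output_dir="outputs",
    ),
)
trainer.train()
```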

Key Characteristics

  • Architecture: Qwen3, a decoder-only transformer architecture.
  • Parameter Count: 4 billion parameters, balancing output quality against memory and compute cost (see the loading sketch after this list).
  • Training Efficiency: Fine-tuned with Unsloth, which reportedly accelerates training by about 2x, reducing training cost and turnaround time.
  • Context Length: A 32,768-token context window, suitable for long documents and extended conversations.
  • License: Apache-2.0, permitting broad use, modification, and commercial deployment.
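
Here is a minimal loading sketch in Python, assuming the repository ships standard Transformers-compatible weights (BF16 per the card's metadata):

```python
# Minimal loading sketch with Hugging Face Transformers; assumes the
# repository contains standard Transformers-compatible weights.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "L1nus/qwen3-4B-default-pubmed-labeled-5000-seq-2048"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype="bfloat16",  # BF16, per the card's metadata
    device_map="auto",       # place layers across available GPUs/CPU
)
```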

Potential Use Cases

Given its fine-tuning domain and long context window, this model is likely well suited to applications that require:

  • Processing and understanding long-form text, such as full-length biomedical articles.
  • Tasks in its fine-tuning domain, such as labeling PubMed abstracts (see the sketch after this list).
  • Scenarios where the deployment and inference efficiency of a 4B-parameter model is beneficial.
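
Continuing from the loading sketch above, the snippet below asks the model to label a PubMed abstract through the tokenizer's chat template. The prompt wording and the idea of a "topic label" are assumptions; the card does not document the label set or the instruction format used during fine-tuning.

```python
# Hypothetical usage: labeling a PubMed abstract. The prompt format is an
# assumption; the card does not document the fine-tuning instruction format.
abstract = (
    "Background: ... Methods: ... Results: ... Conclusions: ..."  # your abstract here
)
messages = [
    {"role": "user",
     "content": f"Assign a topic label to the following PubMed abstract:\n\n{abstract}"},
]
inputs = tokenizer.apply_chat_template(
    messages,
    add_generation_prompt=True,
    return_tensors="pt",
).to(model.device)

outputs = model.generate(inputs, max_new_tokens=32)
# Decode only the newly generated tokens, skipping the prompt.
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```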