akshayballal/Qwen2.5-1.5B-Instruct-SFT-Pubmed-16bit-DFT

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:1.5BQuant:BF16Ctx Length:32kPublished:Jan 10, 2026License:apache-2.0Architecture:Transformer Open Weights Warm

akshayballal/Qwen2.5-1.5B-Instruct-SFT-Pubmed-16bit-DFT is a 1.5 billion parameter Qwen2.5-based instruction-tuned language model developed by akshayballal. This model was fine-tuned using Unsloth and Huggingface's TRL library, enabling faster training. It is specifically optimized for tasks related to the Pubmed dataset, making it suitable for biomedical and scientific text processing. The model supports a substantial context length of 131072 tokens.

Loading preview...

Model Overview

The akshayballal/Qwen2.5-1.5B-Instruct-SFT-Pubmed-16bit-DFT is a 1.5 billion parameter instruction-tuned language model based on the Qwen2.5 architecture. Developed by akshayballal, this model was fine-tuned from unsloth/qwen2.5-1.5b-instruct-unsloth-bnb-4bit using a combination of Unsloth and Huggingface's TRL library. This approach facilitated a 2x faster training process.

Key Characteristics

  • Architecture: Qwen2.5-based, a robust foundation for language understanding and generation.
  • Parameter Count: 1.5 billion parameters, offering a balance between performance and computational efficiency.
  • Training Efficiency: Leverages Unsloth for accelerated fine-tuning, reducing training time significantly.
  • Context Length: Supports an extensive context window of 131072 tokens, allowing for processing of long documents and complex queries.
  • Specialization: Fine-tuned on the Pubmed dataset, indicating a strong focus on biomedical and scientific literature.

Ideal Use Cases

This model is particularly well-suited for applications requiring deep understanding and generation within the scientific and medical domains.

  • Biomedical Text Analysis: Tasks such as information extraction, summarization, and question answering from scientific papers and medical records.
  • Research Assistance: Aiding researchers in navigating and synthesizing information from large volumes of academic literature.
  • Domain-Specific Instruction Following: Responding to instructions and queries tailored to scientific and medical contexts.