The akshayballal/Qwen3-4B-Instruct-SFT-Pubmed-16bit-DFT is a 4 billion parameter Qwen3 instruction-tuned causal language model developed by akshayballal. This model was fine-tuned using Unsloth and Huggingface's TRL library, enabling faster training. It is designed for general instruction-following tasks, leveraging its Qwen3 architecture for robust performance.
No reviews yet. Be the first to review!