modrill/lingcoder_shortcot_merged_fixed200k_4k_qwen3_4b_instruct2507

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:4BQuant:BF16Ctx Length:32kPublished:May 20, 2026License:cc-by-nc-4.0Architecture:Transformer Open Weights Warm

The modrill/lingcoder_shortcot_merged_fixed200k_4k_qwen3_4b_instruct2507 model is a 4 billion parameter instruction-tuned language model. It is based on the Qwen3 architecture and features a 32,768 token context length. This model is specifically fine-tuned for code-related tasks, leveraging a merged dataset including 'lingcoder' and 'shortcot' for enhanced performance in code generation and understanding.

Loading preview...

Overview

The modrill/lingcoder_shortcot_merged_fixed200k_4k_qwen3_4b_instruct2507 is a 4 billion parameter instruction-tuned language model built upon the Qwen3 architecture. It boasts a substantial context window of 32,768 tokens, enabling it to process and generate longer sequences of text and code. This model has undergone specific fine-tuning using a merged dataset, which includes 'lingcoder' and 'shortcot' components, along with a fixed 200k data points and a 4k context length during training, indicating an optimization for code-centric applications.

Key Capabilities

  • Code Generation: Optimized for generating various programming language constructs.
  • Code Understanding: Capable of interpreting and reasoning about code snippets.
  • Instruction Following: Designed to respond accurately to user instructions, particularly in technical and coding contexts.
  • Extended Context: The 32,768 token context window supports complex coding tasks and multi-turn conversations.

Good For

  • Developers seeking an instruction-tuned model for code-related tasks.
  • Applications requiring robust code generation and completion.
  • Scenarios benefiting from a large context window for handling extensive codebases or detailed technical prompts.