amityco/amax-sigma-scratch-sft

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:4BQuant:BF16Ctx Length:32kPublished:Jan 8, 2026License:apache-2.0Architecture:Transformer Open Weights Warm

The amityco/amax-sigma-scratch-sft is a 4 billion parameter Qwen3-based causal language model, developed by amityco. This model was finetuned using Unsloth and Huggingface's TRL library, achieving 2x faster training. It features a 40960 token context length, making it suitable for tasks requiring extensive contextual understanding.

Loading preview...

amityco/amax-sigma-scratch-sft Overview

The amityco/amax-sigma-scratch-sft is a 4 billion parameter language model based on the Qwen3 architecture, developed by amityco. This model was specifically finetuned using the Unsloth library in conjunction with Huggingface's TRL library, which enabled a 2x acceleration in its training process. It supports a substantial context length of 40960 tokens, allowing it to process and generate longer sequences of text.

Key Capabilities

  • Qwen3 Architecture: Leverages the robust Qwen3 base model for strong language understanding and generation.
  • Efficient Finetuning: Benefits from finetuning with Unsloth, resulting in faster training times.
  • Extended Context Window: Features a 40960-token context length, ideal for tasks requiring deep contextual awareness.

Good For

  • Applications needing a 4B parameter model with an extended context window.
  • Projects where efficient finetuning methods are a priority.
  • Tasks that can benefit from the Qwen3 model's capabilities.