UICHEOL-HWANG/EcomGen-Gemma3-4B

VISIONConcurrency Cost:1Model Size:4.3BQuant:BF16Ctx Length:32kPublished:Jun 8, 2025License:apache-2.0Architecture:Transformer Open Weights Cold

EcomGen-Gemma3-4B is a 4.3 billion parameter instruction-tuned causal language model developed by UICHEOL-HWANG, fine-tuned from unsloth/gemma-3-4b-it-unsloth-bnb-4bit. Optimized for generating product descriptions, this model leverages Unsloth for faster training and supports a 32768 token context length. It is specifically designed to create varied product descriptions based on provided product details and desired tones.

Loading preview...

EcomGen-Gemma3-4B: Product Description Generation Model

EcomGen-Gemma3-4B is a 4.3 billion parameter language model developed by UICHEOL-HWANG, specifically fine-tuned for generating product descriptions. This model is built upon the Gemma-3-4B architecture and was trained using Unsloth and Huggingface's TRL library, enabling 2x faster fine-tuning.

Key Capabilities

  • Specialized Product Description Generation: Excels at creating detailed and tonally appropriate product descriptions based on structured input.
  • Contextual Understanding: Processes product name, category, price, core keywords, and desired writing tone to generate relevant output.
  • Flexible Tone Adaptation: Capable of generating descriptions in various tones, such as "신뢰감_있는_전문가_톤" (trustworthy expert tone) or "트렌디한_젊은_톤" (trendy young tone).
  • Efficient Training: Leverages Unsloth for optimized and accelerated fine-tuning processes.

Usage Notes

  • Requires disabling TorchDynamo (os.environ["TORCHDYNAMO_DISABLE"] = "1") before execution to avoid operational issues.
  • Utilizes a chat template for input formatting, allowing for clear instruction of product details and desired output style.

Good For

  • Automating the creation of diverse product descriptions for e-commerce platforms.
  • Generating marketing copy that aligns with specific product characteristics and brand tones.
  • Developers looking for a specialized model for text generation in the e-commerce domain.