artificialguybr/Gemma2-2B-OpenHermes2.5

TEXT GENERATIONConcurrency Cost:1Model Size:2.6BQuant:BF16Ctx Length:8kPublished:Aug 16, 2024License:apache-2.0Architecture:Transformer0.0K Open Weights Cold

artificialguybr/Gemma2-2B-OpenHermes2.5 is a 2.6 billion parameter causal language model developed by artificialguybr. It is a fine-tuned version of Google's Gemma 2B, trained on the OpenHermes-2.5 dataset. This model is optimized for instruction following and general language tasks, making it suitable for text generation and question answering applications.

Loading preview...

Model Overview

artificialguybr/Gemma2-2B-OpenHermes2.5 is a 2.6 billion parameter causal language model, fine-tuned by artificialguybr. It is based on Google's Gemma 2 - 2B architecture and was trained using the teknium/OpenHermes-2.5 dataset. This model is designed for robust instruction following and general natural language processing tasks.

Key Capabilities

  • Instruction Following: Excels at understanding and executing given instructions.
  • General Language Tasks: Proficient in various language-related applications.
  • Text Generation: Capable of generating coherent and contextually relevant text.
  • Question Answering: Can be used for extracting answers from provided text or general knowledge.

Training Details

The model was fine-tuned on a single NVIDIA A100-SXM4-80GB GPU using the 🤗 Transformers and Axolotl frameworks. It utilizes an Apache-2.0 license.

Good For

  • Developers seeking a compact yet capable model for instruction-tuned applications.
  • Projects requiring text generation or question answering with a focus on general language understanding.
  • Experimentation with Gemma 2B fine-tunes on established instruction datasets like OpenHermes-2.5.