artificialguybr/Gemma2-2B-OpenHermes2.5
artificialguybr/Gemma2-2B-OpenHermes2.5 is a 2.6 billion parameter causal language model developed by artificialguybr. It is a fine-tuned version of Google's Gemma 2B, trained on the OpenHermes-2.5 dataset. This model is optimized for instruction following and general language tasks, making it suitable for text generation and question answering applications.
Loading preview...
Model Overview
artificialguybr/Gemma2-2B-OpenHermes2.5 is a 2.6 billion parameter causal language model, fine-tuned by artificialguybr. It is based on Google's Gemma 2 - 2B architecture and was trained using the teknium/OpenHermes-2.5 dataset. This model is designed for robust instruction following and general natural language processing tasks.
Key Capabilities
- Instruction Following: Excels at understanding and executing given instructions.
- General Language Tasks: Proficient in various language-related applications.
- Text Generation: Capable of generating coherent and contextually relevant text.
- Question Answering: Can be used for extracting answers from provided text or general knowledge.
Training Details
The model was fine-tuned on a single NVIDIA A100-SXM4-80GB GPU using the 🤗 Transformers and Axolotl frameworks. It utilizes an Apache-2.0 license.
Good For
- Developers seeking a compact yet capable model for instruction-tuned applications.
- Projects requiring text generation or question answering with a focus on general language understanding.
- Experimentation with Gemma 2B fine-tunes on established instruction datasets like OpenHermes-2.5.