Vishesh1617/Shunya-o1-8B-v2-SFT-Merged
Vishesh1617/Shunya-o1-8B-v2-SFT-Merged is an 8 billion parameter Llama 3.1 instruction-tuned model developed by Vishesh1617. It was finetuned from unsloth/meta-llama-3.1-8b-instruct-bnb-4bit using Unsloth and Huggingface's TRL library, enabling faster training. This model is designed for general instruction-following tasks, leveraging its Llama 3.1 base and efficient finetuning process.
Loading preview...
Model Overview
Vishesh1617/Shunya-o1-8B-v2-SFT-Merged is an 8 billion parameter instruction-tuned language model developed by Vishesh1617. It is based on the Llama 3.1 architecture, specifically finetuned from unsloth/meta-llama-3.1-8b-instruct-bnb-4bit.
Key Characteristics
- Base Model: Finetuned from Meta Llama 3.1 8B Instruct.
- Training Efficiency: The model was trained using Unsloth and Huggingface's TRL library, which facilitated a 2x faster finetuning process.
- License: Distributed under the Apache-2.0 license.
Use Cases
This model is suitable for a variety of instruction-following applications, benefiting from its Llama 3.1 foundation and efficient finetuning. Its 8B parameter count makes it a capable option for tasks requiring a balance of performance and computational resources.