Vishesh1617/Shunya-o1-8B-v2-SFT-Merged

TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:32kPublished:Mar 7, 2026License:apache-2.0Architecture:Transformer0.0K Open Weights Cold

Vishesh1617/Shunya-o1-8B-v2-SFT-Merged is an 8 billion parameter Llama 3.1 instruction-tuned model developed by Vishesh1617. It was finetuned from unsloth/meta-llama-3.1-8b-instruct-bnb-4bit using Unsloth and Huggingface's TRL library, enabling faster training. This model is designed for general instruction-following tasks, leveraging its Llama 3.1 base and efficient finetuning process.

Loading preview...

Model Overview

Vishesh1617/Shunya-o1-8B-v2-SFT-Merged is an 8 billion parameter instruction-tuned language model developed by Vishesh1617. It is based on the Llama 3.1 architecture, specifically finetuned from unsloth/meta-llama-3.1-8b-instruct-bnb-4bit.

Key Characteristics

  • Base Model: Finetuned from Meta Llama 3.1 8B Instruct.
  • Training Efficiency: The model was trained using Unsloth and Huggingface's TRL library, which facilitated a 2x faster finetuning process.
  • License: Distributed under the Apache-2.0 license.

Use Cases

This model is suitable for a variety of instruction-following applications, benefiting from its Llama 3.1 foundation and efficient finetuning. Its 8B parameter count makes it a capable option for tasks requiring a balance of performance and computational resources.