qbz506/nyaya-llama-3b-stage0-full
The qbz506/nyaya-llama-3b-stage0-full model is a 3 billion parameter language model based on the unsloth/llama-3.2-3b-instruct architecture, fine-tuned for structured 6-phase Nyaya reasoning. This model is optimized for logic-style problems, emphasizing format adherence over open-ended creativity. It provides full merged weights, eliminating the need for a LoRA adapter during inference, and is available in both Safetensors and quantized GGUF formats.
Loading preview...
Model Overview
The qbz506/nyaya-llama-3b-stage0-full is a 3 billion parameter model derived from the unsloth/llama-3.2-3b-instruct base. It features full merged weights, meaning it does not require a LoRA adapter for inference, simplifying deployment. The model is specifically fine-tuned for structured 6-phase Nyaya reasoning, making it a research-grade tool for logic-style problems.
Key Capabilities & Features
- Nyaya Reasoning Engine: Designed to follow a precise 6-phase Nyaya reasoning structure, including Samshaya, Pramana, Pancha Avayava, Tarka, Hetvabhasa, and Nirnaya.
- Format Adherence: Optimized for strict adherence to a predefined output format, making it suitable for tasks requiring structured responses.
- Deployment Flexibility: Provided in both standard Safetensors format for Hugging Face Transformers and a quantized GGUF format for Ollama, facilitating local deployment.
- Research-Oriented: Intended for research in epistemic reasoning, as detailed in the associated paper, "Pramana: Fine-Tuning Large Language Models for Epistemic Reasoning through Navya-Nyaya" (arXiv:2604.04937).
Intended Use & Limitations
This model is best suited for use cases requiring structured, logic-based reasoning with a strong emphasis on output format. It excels when provided with exact Nyaya section headers in the prompt. Users should be aware that responses may be verbose due to the strict format, and the model has not been evaluated for safety-critical domains. Its strength lies in its specialized reasoning capabilities rather than open-ended creative generation.