Name: pankajmathur/orca_mini_v9_5_3B-Instruct API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: pankajmathur

Model Overview

The pankajmathur/orca_mini_v9_5_3B-Instruct is a 3 billion parameter instruction-tuned model built upon the Llama-3.2-3B-Instruct base. It has been trained using various Supervised Fine-Tuning (SFT) datasets by pankajmathur. This model is intended to serve as a versatile general-purpose foundation, encouraging developers to use it as a starting point for further customization through techniques like Full Fine-Tuning, DPO, PPO, ORPO, or model merging.

Key Capabilities

Instruction Following: Designed to respond effectively to user instructions, leveraging its SFT training.
Foundation for Customization: Explicitly built to be a base model for advanced tuning methods (DPO, PPO, ORPO) and merges.
Resource-Efficient Deployment: The 3B parameter size makes it suitable for deployment in constrained environments, including mobile devices, with support for 4-bit and 8-bit quantization.

Responsible AI & Safety

This model inherits the responsible AI practices and safety mitigations from its Llama 3.2 base, which includes a focus on managing trust and safety risks, protecting against adversarial use, and providing community safeguards. Developers are encouraged to implement additional system-level safeguards for specific use cases.

Good For

Developers looking for a compact, instruction-tuned base model for further fine-tuning.
Applications requiring a general-purpose conversational AI that can be specialized.
Deployment in environments with limited computational resources, leveraging quantization options.

Overview

Model Overview

Key Capabilities

Responsible AI & Safety

Good For

Full Model Card (README)