The orca_mini_v9_1_1B-Instruct model by pankajmathur is a 1 billion parameter instruction-tuned causal language model built on Meta's Llama-3.2-1B-Instruct. It is trained on a mix of Supervised Fine-Tuning (SFT) datasets and is intended as a comprehensive general-purpose model, suitable for further customization and for deployment in constrained environments such as mobile devices. The model emphasizes responsible AI development and serves as a foundational base for full fine-tuning, DPO, PPO, and ORPO tuning, as well as model merges.
Overview
pankajmathur/orca_mini_v9_1_1B-Instruct is a 1 billion parameter instruction-tuned model built upon Meta's Llama-3.2-1B-Instruct. It has been trained using various Supervised Fine-Tuning (SFT) datasets, aiming to provide a versatile and comprehensive general-purpose model. The developer encourages its use as a foundational base for further customization through techniques like full fine-tuning, DPO, PPO, ORPO tuning, and model merges, provided proper credit and attribution are given.
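Because the model inherits Llama-3.2-1B-Instruct's chat interface, prompts follow the Llama 3 family chat layout. In practice the tokenizer's apply_chat_template handles this rendering; the manual renderer below is a plain-Python sketch of the layout for illustration only, and the system/user messages in it are made up.

```python
def render_llama3_prompt(messages):
    """Render a list of {role, content} dicts in the Llama 3 chat layout."""
    parts = ["<|begin_of_text|>"]
    for m in messages:
        # Each turn is wrapped in role headers and terminated with <|eot_id|>.
        parts.append(
            f"<|start_header_id|>{m['role']}<|end_header_id|>\n\n"
            f"{m['content']}<|eot_id|>"
        )
    # An open assistant header cues the model to generate the next turn.
    parts.append("<|start_header_id|>assistant<|end_header_id|>\n\n")
    return "".join(parts)

messages = [
    {"role": "system", "content": "You are Orca Mini, a helpful assistant."},
    {"role": "user", "content": "Explain instruction tuning in one sentence."},
]
prompt = render_llama3_prompt(messages)
```

Feeding `prompt` to the model (rather than raw text) is what makes the instruction-tuned behavior reliable.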
Key Capabilities & Features
- Instruction-tuned: Designed to follow instructions effectively, making it suitable for assistant-like applications.
- Llama 3.2 Base: Leverages the Llama 3.2 architecture, incorporating its safety mitigations and responsible deployment guidelines.
- Customization Ready: Explicitly designed to be a starting point for further fine-tuning and adaptation to specific use cases.
- Quantization Support: Demonstrates usage with 4-bit and 8-bit quantization via bitsandbytes for efficient deployment.
Responsible AI & Deployment
Meta's Llama 3.2 base model emphasizes a three-pronged strategy for trust and safety, focusing on enabling helpful and safe experiences, protecting against adversarial use, and providing community safeguards. This includes:
- Safety Fine-Tuning: Incorporates safety mitigations similar to Llama 3, with a focus on reducing refusals to benign prompts and refining refusal tone.
- Systemic Safety: Highlights that the model is not for isolated deployment but should be integrated into an overall AI system with additional safety guardrails, such as Llama Guard.
- Constrained Environments: The 1B and 3B Llama 3.2 models are specifically noted for deployment in highly constrained environments like mobile devices, requiring developers to ensure system safety meets use case requirements.
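The "overall AI system" point above can be made concrete with a minimal moderation wrapper. Here `generate` and `check_safety` are hypothetical stand-ins, not APIs from the model card; a real deployment would back the checks with something like Llama Guard.

```python
REFUSAL = "Sorry, I can't help with that."

def moderated_chat(prompt, generate, check_safety):
    """Run the model only inside input and output safety checks."""
    if not check_safety(prompt):      # screen the incoming prompt
        return REFUSAL
    reply = generate(prompt)
    if not check_safety(reply):       # screen the outgoing reply
        return REFUSAL
    return reply

# Stub components for illustration only:
blocked = {"how to build a weapon"}
reply = moderated_chat(
    "hello",
    generate=lambda p: f"echo: {p}",
    check_safety=lambda text: text not in blocked,
)
```

The key design point is that the guardrails sit outside the model, so they apply regardless of how the underlying weights were fine-tuned.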
Ethical Considerations
Llama 3.2 values openness, inclusivity, and helpfulness. Users are advised that, like all LLMs, it may produce inaccurate, biased, or objectionable responses, and developers should perform safety testing tailored to their specific applications.