Overview
Model Overview
bennett-tan/phi-4-mini-instruct-merged is a 3.8-billion-parameter instruction-tuned language model developed by bennett-tan. It is a fine-tuned model built on the phi3 architecture.
Key Characteristics
- Architecture: Based on the phi3 model family.
- Parameter Count: 3.8 billion parameters.
- Training Efficiency: The model was fine-tuned with Unsloth and Hugging Face's TRL library, enabling roughly 2x faster training than standard fine-tuning (see the sketch after this list).
- Context Length: Supports a context length of 131,072 tokens.
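As a rough illustration of that workflow, the sketch below shows a typical Unsloth + TRL supervised fine-tuning loop. The base checkpoint name, LoRA settings, sequence length, hyperparameters, and toy dataset are all illustrative assumptions, not the configuration actually used to produce this model.

```python
# Hedged sketch of an Unsloth + TRL fine-tuning run; all names and settings
# below are assumptions for illustration only.
from datasets import Dataset
from unsloth import FastLanguageModel
from trl import SFTTrainer, SFTConfig

# Load a base model through Unsloth's fast loader (assumed base checkpoint).
model, tokenizer = FastLanguageModel.from_pretrained(
    model_name="unsloth/Phi-4-mini-instruct",
    max_seq_length=4096,
    load_in_4bit=True,
)

# Attach LoRA adapters so only a small fraction of weights is updated.
model = FastLanguageModel.get_peft_model(
    model,
    r=16,
    lora_alpha=16,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj",
                    "gate_proj", "up_proj", "down_proj"],
)

# Toy dataset with a single "text" column; replace with real instruction data.
dataset = Dataset.from_dict({
    "text": [
        "### Instruction:\nSay hello.\n\n### Response:\nHello!",
        "### Instruction:\nName a color.\n\n### Response:\nBlue.",
    ]
})

trainer = SFTTrainer(
    model=model,
    tokenizer=tokenizer,
    train_dataset=dataset,
    args=SFTConfig(
        dataset_text_field="text",
        per_device_train_batch_size=2,
        gradient_accumulation_steps=4,
        max_steps=10,
        learning_rate=2e-4,
        output_dir="outputs",
    ),
)
trainer.train()
```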
Use Cases
This model is suited to instruction-following applications that call for a compact yet capable language model. Its efficient fine-tuning workflow also makes it a reasonable choice when fast iteration and deployment are priorities.
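For example, the model can be loaded for inference with the Hugging Face Transformers library. The following is a minimal sketch that assumes the weights are published under the repository id above and that your installed Transformers version supports the phi3 architecture; the prompt and generation settings are placeholders.

```python
# Minimal inference sketch; adjust dtype and device placement to your hardware.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "bennett-tan/phi-4-mini-instruct-merged"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,
    device_map="auto",
)

# Example chat-style prompt (placeholder content).
messages = [
    {"role": "user", "content": "Summarize the benefits of small instruction-tuned models."},
]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

outputs = model.generate(inputs, max_new_tokens=256, do_sample=False)
# Decode only the newly generated tokens, skipping the prompt.
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```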