ank028/Llama-3.2-1B-Instruct-gsm8k-MGSM8K-sft1-slerp
Text Generation | Concurrency Cost: 1 | Model Size: 1B | Quant: BF16 | Ctx Length: 32k | Architecture: Transformer | Cold

The ank028/Llama-3.2-1B-Instruct-gsm8k-MGSM8K-sft1-slerp model is a 1-billion-parameter instruction-tuned language model based on the Llama 3.2 architecture. It was created by ank028 through a SLERP (spherical linear interpolation) merge of two specialized models: ank028/Llama-3.2-1B-Instruct-gsm8k and autoprogrammer/Llama-3.2-1B-Instruct-MGSM8K-sft1. The merge aims to combine the strengths of both parent models, particularly in mathematical reasoning and problem solving, making the model suited to applications that involve numerical reasoning.
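To make the idea of a SLERP merge concrete, here is a minimal sketch of spherical linear interpolation applied to two flat weight vectors. It is an illustration only: the function name `slerp`, the fallback to plain linear interpolation, and the factor `t = 0.5` are assumptions, since this page does not state which tool or interpolation factor was used to build the model.

```python
import math


def slerp(t, v0, v1, eps=1e-8):
    """Spherically interpolate between two equal-length vectors.

    t = 0 returns v0, t = 1 returns v1. Hypothetical helper for
    illustration; not the procedure actually used for this model.
    """
    # Plain linear interpolation, used as a fallback below.
    lerp = [(1.0 - t) * a + t * b for a, b in zip(v0, v1)]

    norm0 = math.sqrt(sum(a * a for a in v0))
    norm1 = math.sqrt(sum(b * b for b in v1))
    if norm0 < eps or norm1 < eps:
        return lerp  # the angle is undefined for a zero vector

    # Angle between the two vectors, with the cosine clamped for safety.
    cos_theta = sum(a * b for a, b in zip(v0, v1)) / (norm0 * norm1)
    cos_theta = max(-1.0, min(1.0, cos_theta))
    theta = math.acos(cos_theta)
    sin_theta = math.sin(theta)
    if abs(sin_theta) < eps:
        return lerp  # vectors are (anti)parallel; SLERP is ill-conditioned

    # Standard SLERP weights.
    w0 = math.sin((1.0 - t) * theta) / sin_theta
    w1 = math.sin(t * theta) / sin_theta
    return [w0 * a + w1 * b for a, b in zip(v0, v1)]


# Toy "weights" standing in for one tensor from each parent model.
merged = slerp(0.5, [1.0, 0.0], [0.0, 1.0])
```

In a real merge the same interpolation would be applied tensor by tensor across the two parent checkpoints, possibly with a different `t` per layer. Unlike simple averaging, SLERP follows the arc between the two weight directions rather than the straight line through them.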
