grimjim/Nemo-Instruct-2407-MPOA-v3-12B
TEXT GENERATION | Concurrency Cost: 1 | Model Size: 12B | Quant: FP8 | Ctx Length: 32k | License: apache-2.0 | Architecture: Transformer | 0.0K | Open Weights | Cold
grimjim/Nemo-Instruct-2407-MPOA-v3-12B is a 12-billion-parameter instruction-tuned model with a 32,768-token context length. It applies Magnitude-Preserving Orthogonalized Ablation (MPOA) to specific layers, yielding a model optimized for varied text completion with a nuanced approach to safety refusals. The model was trained with multilingual prompts and maintains coherent English text generation.
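The listing does not describe how MPOA is computed, so the following is only a minimal sketch of one plausible reading of the name: project a single direction out of a weight matrix's output space (orthogonalized ablation), then rescale each column back to its original norm (magnitude preservation). The function name `ablate_preserving_magnitude`, the choice to preserve column norms, and the toy matrix are all assumptions made for illustration, not the model author's implementation.

```python
import math

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def norm(v):
    return math.sqrt(dot(v, v))

def ablate_preserving_magnitude(W, direction):
    """Hypothetical sketch, not the author's code.

    W is a d_out x d_in matrix (list of rows); direction has length d_out.
    Each column of W has its component along `direction` removed, then is
    rescaled so its Euclidean norm matches the original column's norm.
    """
    n = norm(direction)
    r = [x / n for x in direction]          # unit vector to ablate
    d_out, d_in = len(W), len(W[0])
    out = [[0.0] * d_in for _ in range(d_out)]
    for j in range(d_in):
        col = [W[i][j] for i in range(d_out)]
        proj = dot(r, col)
        ablated = [c - proj * ri for c, ri in zip(col, r)]
        old, new = norm(col), norm(ablated)
        # A column parallel to the direction collapses to zero; leave it zero.
        scale = old / new if new > 1e-12 else 0.0
        for i in range(d_out):
            out[i][j] = ablated[i] * scale
    return out

# Toy example (made-up numbers): a 3x2 matrix and one direction.
W_demo = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]
direction_demo = [1.0, 1.0, 0.0]
W_ablated = ablate_preserving_magnitude(W_demo, direction_demo)
```

Rescaling columns rather than rows is a deliberate choice in this sketch: scaling a column by a scalar keeps it orthogonal to the ablated direction, so the output of the modified matrix still has no component along that direction.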