netcat420/MFANN3b
TEXT GENERATIONConcurrency Cost:1Model Size:3BQuant:BF16Ctx Length:2kPublished:Dec 13, 2024License:mitArchitecture:Transformer Open Weights Cold

netcat420/MFANN3b is a 3 billion parameter causal language model based on the Phi-2 architecture, developed by netcat420. This model is fine-tuned using a modified Alpaca training regimen that incorporates a defined "thought-process" in its dataset, enabling it to generate reasoning tokens before producing its final output. MFANN3b is specifically designed for tasks requiring explicit reasoning and a structured thought-process, making it suitable for applications where intermediate reasoning steps are beneficial.

Loading preview...