RISys-Lab/RedSage-Qwen3-8B-CFW
TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:32kPublished:Oct 20, 2025Architecture:Transformer Cold

RedSage-Qwen3-8B-CFW is an 8 billion parameter large language model developed by RISys-Lab, continually pre-trained on the CyberFineWeb corpus, a specialized dataset of 11.7 billion cybersecurity tokens. This base model, built upon Qwen3-8B-Base with a 32768 token context length, is optimized for cybersecurity text completion and generation, demonstrating improved performance on cybersecurity benchmarks while retaining general reasoning capabilities through data replay. It is primarily intended for further fine-tuning on downstream cybersecurity tasks and research into domain adaptation.

Loading preview...