mobiuslabsgmbh/DeepSeek-R1-ReDistill-Qwen-7B-v1.1
TEXT GENERATIONConcurrency Cost:1Model Size:7.6BQuant:FP8Ctx Length:32kPublished:Jan 27, 2025License:mitArchitecture:Transformer0.0K Open Weights Warm
The mobiuslabsgmbh/DeepSeek-R1-ReDistill-Qwen-7B-v1.1 is a 7.6 billion parameter language model, developed by mobiuslabsgmbh, based on the DeepSeek-R1-Distill-Qwen-7B architecture. This version is re-distilled to enhance performance across various benchmarks, particularly in reasoning and factual recall tasks. It features a substantial 131,072 token context length, making it suitable for applications requiring extensive context processing and improved accuracy in complex queries.
Loading preview...