che111/AlphaMed-7B-instruct-rl
TEXT GENERATIONConcurrency Cost:1Model Size:7.6BQuant:FP8Ctx Length:32kPublished:May 19, 2025License:mitArchitecture:Transformer Open Weights Cold
che111/AlphaMed-7B-instruct-rl is a 7.6 billion parameter medical large language model developed by che111. It is specifically trained for medical reasoning tasks, uniquely relying on reinforcement learning without supervised fine-tuning on chain-of-thought data. This model is designed to elicit step-by-step reasoning in complex medical scenarios, making it suitable for diagnostic support and medical question answering.
Loading preview...