ThaiLLM/ThaiLLM-8B-SFT-IQ
Text Generation | Concurrency Cost: 1 | Model Size: 8B | Quant: FP8 | Ctx Length: 32k | Published: Jan 16, 2026 | Architecture: Transformer

ThaiLLM/ThaiLLM-8B-SFT-IQ (Medical) is an 8-billion-parameter, decoder-only causal language model developed by ThaiLLM and specialized for Thai-language medical information queries. Fine-tuned from ThaiLLM-8B-SFT, it is designed for Retrieval-Augmented Generation (RAG) workflows and targets citation-grounded question answering in medical contexts. The model generates answers strictly from the medical documents it is given and returns explicit citations, with significantly higher citation accuracy than its base model.
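The citation-grounded RAG pattern described above can be sketched in plain Python. This is only an illustration: the prompt wording, the `[n]` citation style, and the helper names `build_rag_prompt` and `extract_citations` are assumptions made for this example, not the model's documented prompt template or output format. Consult the model's own documentation for the format it was trained on.

```python
import re


def build_rag_prompt(question, documents):
    """Number each retrieved document and ask for an answer grounded only in them.

    The instruction text and [n] numbering are hypothetical, chosen for illustration.
    """
    numbered = "\n\n".join(
        f"[{i}] {doc}" for i, doc in enumerate(documents, start=1)
    )
    return (
        "Answer the question using only the documents below. "
        "Cite supporting documents by number, e.g. [1].\n\n"
        f"Documents:\n{numbered}\n\n"
        f"Question: {question}\nAnswer:"
    )


def extract_citations(answer, num_documents):
    """Return sorted, de-duplicated citation numbers that refer to a supplied document."""
    cited = {int(m) for m in re.findall(r"\[(\d+)\]", answer)}
    # Discard numbers that do not correspond to any provided document.
    return sorted(n for n in cited if 1 <= n <= num_documents)


if __name__ == "__main__":
    # Placeholder documents; in practice these come from a retriever.
    docs = [
        "Document A: placeholder medical passage.",
        "Document B: another placeholder medical passage.",
    ]
    prompt = build_rag_prompt("Placeholder question?", docs)
    print(prompt)

    # A made-up model reply, used only to show citation parsing.
    reply = "Placeholder answer supported by the sources [2][1]."
    print(extract_citations(reply, len(docs)))
```

Filtering out citation numbers that match no supplied document gives a cheap check that an answer stays within the retrieved context, which is the behavior the model is described as targeting.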
