cxrbon16/turkish-llama-MSFT-0.7-ngram-banned
Text Generation | Concurrency Cost: 1 | Model Size: 8B | Quant: FP8 | Ctx Length: 8k | Published: Mar 26, 2026 | License: llama3 | Architecture: Transformer | Cold
The cxrbon16/turkish-llama-MSFT-0.7-ngram-banned model is an 8-billion-parameter language model fine-tuned from ytu-ce-cosmos/Turkish-Llama-8b-v0.1. It has a context length of 8192 tokens and was trained with a learning rate of 2e-05 for 2 epochs. It is adapted for Turkish-language tasks and builds on an existing Turkish Llama base.
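Since the 8192-token context length bounds prompt and output together, a minimal sketch of how a caller might budget generation is shown below. It assumes the checkpoint can be loaded under this model ID through the Hugging Face `transformers` library, which the listing does not confirm. The helper names `max_new_tokens_for` and `generate` and the default of 256 requested tokens are hypothetical, chosen only for illustration.

```python
MODEL_ID = "cxrbon16/turkish-llama-MSFT-0.7-ngram-banned"
CONTEXT_LENGTH = 8192  # context length stated in the model description


def max_new_tokens_for(prompt_tokens: int, requested: int = 256) -> int:
    """Clamp the output budget so prompt + output fits in the context window.

    Hypothetical helper: the default of 256 is an illustrative choice.
    """
    if prompt_tokens >= CONTEXT_LENGTH:
        raise ValueError("prompt already fills the context window")
    return min(requested, CONTEXT_LENGTH - prompt_tokens)


def generate(prompt: str, requested: int = 256) -> str:
    """Generate a completion (assumes the model loads via transformers)."""
    # Imported lazily so the budget helper works without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(MODEL_ID)

    inputs = tokenizer(prompt, return_tensors="pt")
    prompt_tokens = inputs["input_ids"].shape[1]
    budget = max_new_tokens_for(prompt_tokens, requested)

    output_ids = model.generate(**inputs, max_new_tokens=budget)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `generate("Türkiye'nin başkenti neresidir?")` would download the weights on first use. The budget helper can be exercised on its own to check that a long prompt leaves room for output.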