MARS: Turkish-Optimized Llama 3 8B
MARS is the inaugural large language model from Curiosity Technology, built upon the robust Meta Llama 3 8B Instruct architecture. This model distinguishes itself through its specialized training regimen, focusing heavily on the Turkish language.
Key Capabilities & Training:
- Turkish Language Proficiency: MARS has been fine-tuned using LoRA (Low-Rank Adaptation) on a unique combination of in-house Turkish datasets and Turkish translations of various open-source datasets. This targeted training makes it particularly adept at understanding and generating Turkish text.
- Base Model: Leverages the strong foundational capabilities of Llama 3 8B Instruct.
- Training Duration: The model underwent a focused training period of 3 days on 4xA100 GPUs, indicating a dedicated effort to imbue it with specific linguistic capabilities.
Use Cases:
- Turkish Conversational AI: Ideal for chatbots, virtual assistants, and other interactive applications requiring high-quality Turkish language interaction.
- Turkish Content Generation: Suitable for tasks involving generating text, summaries, or responses in Turkish.
- Research and Development: Provides a strong base for further research and development in Turkish natural language processing, with the promise of future release of its Turkish datasets to the community.