Kuyash/teptez-ai

TEXT GENERATIONConcurrency Cost:1Model Size:7.6BQuant:FP8Ctx Length:32kTool Calling:SupportedPublished:Jun 9, 2026Architecture:Transformer Cold

Kuyash/teptez-ai is a 7.6 billion parameter language model, fine-tuned and converted to GGUF format using Unsloth. This model is based on the Qwen2.5-Coder-7B-Instruct architecture, indicating its specialization in code-related tasks. It is designed for efficient deployment and use with tools like llama-cli and Ollama, offering a streamlined solution for developers working with instruction-tuned coding models.

Loading preview...

Overview

Kuyash/teptez-ai is a 7.6 billion parameter language model, specifically a qwen2.5-coder-7b-instruct variant, fine-tuned and converted into the GGUF format. This model leverages the Unsloth framework, which is noted for its efficiency in training, claiming to be 2x faster. The GGUF format facilitates broad compatibility with various inference engines, including llama-cli and Ollama.

Key Capabilities

  • Code-centric Instruction Following: Based on the qwen2.5-coder-7b-instruct architecture, this model is optimized for understanding and generating code-related instructions.
  • GGUF Format: Provided in GGUF, ensuring compatibility with a wide range of local inference tools and platforms.
  • Ollama Integration: Includes an Ollama Modelfile for straightforward deployment and use within the Ollama ecosystem.

Good For

  • Developers seeking an instruction-tuned model for coding tasks.
  • Users who prioritize efficient local deployment via llama-cli or Ollama.
  • Experimentation with models fine-tuned using the Unsloth framework for performance benefits.