espressovi/BODHI-qwen-3-8b-distil

TEXT GENERATIONConcurrency Cost:1Model Size:8BQuant:FP8Ctx Length:32kTool Calling:SupportedPublished:Feb 20, 2026License:mitArchitecture:Transformer Open Weights Cold

BODHI-qwen-3-8b-distil is an 8 billion parameter distilled version of the Qwen3-8B-Base model, developed by espressovi. This model is optimized for efficiency and performance, inheriting capabilities from its larger Qwen3 base. It is suitable for applications requiring a powerful yet compact language model with a 32768 token context length.

Loading preview...

Model Overview

The espressovi/BODHI-qwen-3-8b-distil is an 8 billion parameter language model, representing a distilled version of the larger Qwen3-8B-Base architecture. Distillation techniques are typically employed to create smaller, more efficient models that retain much of the performance of their larger counterparts, making them suitable for deployment in resource-constrained environments or for faster inference.

Key Characteristics

  • Architecture: Based on the Qwen3 family, known for its strong general-purpose language understanding and generation capabilities.
  • Parameter Count: 8 billion parameters, offering a balance between performance and computational efficiency.
  • Context Length: Supports a substantial context window of 32768 tokens, allowing it to process and generate longer sequences of text.
  • Distilled Model: As a distilled version, it aims to provide competitive performance with reduced computational overhead compared to its base model.

Good For

  • Efficient Deployment: Ideal for applications where computational resources are limited, such as edge devices or cost-sensitive cloud environments.
  • General Language Tasks: Capable of handling a wide range of natural language processing tasks, including text generation, summarization, and question answering.
  • Long Context Applications: Its 32768 token context length makes it suitable for tasks requiring understanding or generating extensive documents or conversations.