eggdog100/Qwen3.6-35B_Zenith

TEXT GENERATIONConcurrency Cost:3Model Size:35.1BQuant:FP8Ctx Length:32kTool Calling:SupportedPublished:Jun 24, 2026License:cc-by-nc-4.0Architecture:Transformer0.0K Open Weights Cold

eggdog100/Qwen3.6-35B_Zenith is a 35.1 billion parameter LoRA fine-tune of Qwen/Qwen3.6-35B-A3B, a multimodal MoE model with a vision tower. This derivative strengthens math, code, tool-calling, and natural human-like conversation while preserving its original vision capabilities. It is optimized for enhanced reasoning and conversational fluency, making it suitable for applications requiring robust multimodal understanding and human-like interaction.

Loading preview...

Overview

Qwen3.6-35B_Zenith is a LoRA supervised-fine-tune of the Qwen/Qwen3.6-35B-A3B base model, a 35.1 billion parameter hybrid linear+full-attention multimodal Mixture-of-Experts (MoE) with a vision tower. This fine-tune specifically targets improvements in math, code, tool-calling, and natural human-like conversation, while critically preserving the model's existing vision capabilities.

Key Capabilities & Enhancements

  • Strengthened Reasoning: Improved performance in math and code generation tasks.
  • Enhanced Tool-Calling: Better ability to interact with and utilize external tools.
  • Natural Conversation: Fine-tuned to produce more human-like and empathetic conversational responses, addressing the "talks like a human, not a robot" goal.
  • Vision-Preserved: The model's original vision tower remains frozen and bit-identical to the base, ensuring no regression in multimodal understanding.
  • Open-Weights Training Data: Trained exclusively on openly licensed data, with no distillation from closed frontier models like GPT, Claude, or Gemini.

Performance & Evaluation

Independent evaluations show that Zenith either equals or slightly beats the base model on benchmarks like MMLU-Pro (+1.9) and SuperGPQA (+0.6), with no meaningful regression in core reasoning abilities. The model's conversational style is notably de-roboticized in emotional and conversational contexts.

Good For

  • Applications requiring strong multimodal capabilities combined with enhanced reasoning.
  • Use cases demanding improved math, code, and tool-calling performance.
  • Building conversational agents that aim for more natural and empathetic interactions.

Note: Due to the inclusion of CC-BY-NC datasets (no_robots, empathetic_dialogues), the resulting weights inherit a non-commercial restriction.