Jackrong/Qwopus3.5-27B-v3.5
Qwopus3.5-27B-v3.5 is a 27 billion parameter language model developed by Jackrong, based on Qwen3.5-27B. This version is a data-scaled continuation of Qwopus3.5-27B-v3, trained with approximately twice the SFT data to enhance generalization across domains like mathematics, programming, and multi-turn interactions. It is specifically designed for structured reasoning, tool-augmented workflows, and multi-step agentic tasks, showing improved performance in complex problem-solving and coding scenarios.
Loading preview...
Overview
Jackrong's Qwopus3.5-27B-v3.5 is a 27 billion parameter model built upon Qwen3.5-27B, representing a significant data-scaled evolution from its predecessor, Qwopus3.5-27B-v3. This version focuses on enhancing generalization by expanding its training data to include a broader array of domains such as mathematics, programming, puzzle-solving, multilingual dialogue, and STEM-related tasks. Unlike previous iterations, v3.5 does not introduce new architectural changes, RL stages, or template redesigns, instead relying on a substantial increase (approximately 2x) in high-quality Supervised Fine-Tuning (SFT) data.
Key Capabilities
- Enhanced Structured Reasoning: Designed for complex problem-solving, leveraging structured reasoning over simple Chain-of-Thought (CoT) mimicry.
- Tool-Augmented Workflows: Optimized for integration with external tools, supporting more effective and reliable use of existing knowledge.
- Multi-Step Agentic Tasks: Excels in scenarios requiring multiple steps and agentic planning, particularly in coding and diagnostic tasks.
- Improved Coding Performance: Demonstrated significant gains in SWE-style capability tests, including code inspection, bug diagnosis, and action planning, successfully passing 14 out of 15 programming tasks in a 44-case suite.
- Data-Driven Generalization: Improvements are attributed to scaling high-quality SFT data, leading to better utilization and activation of latent knowledge.
Good For
- Developers requiring robust structured reasoning for complex logical problems.
- Applications involving tool use and agentic workflows.
- Coding tasks, especially those requiring multi-step problem-solving, bug diagnosis, and code generation.
- Use cases demanding strong performance in mathematics, programming, and STEM-related domains.
Limitations
- Potential for overfitting if data scaling exceeds optimal regimes.
- Reasoning may still exhibit instability in certain edge cases.
- Tool-calling performance is dependent on the specific environment integration.
- Not all capabilities have been fully benchmarked yet.