KaraKaraWitch/BlenderCartel-llama33-70B-Pt2
KaraKaraWitch/BlenderCartel-llama33-70B-Pt2 is a 70 billion parameter language model created by KaraKaraWitch, formed by merging multiple Llama-3 and Llama-3.1 based models using the SCE method. This merge, built upon deepcogito/cogito-v2-preview-llama-70B, integrates diverse capabilities including tool calling, multilingual support (Japanese, Traditional Chinese, Korean, Arabic), and specialized instruction following. It is designed to offer a broad range of functionalities by combining the strengths of its constituent models.
Loading preview...
Model Overview
KaraKaraWitch/BlenderCartel-llama33-70B-Pt2 is a 70 billion parameter merged language model, developed by KaraKaraWitch. It was constructed using the SCE (Sparse Component Ensemble) merge method, with deepcogito/cogito-v2-preview-llama-70B serving as its base model.
Key Capabilities & Merged Components
This model integrates a diverse set of capabilities by combining fourteen different Llama-3 and Llama-3.1 based models. The merge specifically targets a broad range of applications, including:
- Tool Calling: Incorporates
watt-ai/watt-tool-70Bfor enhanced tool interaction capabilities. - Multilingual Support: Includes models like
rinna/llama-3-youko-70b(Japanese),yentinglin/Llama-3-Taiwan-70B-Instruct(Traditional Chinese),Bllossom/llama-3-Korean-Bllossom-70B(Korean), andFreedomIntelligence/AceGPT-v2-70B(Arabic), aiming for robust performance across multiple languages. - Instruction Following: Integrates various instruction-tuned models such as
kldzj/Llama-3.3-70B-Instruct-hereticandflammenai/Llama3.1-Flammades-70B. - Diverse General Knowledge: Blends models like
Delta-Vector/Shimamura-70B,Undi95/Sushi-v1.4, andMawdistical/Anthrobomination-70Bto enhance general understanding and response generation.
Merge Configuration
The merge process utilized a select_topk value of 0.2 and applied normalization, with the model weights stored in bfloat16 data type. This configuration aims to synthesize the strengths of the constituent models into a versatile and capable LLM.
Top 3 parameter combinations used by Featherless users for this model. Click a tab to see each config.