seed429/affine_hotkey11_5E2HEWBbHU73PkMU5saE7zRiTjW2CmxRMqWRLEn9Wrrxvk5f
The seed429/affine_hotkey11_5E2HEWBbHU73PkMU5saE7zRiTjW2CmxRMqWRLEn9Wrrxvk5f is a 32 billion parameter language model created by seed429, merged using the DARE TIES method with Qwen3-32B as its base. This model integrates four distinct 'Affine' models, combining their strengths for enhanced performance. It is designed to leverage the collective intelligence of its constituent models, making it suitable for a broad range of general-purpose language tasks.
Loading preview...
Model Overview
The seed429/affine_hotkey11_5E2HEWBbHU73PkMU5saE7zRiTjW2CmxRMqWRLEn9Wrrxvk5f is a 32 billion parameter language model developed by seed429. It was created using the DARE TIES merge method, a technique designed to combine multiple pre-trained language models effectively. The base model for this merge was Qwen3-32B.
Merge Details
This model is a composite of four distinct 'Affine' models, each contributing to the overall capabilities. The specific models integrated are:
fakemoonlo_Affine-5FnfLT3ntQXDsAnVC5H5WNQYVTY7SSCbxU3kxqhNybtJeNGbhippo-master_affine-20-5C88SLkEJWWGZoPpexcyZsnF7TWhPPg2avyKcmWfNdJx1hPodistillgpt_Affine-5CmFcnxCwJHQFaBxYLLZj4CXv2wbiwPckxnm1u7ng1d1NsH2prexpert_affine-107-5GbsxJvygQaBrTdsqUawR3XWDi6CbqNgiPDVgbSTSzSfMJDD
The merge process utilized mergekit with specific weighting and density parameters for each constituent model, aiming to optimize their combined performance. The configuration specified a bfloat16 data type and included normalization during the merge.
Key Characteristics
- Architecture: Merged model based on Qwen3-32B, integrating four specialized 'Affine' models.
- Parameter Count: 32 billion parameters.
- Context Length: Supports a context length of 32768 tokens.
- Merge Method: Employs the DARE TIES method for combining model weights.
Potential Use Cases
Given its large parameter count and the integration of multiple models, this model is likely suitable for a variety of general-purpose language generation and understanding tasks, benefiting from the diverse strengths of its merged components.