Zachary1150/merge_cosfmt_MRL4096_ROLLOUT4_LR5e-7_w0.1_linear

Text generation · Model size: 1.5B · Quant: BF16 · Context length: 32k · Published: Dec 20, 2025 · Architecture: Transformer

Zachary1150/merge_cosfmt_MRL4096_ROLLOUT4_LR5e-7_w0.1_linear is a 1.5 billion parameter language model created by Zachary1150 using the Linear merge method. It was produced by combining two pre-trained language models, specifically actor checkpoints from the baselines_openrs training runs, and is intended for general language tasks, leveraging the combined strengths of its constituent models.


Model Overview

This model, merge_cosfmt_MRL4096_ROLLOUT4_LR5e-7_w0.1_linear, is a 1.5 billion parameter language model developed by Zachary1150. It was constructed using the Linear merge method via mergekit, combining the weights of two distinct pre-trained language models.

Merge Details

The merge process specifically integrated two actor checkpoints from the /local/scratch/zli2255/workspace/MergeExpert/checkpoints/baselines_openrs/ directory:

  • cos_MRL4096_ROLLOUT4_LR5e-7/global_step_54/actor/huggingface (weighted at 0.1)
  • accfmt_MRL4096_ROLLOUT4_LR5e-7/global_step_54/actor/huggingface (weighted at 0.9)
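The model card does not publish the mergekit configuration file itself, but given the stated merge method, weights, and dtype, it would plausibly look like the following sketch (the YAML below is a reconstruction, not the author's actual config):

```yaml
# Hypothetical mergekit config matching the described merge.
merge_method: linear
models:
  - model: /local/scratch/zli2255/workspace/MergeExpert/checkpoints/baselines_openrs/cos_MRL4096_ROLLOUT4_LR5e-7/global_step_54/actor/huggingface
    parameters:
      weight: 0.1
  - model: /local/scratch/zli2255/workspace/MergeExpert/checkpoints/baselines_openrs/accfmt_MRL4096_ROLLOUT4_LR5e-7/global_step_54/actor/huggingface
    parameters:
      weight: 0.9
parameters:
  normalize: true
dtype: bfloat16
```

A config like this would be run with `mergekit-yaml config.yml ./output-model`, producing the merged checkpoint in Hugging Face format.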

This configuration, using the bfloat16 data type with weight normalization enabled, aims to leverage the complementary strengths of the merged components for improved performance in general language understanding and generation tasks. The model's architecture and capabilities are inherited from its source models, whose weights are averaged rather than retrained.
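Conceptually, a linear merge is just a per-parameter weighted average of the source models' weights, with the weights renormalized to sum to one. The sketch below illustrates this on plain Python dicts of scalars; it is an illustration of the idea, not mergekit's actual implementation (which operates on full tensors):

```python
def linear_merge(state_dicts, weights, normalize=True):
    """Weighted average of parameter dicts.

    Illustrative sketch of the linear merge method: each parameter in
    the output is sum_i(w_i * param_i), with the weights optionally
    normalized so they sum to 1 (mirroring mergekit's `normalize` flag).
    """
    if normalize:
        total = sum(weights)
        weights = [w / total for w in weights]
    merged = {}
    for name in state_dicts[0]:
        merged[name] = sum(w * sd[name] for w, sd in zip(weights, state_dicts))
    return merged


# Toy example with scalar "parameters", using the 0.1 / 0.9 split
# described above:
model_a = {"layer.weight": 1.0}
model_b = {"layer.weight": 2.0}
merged = linear_merge([model_a, model_b], weights=[0.1, 0.9])
# merged["layer.weight"] == 0.1 * 1.0 + 0.9 * 2.0 == 1.9
```

With real checkpoints, the same arithmetic is applied element-wise to every tensor in the two models' state dicts, which is why the source models must share an architecture.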