TAUR-dev/rankalign-v6-gemma-2-2b-d0.15-e2-hc-b2d-dbl-all-tco-nv1-ng1-vlo-fsx

Hugging Face
TEXT GENERATIONConcurrency Cost:1Model Size:2.6BQuant:BF16Ctx Length:8kPublished:Apr 6, 2026Architecture:Transformer Warm

The TAUR-dev/rankalign-v6-gemma-2-2b-d0.15-e2-hc-b2d-dbl-all-tco-nv1-ng1-vlo-fsx is a 2.6 billion parameter language model based on the Google Gemma-2-2b architecture, fine-tuned as part of the rankalign project. This model is specifically optimized for tasks involving hypernym-concat-bananas-to-dogs-double-all, utilizing a unique training methodology with typicality correction and validator log-odds. It is designed for specialized linguistic analysis, particularly in identifying hierarchical relationships between concepts, and operates with an 8192-token context length.

Loading preview...

Model Overview

This model, rankalign-v6-gemma-2-2b-d0.15-e2-hc-b2d-dbl-all-tco-nv1-ng1-vlo-fsx, is a fine-tuned checkpoint derived from the google/gemma-2-2b base model. It is part of the rankalign project, which focuses on advanced linguistic alignment tasks.

Key Training Details

The model underwent specific fine-tuning for a task identified as hypernym-concat-bananas-to-dogs-double-all over 2 epochs. Notable training parameters include a delta of 0.15, online typicality correction, and the use of validator log-odds. Both preference loss and NLL (Negative Log-Likelihood) for validator and generator had a weight of 1, with force-same-x enabled during training.

Specialized Capabilities

This model is particularly suited for evaluating and understanding hypernym relationships across various semantic categories. The provided evaluation scripts demonstrate its application across a diverse set of hypernym tasks, such as hypernym-bananas, hypernym-dogs, hypernym-cars, and hypernym-elephants. Its fine-tuning process suggests an enhanced ability to discern and rank conceptual hierarchies, making it valuable for research in semantic understanding and linguistic structure.

Intended Use Cases

Given its specialized training, this model is ideal for:

  • Linguistic Research: Investigating hypernymy and other semantic relations.
  • Knowledge Graph Construction: Identifying and validating hierarchical links between entities.
  • Specialized NLP Tasks: Applications requiring precise understanding of conceptual categorization and ranking.