google/txgemma-27b-predict
Hugging Face
TEXT GENERATIONConcurrency Cost:2Model Size:27BQuant:FP8Ctx Length:32kPublished:Mar 21, 2025License:health-ai-developer-foundationsArchitecture:Transformer0.0K Gated Warm

TxGemma-27B-Predict is a 27 billion parameter open language model developed by Google, built upon the Gemma 2 architecture and fine-tuned for therapeutic development. It excels at processing and understanding information related to therapeutic modalities and targets, such as small molecules, proteins, and diseases. This model is optimized for property prediction tasks in drug discovery and can serve as a foundation for further specialized fine-tuning.

Loading preview...

TxGemma-27B-Predict: Specialized LLM for Therapeutic Development

TxGemma-27B-Predict is a 27 billion parameter model from Google, part of the TxGemma collection, which are lightweight, state-of-the-art open language models based on Gemma 2. This specific variant is fine-tuned for therapeutic development, focusing on processing and understanding information across various therapeutic modalities and targets, including small molecules, proteins, nucleic acids, diseases, and cell lines.

Key Capabilities

  • Therapeutic Task Excellence: Designed to excel at tasks such as property prediction, outperforming or matching best-in-class performance on a significant number of benchmarks (50 out of 66 tasks on the Therapeutics Data Commons benchmark).
  • Data Efficiency: Demonstrates competitive performance even with limited data, offering improvements over predecessors.
  • Foundation Model: Can be used as a pre-trained foundation for further fine-tuning for specialized use cases in drug discovery.
  • Input Versatility: Accepts inputs including SMILES strings, amino acid sequences, nucleotide sequences, and natural language text, formatted according to the Therapeutics Data Commons (TDC) structure.

Good For

  • Accelerated Drug Discovery: Streamlining the therapeutic development process by predicting properties of therapeutics and targets, including target identification, drug-target interaction prediction, and clinical trial approval prediction.
  • Research and Development: A valuable tool for researchers in therapeutic R&D, offering strong performance across a wide range of tasks and integration into agentic workflows.

Note that this predict variant expects a narrow form of prompting for optimal performance, differing from the more flexible conversational Chat variants.