Name: THGLab/Llama-3.1-8B-SmileyLlama-1.1 API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: THGLab

THGLab/Llama-3.1-8B-SmileyLlama-1.1 Overview

THGLab/Llama-3.1-8B-SmileyLlama-1.1 is an 8 billion parameter language model, fine-tuned from Llama-3.1-8B-Instruct, with a specialized focus on generating SMILES (Simplified Molecular Input Line Entry System) strings for drug-like molecules. This model, dubbed SmileyLlama, was trained on millions of molecules to enable on-demand generation of chemical structures.

Key Capabilities

SMILES String Generation: Generates valid SMILES strings for molecules.
Property-Guided Generation: Can generate molecules based on a wide range of specified chemical properties, including:
- H-bond donors and acceptors
- Molecular weight
- logP
- Rotatable bonds
- Fraction sp3
- TPSA (Topological Polar Surface Area)
- Presence of macrocycles
- Absence/presence of "bad SMARTS" (structural alerts)
- Absence/presence of specific covalent warheads (e.g., sulfonyl fluorides, acrylamides, epoxides)
- Substructure matching (e.g., "A substructure of SMILES_STRING")
- Chemical formula matching (e.g., "A chemical of CHEMICAL_FORMULA")
Efficient Generation: Supports num_return_sequences for rapid generation of multiple SMILES strings, limited by memory.

Use Cases

This model is particularly well-suited for applications in cheminformatics, computational chemistry, and drug discovery where the generation of novel molecules with specific desired properties is required. It can be used to:

Design new drug candidates by specifying desired physicochemical properties.
Generate libraries of molecules for virtual screening.
Explore chemical space based on structural and property constraints.

For more in-depth technical details, refer to the associated ArXiv preprint.

Overview

THGLab/Llama-3.1-8B-SmileyLlama-1.1 Overview

Key Capabilities

Use Cases

Full Model Card (README)