Name: FunAudioLLM/InspireMusic-Base API
Brand: Featherless.ai
Price: 10.00 USD
Availability: InStock
Author: FunAudioLLM

InspireMusic-Base: Unified Music, Song, and Audio Generation

InspireMusic-Base is a 0.5 billion parameter model developed by FunAudioLLM, designed as a unified toolkit for generating music, songs, and general audio. It leverages an autoregressive transformer, specifically using a Qwen2.5 backbone, combined with a super-resolution flow-matching model to produce high-quality, long-form audio.

Key Capabilities

Unified Framework: Integrates audio tokenization with an autoregressive transformer and flow-matching for comprehensive audio generation.
High-Quality Audio: Focuses on generating music with high audio fidelity.
Long-Form Generation: Capable of producing extended music pieces, with some models supporting several minutes of audio.
Text and Audio Prompts: Supports controllable generation using both text descriptions and audio prompts.
Diverse Tasks: Currently supports text-to-music and music continuation, with future plans for song and general audio generation.
Hardware Efficiency: Can run in 'fast mode' with 12GB GPU memory, while 'normal mode' (with flow matching) recommends 24GB for optimal experience.

Good For

Developers and researchers focused on music generation from text or audio prompts.
Creating long-form musical compositions.
Innovating soundscapes and enhancing audio research.
Experimenting with a unified framework for various audio generation tasks.

Overview

InspireMusic-Base: Unified Music, Song, and Audio Generation

Key Capabilities

Good For

Full Model Card (README)