FunAudioLLM/InspireMusic-1.5B-24kHz is a 1.5-billion-parameter autoregressive transformer model from FunAudioLLM, designed for music generation. It combines an audio tokenizer with a Qwen2.5-based backbone and a super-resolution flow-matching model to produce coherent, contextually relevant audio. The model supports text-to-music generation, music continuation, and long-form audio synthesis, and it outputs 24 kHz mono audio.
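To make the 24 kHz mono output format concrete, here is a minimal sketch using only the Python standard library. It does not call the model. The helper names (`num_samples`, `write_mono_wav`, `sine_placeholder`) are hypothetical, the sine wave is a stand-in for generated audio, and 16-bit PCM is an assumption, since the description does not state a bit depth.

```python
import math
import struct
import wave

SAMPLE_RATE = 24_000  # 24 kHz, as stated in the model description
CHANNELS = 1          # mono, as stated in the model description
SAMPLE_WIDTH = 2      # assumption: 16-bit PCM (bit depth is not specified)


def num_samples(duration_s: float, sample_rate: int = SAMPLE_RATE) -> int:
    """Return the number of samples in a mono clip of the given duration."""
    return int(round(duration_s * sample_rate))


def sine_placeholder(duration_s: float, freq_hz: float = 440.0) -> list[float]:
    """Build a placeholder waveform; this is NOT model output."""
    n = num_samples(duration_s)
    return [0.5 * math.sin(2 * math.pi * freq_hz * i / SAMPLE_RATE) for i in range(n)]


def write_mono_wav(path: str, samples: list[float], sample_rate: int = SAMPLE_RATE) -> None:
    """Write float samples in [-1.0, 1.0] to a 16-bit PCM mono WAV file."""
    pcm = bytearray()
    for s in samples:
        clipped = max(-1.0, min(1.0, s))  # guard against out-of-range values
        pcm += struct.pack("<h", int(clipped * 32767))
    with wave.open(path, "wb") as wf:
        wf.setnchannels(CHANNELS)
        wf.setsampwidth(SAMPLE_WIDTH)
        wf.setframerate(sample_rate)
        wf.writeframes(bytes(pcm))
```

Under these assumptions, a 30-second clip holds `num_samples(30.0)` = 720,000 samples, or about 1.44 MB of raw 16-bit PCM data.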