Focusing on multimodal synthesis (speech/audio/music), speech translation, and audio editing.
-
Zhejiang University
Pinned Loading
-
FunAudioLLM/ThinkSound
FunAudioLLM/ThinkSound Public[NeurIPS 2025] PyTorch implementation of [ThinkSound], a unified framework for generating audio from any modality, guided by Chain-of-Thought (CoT) reasoning.
-
Text-to-Audio/AudioLCM
Text-to-Audio/AudioLCM PublicPyTorch Implementation of AudioLCM (ACM-MM'24): a efficient and high-quality text-to-audio generation with latent consistency model.
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.