[HiFi-GAN] GAN-based Vocoder Capable Of Generating 22 KHz Audio On A Single GPU [HiFi-GAN] GAN-based Vocoder Capable Of Generating 22 KHz Audio On A Single GPU 10/07/2024 Speech Synthesis
[VoiceCraft] A Language Model That Synthesizes Natural Speech At The Highest Level In The Industry [VoiceCraft] A Language Model That Synthesizes Natural Speech At The Highest Level In The Industry 01/07/2024 Speech Synthesis
[MusicLDM] Text-to-Music Model With Low Risk Of Plagiarism [MusicLDM] Text-to-Music Model With Low Risk Of Plagiarism 22/01/2024 Diffusion Model
CLAP] Contrastive Learning Model Of Speech And Text CLAP] Contrastive Learning Model Of Speech And Text 21/12/2023 Contrastive Learning
LP-MusicCaps] Automatic Generation Of Music Captions Using LLM LP-MusicCaps] Automatic Generation Of Music Captions Using LLM 20/11/2023 Contrastive Learning
Now There's A Technique For Editing The Facial Movements Of Characters In A Video To Match Any Emotion! Now There's A Technique For Editing The Facial Movements Of Characters In A Video To Match Any Emoti ... 05/08/2022 CVPR
FreeMo, A Model That Automatically Generates Upper Body Gestures In Response To Speech, Is Here! FreeMo, A Model That Automatically Generates Upper Body Gestures In Response To Speech, Is Here! 19/07/2022 Speech Synthesis