Generating Dysarthric Speech! What Is The Magic Data Extension Technology To Solve The Shortage Of Training Data? Generating Dysarthric Speech! What Is The Magic Data Extension Technology To Solve The Shortage Of T ... 26/07/2024 Sound
[Unit-DSR] Normalization Of Disabled Speech To Normal Speech By HuBERT [Unit-DSR] Normalization Of Disabled Speech To Normal Speech By HuBERT 26/07/2024 Self-supervised Learning
[HiFi-GAN] GAN-based Vocoder Capable Of Generating 22 KHz Audio On A Single GPU [HiFi-GAN] GAN-based Vocoder Capable Of Generating 22 KHz Audio On A Single GPU 10/07/2024 Speech Synthesis
[Mustango] Music Generation Model Utilizing Domain Knowledge Of Music [Mustango] Music Generation Model Utilizing Domain Knowledge Of Music 01/07/2024 Audio And Speech Processing
[VoiceCraft] A Language Model That Synthesizes Natural Speech At The Highest Level In The Industry [VoiceCraft] A Language Model That Synthesizes Natural Speech At The Highest Level In The Industry 01/07/2024 Speech Synthesis
The Secrets Of Speech Recognition Technology The Secrets Of Speech Recognition Technology 24/04/2024 Voice Recognition
AI's Cambrian Explosion: The Key To The Era Of Finding And Utilizing Useful AI Creators AI's Cambrian Explosion: The Key To The Era Of Finding And Utilizing Useful AI Creators 18/03/2024 Video Generation
[MusicLDM] Text-to-Music Model With Low Risk Of Plagiarism [MusicLDM] Text-to-Music Model With Low Risk Of Plagiarism 22/01/2024 Diffusion Model
[AudioLDM] Text-to-Audio Generation Model Using Latent Diffusion [AudioLDM] Text-to-Audio Generation Model Using Latent Diffusion 16/01/2024 Diffusion Model
[CoDi] Any-to-any Diffusion Model That Can Handle Almost Any Modality [CoDi] Any-to-any Diffusion Model That Can Handle Almost Any Modality 12/01/2024 Diffusion Model
CLAP] Contrastive Learning Model Of Speech And Text CLAP] Contrastive Learning Model Of Speech And Text 21/12/2023 Contrastive Learning
Brain2Music] Automatic Music Generation Based On Brain Information Brain2Music] Automatic Music Generation Based On Brain Information 06/12/2023 Large Language Models
LP-MusicCaps] Automatic Generation Of Music Captions Using LLM LP-MusicCaps] Automatic Generation Of Music Captions Using LLM 20/11/2023 Contrastive Learning
MuLan] Multimodal Music-Text Using Contrastive Learning MuLan] Multimodal Music-Text Using Contrastive Learning 24/10/2023 Contrastive Learning
[MusicLM] Text-to-Music Generation Model Developed By Google. [MusicLM] Text-to-Music Generation Model Developed By Google. 18/10/2023 Transformer
Make-An-Audio] Prompt-enhanced Diffusion Model For Speech Generation. Make-An-Audio] Prompt-enhanced Diffusion Model For Speech Generation. 16/10/2023 Diffusion Model
Multimodal Emotion Recognition From Text, Voice And Vision: Sony's Proposed M2FNet! Multimodal Emotion Recognition From Text, Voice And Vision: Sony's Proposed M2FNet! 31/01/2023 Emotion Recognition
How Should We Link Different Resolution Features? : D3Net Proposed By Sony How Should We Link Different Resolution Features? : D3Net Proposed By Sony 30/01/2023 CVPR