Speech Synthesis
The Time Has Come For Everyone To Speak English! Zero-shot Text-to-speech Technology For Multiple Languages Makes It Easy For Anyo ...
The Time Has Come For Everyone To Speak English! Zero-shot Text-to-speech Technology For Multiple La ...
Speech Recognition For The Dysarthric
Zero-shot Learning] AI Voice Cloning And Lip-syncing Verification And Explanation
Zero-shot Learning] AI Voice Cloning And Lip-syncing Verification And Explanation
Neural Network
The Future Of Music Education, Flute X GPT And LAUI's Potential To Change Large-Scale Language Models
The Future Of Music Education, Flute X GPT And LAUI's Potential To Change Large-Scale Language Model ...
Large Language Models
Generating Dysarthric Speech! What Is The Magic Data Extension Technology To Solve The Shortage Of Training Data?
Generating Dysarthric Speech! What Is The Magic Data Extension Technology To Solve The Shortage Of T ...
Sound
[HiFi-GAN] GAN-based Vocoder Capable Of Generating 22 KHz Audio On A Single GPU
[HiFi-GAN] GAN-based Vocoder Capable Of Generating 22 KHz Audio On A Single GPU
Speech Synthesis
[VoiceCraft] A Language Model That Synthesizes Natural Speech At The Highest Level In The Industry
[VoiceCraft] A Language Model That Synthesizes Natural Speech At The Highest Level In The Industry
Speech Synthesis
AI's Cambrian Explosion: The Key To The Era Of Finding And Utilizing Useful AI Creators
AI's Cambrian Explosion: The Key To The Era Of Finding And Utilizing Useful AI Creators
Video Generation
[MusicLDM] Text-to-Music Model With Low Risk Of Plagiarism
[MusicLDM] Text-to-Music Model With Low Risk Of Plagiarism
Diffusion Model
[AudioLDM] Text-to-Audio Generation Model Using Latent Diffusion
[AudioLDM] Text-to-Audio Generation Model Using Latent Diffusion
Diffusion Model
[CoDi] Any-to-any Diffusion Model That Can Handle Almost Any Modality
[CoDi] Any-to-any Diffusion Model That Can Handle Almost Any Modality
Diffusion Model
CLAP] Contrastive Learning Model Of Speech And Text
CLAP] Contrastive Learning Model Of Speech And Text
Contrastive Learning
Brain2Music] Automatic Music Generation Based On Brain Information
Brain2Music] Automatic Music Generation Based On Brain Information
Large Language Models
LP-MusicCaps] Automatic Generation Of Music Captions Using LLM
LP-MusicCaps] Automatic Generation Of Music Captions Using LLM
Contrastive Learning
MuLan] Multimodal Music-Text Using Contrastive Learning
MuLan] Multimodal Music-Text Using Contrastive Learning
Contrastive Learning
[MusicLM] Text-to-Music Generation Model Developed By Google.
[MusicLM] Text-to-Music Generation Model Developed By Google.
Transformer