[HiFi-GAN] GAN-based Vocoder Capable Of Generating 22 KHz Audio On A Single GPU [HiFi-GAN] GAN-based Vocoder Capable Of Generating 22 KHz Audio On A Single GPU 10/07/2024 Speech Synthesis
Latent Diffusion Models Do Not Necessarily "increase In Size" Latent Diffusion Models Do Not Necessarily "increase In Size" 10/07/2024 Diffusion Model
[Mustango] Music Generation Model Utilizing Domain Knowledge Of Music [Mustango] Music Generation Model Utilizing Domain Knowledge Of Music 01/07/2024 Audio And Speech Processing
[VoiceCraft] A Language Model That Synthesizes Natural Speech At The Highest Level In The Industry [VoiceCraft] A Language Model That Synthesizes Natural Speech At The Highest Level In The Industry 01/07/2024 Speech Synthesis
[AlphaCodium] Highest Performance Code Generation Method Specialized For Programming [AlphaCodium] Highest Performance Code Generation Method Specialized For Programming 30/05/2024 Large Language Models
[MusicLDM] Text-to-Music Model With Low Risk Of Plagiarism [MusicLDM] Text-to-Music Model With Low Risk Of Plagiarism 22/01/2024 Diffusion Model
[AudioLDM] Text-to-Audio Generation Model Using Latent Diffusion [AudioLDM] Text-to-Audio Generation Model Using Latent Diffusion 16/01/2024 Diffusion Model
[CoDi] Any-to-any Diffusion Model That Can Handle Almost Any Modality [CoDi] Any-to-any Diffusion Model That Can Handle Almost Any Modality 12/01/2024 Diffusion Model
Versatile Diffusion] Diffusion Model That Integrates Text And Images Versatile Diffusion] Diffusion Model That Integrates Text And Images 21/12/2023 Diffusion Model
CLAP] Contrastive Learning Model Of Speech And Text CLAP] Contrastive Learning Model Of Speech And Text 21/12/2023 Contrastive Learning
UniD3] Multimodal Discrete Diffusion Model Integrating Image And Text UniD3] Multimodal Discrete Diffusion Model Integrating Image And Text 14/12/2023 Diffusion Model
Brain2Music] Automatic Music Generation Based On Brain Information Brain2Music] Automatic Music Generation Based On Brain Information 06/12/2023 Large Language Models
LP-MusicCaps] Automatic Generation Of Music Captions Using LLM LP-MusicCaps] Automatic Generation Of Music Captions Using LLM 20/11/2023 Contrastive Learning
MuLan] Multimodal Music-Text Using Contrastive Learning MuLan] Multimodal Music-Text Using Contrastive Learning 24/10/2023 Contrastive Learning
[MusicLM] Text-to-Music Generation Model Developed By Google. [MusicLM] Text-to-Music Generation Model Developed By Google. 18/10/2023 Transformer
Make-An-Audio] Prompt-enhanced Diffusion Model For Speech Generation. Make-An-Audio] Prompt-enhanced Diffusion Model For Speech Generation. 16/10/2023 Diffusion Model
Moûsai] Diffusion Model Of High-quality Music Generation By Text Input. Moûsai] Diffusion Model Of High-quality Music Generation By Text Input. 04/10/2023 Diffusion Model
Autonomous Drone-controlled Reforestation Approach Using MA Reinforcement Learning Autonomous Drone-controlled Reforestation Approach Using MA Reinforcement Learning 23/05/2023 Reinforcement Learning