[HiFi-GAN] GAN-based Vocoder Capable Of Generating 22 kHz Audio On A Single GPU 10/07/2024 Speech Synthesis
Latent Diffusion Models Do Not Necessarily "Increase In Size" 10/07/2024 Diffusion Model
[Mustango] Music Generation Model Utilizing Domain Knowledge Of Music 01/07/2024 Audio And Speech Processing
[VoiceCraft] A Language Model That Synthesizes Natural Speech At The Highest Level In The Industry 01/07/2024 Speech Synthesis
[AlphaCodium] Highest Performance Code Generation Method Specialized For Programming 30/05/2024 Large Language Models
"BioinspiredLLM" Innovations In Biological Materials Research Using Large-scale Language Models 24/05/2024 Large Language Models
Can LLM Recreate A Persona Based On The Big Five! 23/05/2024 ChatGPT
U-ViT: ViT Backbone For Diffusion Models 23/05/2024 Image Generation
ADD: Diffusion Model With Adversarial Learning And Knowledge Distillation 21/05/2024 Image Generation
[DPO] A Method For Directly Matching Large-scale Language Models To User Preferences Without Using Reinforcement Learning 02/02/2024 RLHF
Apple's Efficient Inference Of Large Language Models On Devices With Limited Memory Capacity 29/01/2024 Large Language Models
[MusicLDM] Text-to-Music Model With Low Risk Of Plagiarism 22/01/2024 Diffusion Model
[AudioLDM] Text-to-Audio Generation Model Using Latent Diffusion 16/01/2024 Diffusion Model
[CoDi] Any-to-any Diffusion Model That Can Handle Almost Any Modality 12/01/2024 Diffusion Model
[Versatile Diffusion] Diffusion Model That Integrates Text And Images 21/12/2023 Diffusion Model
[CLAP] Contrastive Learning Model Of Speech And Text 21/12/2023 Contrastive Learning
[Brain2Music] Automatic Music Generation Based On Brain Information 06/12/2023 Large Language Models