Generating Dysarthric Speech! What Is The Magic Data Extension Technology To Solve The Shortage Of Training Data? Generating Dysarthric Speech! What Is The Magic Data Extension Technology To Solve The Shortage Of T ... 26/07/2024 Sound
[Unit-DSR] Normalization Of Disabled Speech To Normal Speech By HuBERT [Unit-DSR] Normalization Of Disabled Speech To Normal Speech By HuBERT 26/07/2024 Self-supervised Learning
[Review] Industrial IoT Driving Smart Manufacturing [Review] Industrial IoT Driving Smart Manufacturing 25/07/2024 Internet Of Things
Enhanced Defect Detection Using Tensor CNN Enhanced Defect Detection Using Tensor CNN 25/07/2024 Tensor
[Double Descent] Why Are "large Models" And "large Data Sets" Important? [Double Descent] Why Are "large Models" And "large Data Sets" Important? 25/07/2024 Neural Network
[Chat-REC] Proposal For LLM-based Recommendation System [Chat-REC] Proposal For LLM-based Recommendation System 24/07/2024 Recommendation
[HiFi-GAN] GAN-based Vocoder Capable Of Generating 22 KHz Audio On A Single GPU [HiFi-GAN] GAN-based Vocoder Capable Of Generating 22 KHz Audio On A Single GPU 10/07/2024 Speech Synthesis
Latent Diffusion Models Do Not Necessarily "increase In Size" Latent Diffusion Models Do Not Necessarily "increase In Size" 10/07/2024 Diffusion Model
[Mustango] Music Generation Model Utilizing Domain Knowledge Of Music [Mustango] Music Generation Model Utilizing Domain Knowledge Of Music 01/07/2024 Audio And Speech Processing
[VoiceCraft] A Language Model That Synthesizes Natural Speech At The Highest Level In The Industry [VoiceCraft] A Language Model That Synthesizes Natural Speech At The Highest Level In The Industry 01/07/2024 Speech Synthesis
A Method To Automatically Evaluate "the Accuracy Of LLM's Output Of Long Sentences" Was Created A Method To Automatically Evaluate "the Accuracy Of LLM's Output Of Long Sentences" Was Created 01/07/2024 Large Language Models
Transforming Legal Services With Large-Scale Language Models! Surpassing Humans In Speed And Accuracy? Transforming Legal Services With Large-Scale Language Models! Surpassing Humans In Speed And Accurac ... 28/06/2024 Large Language Models
VideoPrism Opens Up The Possibilities Of Video Analytics VideoPrism Opens Up The Possibilities Of Video Analytics 27/06/2024 Large Language Models
RakutenAI-7B" Pioneers The Frontiers Of Large-scale Language Models Specialized For Japanese RakutenAI-7B" Pioneers The Frontiers Of Large-scale Language Models Specialized For Japanese 27/06/2024 Large Language Models
LLM4Decompile, A Large-scale Language Model Specialized For Decompiling LLM4Decompile, A Large-scale Language Model Specialized For Decompiling 27/06/2024 Large Language Models
ScreenAI" Understands Images And Text From Infographics To UI ScreenAI" Understands Images And Text From Infographics To UI 24/06/2024 Large Language Models
Automation Of Scientific Experiments With Multi Large-scale Language Models From Autonomous Design To Execution Automation Of Scientific Experiments With Multi Large-scale Language Models From Autonomous Design T ... 24/06/2024 Large Language Models
Assessing The Robustness Of Zero-shot Image Understanding Models Through CLIP Assessing The Robustness Of Zero-shot Image Understanding Models Through CLIP 24/06/2024 Contrastive Learning
Wavelet Diffusion: The Fastest Diffusion Model Wavelet Diffusion: The Fastest Diffusion Model 16/04/2024 Image Generation
Improved Accuracy And Transparency Of Face Recognition With ChatGPT, New Developments In Soft Biometrics Improved Accuracy And Transparency Of Face Recognition With ChatGPT, New Developments In Soft Biomet ... 08/04/2024 Large Language Models
[RL-GPT] A Framework To Acquire Diamonds Several Times Faster Than Usual With Mincraft Is Now Available [RL-GPT] A Framework To Acquire Diamonds Several Times Faster Than Usual With Mincraft Is Now Availa ... 18/04/2024 Machine Learning
A Framework Is Now Available That Allows LLMs To Assess Human Personality Using The MBTI! A Framework Is Now Available That Allows LLMs To Assess Human Personality Using The MBTI! 22/04/2024 ChatGPT
VCRL: A New Approach To LLM Reinforcement Learning That Controls Learning Difficulty With Reward Variance VCRL: A New Approach To LLM Reinforcement Learning That Controls Learning Difficulty With Reward Var ... 03/10/2025
Easy! High Accuracy! Attractiveness Of The Anomaly Detection Model PatchCore Easy! High Accuracy! Attractiveness Of The Anomaly Detection Model PatchCore 12/07/2023 Object Detection
[DPO] A Method For Directly Matching Large-scale Language Models To User Preferences Without Using Reinforcement Learning [DPO] A Method For Directly Matching Large-scale Language Models To User Preferences Without Using R ... 02/02/2024 RLHF
The Challenge Of Social-MAE, A Social AI That Uses Self-supervised Learning To Decipher Emotions, Laughter, And Personality The Challenge Of Social-MAE, A Social AI That Uses Self-supervised Learning To Decipher Emotions, La ... 02/10/2025
Superior To ViT! A New Underlying Model For Large-Scale CNNs! : InternImage Superior To ViT! A New Underlying Model For Large-Scale CNNs! : InternImage 13/01/2023 Deep Learning
Machine Learning Model Tackles Soccer Match Prediction In Sports Betting Machine Learning Model Tackles Soccer Match Prediction In Sports Betting 29/01/2025 Sports Analytics
Qwen2-VL] Latest VLM That Can Process Images And Videos In Different Resolutions Qwen2-VL] Latest VLM That Can Process Images And Videos In Different Resolutions 01/10/2024 Large Language Models
ImmerseGen: Agent-guided, Lightweight X Highly Realistic Next-generation VR Scene Generation ImmerseGen: Agent-guided, Lightweight X Highly Realistic Next-generation VR Scene Generation 24/07/2025
[Swin Transformer] Transformer-based Image Recognition Models To Keep Now! [Swin Transformer] Transformer-based Image Recognition Models To Keep Now! 22/03/2024 Image Recognition
U-Net And Transformer Combined! Introducing Swin Unet, A New Network For Medical Image Segmentation. U-Net And Transformer Combined! Introducing Swin Unet, A New Network For Medical Image Segmentation. 20/05/2022 Medical