Human Eyes Solve The Mystery Of Images That Deceive AI Human Eyes Solve The Mystery Of Images That Deceive AI 02/10/2024 Study
Qwen2-VL] Latest VLM That Can Process Images And Videos In Different Resolutions Qwen2-VL] Latest VLM That Can Process Images And Videos In Different Resolutions 01/10/2024 Large Language Models
A New Approach To Improving The Performance Of Biomedical NER With Large-Scale Language Models A New Approach To Improving The Performance Of Biomedical NER With Large-Scale Language Models 01/10/2024 Large Language Models
A Better Attention Mechanism Will Improve The Performance Of LLM's Long-text Processing! A Better Attention Mechanism Will Improve The Performance Of LLM's Long-text Processing! 30/09/2024 Large Language Models
PIDM] Diffusion Model With Physical Regularization PIDM] Diffusion Model With Physical Regularization 30/09/2024 Diffusion Model
TryOnDiffusion: The Most Powerful Model For Generating Fitting Images TryOnDiffusion: The Most Powerful Model For Generating Fitting Images 30/09/2024 Image Generation
[WavLM] Past All Speech Recognition Models! What Is The Structure And Performance? [WavLM] Past All Speech Recognition Models! What Is The Structure And Performance? 30/09/2024 Speech Processing
V3D: 3D Object Generation From A Single Image V3D: 3D Object Generation From A Single Image 30/09/2024 3D
See Finer, See More: Implicit Modality Alignment For Text-Based Person Search See Finer, See More: Implicit Modality Alignment For Text-Based Person Search 29/09/2024 Deep Learning
[OmniGen] All Image-related Tasks Can Be Performed With Only One Generation Model! [OmniGen] All Image-related Tasks Can Be Performed With Only One Generation Model! 29/09/2024 Image Generation
[LDDGAN] Diffusion Model With The Highest Speed Inference [LDDGAN] Diffusion Model With The Highest Speed Inference 29/09/2024 Diffusion Model
I Want To Use A Speech Activation System Even If I Have Dysarthria! Corpus For Speech Activation Systems And What Is A Speech Acti ... I Want To Use A Speech Activation System Even If I Have Dysarthria! Corpus For Speech Activation Sys ... 28/09/2024 Sound
Manufacturing Revolution In The Shop Floor: Programmable Materials And Modular Assemblies At The Forefront Manufacturing Revolution In The Shop Floor: Programmable Materials And Modular Assemblies At The For ... 28/09/2024 Manufacturing
Frontiers Of Manufacturing Service Recommendation Combining Knowledge Graph And ChatGPT Frontiers Of Manufacturing Service Recommendation Combining Knowledge Graph And ChatGPT 28/09/2024 Manufacturing
AdaptIoT] Self-labeling System Using Cause-and-effect Relationships In The Manufacturing Industry AdaptIoT] Self-labeling System Using Cause-and-effect Relationships In The Manufacturing Industry 27/09/2024 Internet Of Things
[NVLM] Multimodal LLM Outperforms GPT-4o In Image And Language Tasks [NVLM] Multimodal LLM Outperforms GPT-4o In Image And Language Tasks 27/09/2024 Large Language Models
It Is Clear That Human Memory Characteristics Are Present In LLM! It Is Clear That Human Memory Characteristics Are Present In LLM! 27/09/2024 Large Language Models
A Paper Examining Whether LLMs Understand Cultural Common Sense Is Now Available! A Paper Examining Whether LLMs Understand Cultural Common Sense Is Now Available! 27/09/2024 Cultural Commonsense
Wavelet Diffusion: The Fastest Diffusion Model Wavelet Diffusion: The Fastest Diffusion Model 16/04/2024 Image Generation
Improved Accuracy And Transparency Of Face Recognition With ChatGPT, New Developments In Soft Biometrics Improved Accuracy And Transparency Of Face Recognition With ChatGPT, New Developments In Soft Biomet ... 08/04/2024 Large Language Models
[RL-GPT] A Framework To Acquire Diamonds Several Times Faster Than Usual With Mincraft Is Now Available [RL-GPT] A Framework To Acquire Diamonds Several Times Faster Than Usual With Mincraft Is Now Availa ... 18/04/2024 Machine Learning
A Framework Is Now Available That Allows LLMs To Assess Human Personality Using The MBTI! A Framework Is Now Available That Allows LLMs To Assess Human Personality Using The MBTI! 22/04/2024 ChatGPT
VCRL: A New Approach To LLM Reinforcement Learning That Controls Learning Difficulty With Reward Variance VCRL: A New Approach To LLM Reinforcement Learning That Controls Learning Difficulty With Reward Var ... 03/10/2025
Easy! High Accuracy! Attractiveness Of The Anomaly Detection Model PatchCore Easy! High Accuracy! Attractiveness Of The Anomaly Detection Model PatchCore 12/07/2023 Object Detection
LLM From Memory To Retrieval: Theoretical Advantages And Demonstrations Of In-Tool Learning LLM From Memory To Retrieval: Theoretical Advantages And Demonstrations Of In-Tool Learning 25/09/2025
[DPO] A Method For Directly Matching Large-scale Language Models To User Preferences Without Using Reinforcement Learning [DPO] A Method For Directly Matching Large-scale Language Models To User Preferences Without Using R ... 02/02/2024 RLHF
Superior To ViT! A New Underlying Model For Large-Scale CNNs! : InternImage Superior To ViT! A New Underlying Model For Large-Scale CNNs! : InternImage 13/01/2023 Deep Learning
The Challenge Of Social-MAE, A Social AI That Uses Self-supervised Learning To Decipher Emotions, Laughter, And Personality The Challenge Of Social-MAE, A Social AI That Uses Self-supervised Learning To Decipher Emotions, La ... 02/10/2025
Machine Learning Model Tackles Soccer Match Prediction In Sports Betting Machine Learning Model Tackles Soccer Match Prediction In Sports Betting 29/01/2025 Sports Analytics
ImmerseGen: Agent-guided, Lightweight X Highly Realistic Next-generation VR Scene Generation ImmerseGen: Agent-guided, Lightweight X Highly Realistic Next-generation VR Scene Generation 24/07/2025
Qwen2-VL] Latest VLM That Can Process Images And Videos In Different Resolutions Qwen2-VL] Latest VLM That Can Process Images And Videos In Different Resolutions 01/10/2024 Large Language Models
[Swin Transformer] Transformer-based Image Recognition Models To Keep Now! [Swin Transformer] Transformer-based Image Recognition Models To Keep Now! 22/03/2024 Image Recognition