Articles
Ultra-Sparse Memory Network: A New Method To Change Transformer Memory Efficiency
Ultra-Sparse Memory Network: A New Method To Change Transformer Memory Efficiency
Hymba, A New Architecture That Pushes The Limits Of Small LLMs
Hymba, A New Architecture That Pushes The Limits Of Small LLMs
Insight-V: A New Strategy For Multimodal Reasoning Connecting Vision And Thought
Insight-V: A New Strategy For Multimodal Reasoning Connecting Vision And Thought
Stable Flow: Visualization Of The "really Important Layers" Behind Image Generation
Stable Flow: Visualization Of The "really Important Layers" Behind Image Generation
Open Vocabulary Object Detection Enabled By OWL-ViT
Open Vocabulary Object Detection Enabled By OWL-ViT
Neural Network
SOK-Bench] Situational Video Inference Benchmark Using Real-World Knowledge In Video
SOK-Bench] Situational Video Inference Benchmark Using Real-World Knowledge In Video
Computer Vision
Libra] A New Multimodal Design Of Large Language Models Using Separate Vision Systems
Libra] A New Multimodal Design Of Large Language Models Using Separate Vision Systems
Large Language Models
DrHouse] Diagnostic System Using Sensor Information And Expertise
DrHouse] Diagnostic System Using Sensor Information And Expertise
Medical
A New Method For Global Description Of Heterogeneous Graph Neural Networks Using Description Logic
A New Method For Global Description Of Heterogeneous Graph Neural Networks Using Description Logic
GNN
A Comprehensive Survey Of The Current Status And Challenges Of AI-Based Predictive Maintenance In The Steel Industry
A Comprehensive Survey Of The Current Status And Challenges Of AI-Based Predictive Maintenance In Th ...
Prediction Model
Proposal Of An Optimization Method For Activation Functions And CRReLU Using Information Entropy
Proposal Of An Optimization Method For Activation Functions And CRReLU Using Information Entropy
Loss Function
[For Everyone To Enjoy The Convenience... Speaker Adaptation Of Dysarthric Speech Using Whisper
[For Everyone To Enjoy The Convenience... Speaker Adaptation Of Dysarthric Speech Using Whisper
Speech Recognition For The Dysarthric
Speech Processing Model That Defies Common Sense! The Amazing Performance Of The Speech Processing Model "SpeechT5" Developed By M ...
Speech Processing Model That Defies Common Sense! The Amazing Performance Of The Speech Processing M ...
Sound
[You're Using Wav2vec2 For This? It Makes Feature Extraction Of Dysarthric Speech More Efficient!
[You're Using Wav2vec2 For This? It Makes Feature Extraction Of Dysarthric Speech More Efficient!
Speech Recognition For The Dysarthric
Classification Tasks - Extremely Difficult! Use The WHFEMD Algorithm To Accurately And Efficiently Capture And Classify Features O ...
Classification Tasks - Extremely Difficult! Use The WHFEMD Algorithm To Accurately And Efficiently C ...
Speech Recognition For The Dysarthric
A Paper That Overturns Conventional Wisdom! The Classification Of Dysarthria Was Based On Noise, Not Characteristics!
A Paper That Overturns Conventional Wisdom! The Classification Of Dysarthria Was Based On Noise, Not ...
Speech Recognition For The Dysarthric
Equal Access To Convenience! EasyCall Corpus", A Speech Corpus For The Dysarthric
Equal Access To Convenience! EasyCall Corpus", A Speech Corpus For The Dysarthric
Speech Recognition For The Dysarthric
Wavelet Diffusion: The Fastest Diffusion Model
Wavelet Diffusion: The Fastest Diffusion Model
Image Generation
Improved Accuracy And Transparency Of Face Recognition With ChatGPT, New Developments In Soft Biometrics
Improved Accuracy And Transparency Of Face Recognition With ChatGPT, New Developments In Soft Biomet ...
Large Language Models
[RL-GPT] A Framework To Acquire Diamonds Several Times Faster Than Usual With Mincraft Is Now Available
[RL-GPT] A Framework To Acquire Diamonds Several Times Faster Than Usual With Mincraft Is Now Availa ...
Machine Learning
A Framework Is Now Available That Allows LLMs To Assess Human Personality Using The MBTI!
A Framework Is Now Available That Allows LLMs To Assess Human Personality Using The MBTI!
ChatGPT
Mask R-CNN: Efficient Detection Of Objects In Images
Mask R-CNN: Efficient Detection Of Objects In Images
Computer Vision
Graphs Are So Awesome! Review Of Integration With Deep Learning
Graphs Are So Awesome! Review Of Integration With Deep Learning
GNN
LLM Agents Successfully Lead Customers To Purchase 35% Of The Time!
LLM Agents Successfully Lead Customers To Purchase 35% Of The Time!
ChatGPT
The First Framework To Utilize LLM To Detect Fake News Is Now Available!
The First Framework To Utilize LLM To Detect Fake News Is Now Available!
Fakenews
[DPO] A Method For Directly Matching Large-scale Language Models To User Preferences Without Using Reinforcement Learning
[DPO] A Method For Directly Matching Large-scale Language Models To User Preferences Without Using R ...
RLHF
Easy! High Accuracy! Attractiveness Of The Anomaly Detection Model PatchCore
Easy! High Accuracy! Attractiveness Of The Anomaly Detection Model PatchCore
Object Detection
[Swin Transformer] Transformer-based Image Recognition Models To Keep Now!
[Swin Transformer] Transformer-based Image Recognition Models To Keep Now!
Image Recognition
What Is Prompt Tuning To Optimize Prompts For High Performance?
What Is Prompt Tuning To Optimize Prompts For High Performance?
Prompting Method
AgentBench, A Comprehensive Benchmark For Evaluating AI Agent Performance, Is Now Available!
AgentBench, A Comprehensive Benchmark For Evaluating AI Agent Performance, Is Now Available!
Agent Simulation
StrongSORT: DeepSORT Is Back Stronger! Upgraded Tracking Model!
StrongSORT: DeepSORT Is Back Stronger! Upgraded Tracking Model!
Object Tracking