LP-MusicCaps] Automatic Generation Of Music Captions Using LLM LP-MusicCaps] Automatic Generation Of Music Captions Using LLM 20/11/2023 Contrastive Learning
I-ViT: Compute ViT In Integer Type! ?Shiftmax And ShiftGELU, Which Evolved From I-BERT Technology, Are Also Available! I-ViT: Compute ViT In Integer Type! ?Shiftmax And ShiftGELU, Which Evolved From I-BERT Technology, A ... 16/11/2023 Transformer
[PETRv2] Estimates The 3D Position Of An Object Using Only Camera Images. [PETRv2] Estimates The 3D Position Of An Object Using Only Camera Images. 10/11/2023 Object Detection
What Is Prompt Tuning To Optimize Prompts For High Performance? What Is Prompt Tuning To Optimize Prompts For High Performance? 25/10/2023 Prompting Method
MuLan] Multimodal Music-Text Using Contrastive Learning MuLan] Multimodal Music-Text Using Contrastive Learning 24/10/2023 Contrastive Learning
[MusicLM] Text-to-Music Generation Model Developed By Google. [MusicLM] Text-to-Music Generation Model Developed By Google. 18/10/2023 Transformer
Sparse Transformers: An Innovative Approach To The Problem Of Increasing Computational Complexity With Input Sequence Length Sparse Transformers: An Innovative Approach To The Problem Of Increasing Computational Complexity Wi ... 07/09/2023 Transformer
Breaking Through The Barriers Of Computation Time And Memory! Breaking Through The Barriers Of Computation Time And Memory! 01/09/2023 Transformer
LONGNET: Model Capable Of Processing Text Up To 1 Billion Tokens LONGNET: Model Capable Of Processing Text Up To 1 Billion Tokens 28/08/2023 Transformer
Vision GNN, A Computer Vision Model Using Graph Structure Vision GNN, A Computer Vision Model Using Graph Structure 06/06/2023 GNN
The SoTA Model In The Task Of Detecting Fake News On Social Networking Sites Is Now Available! The SoTA Model In The Task Of Detecting Fake News On Social Networking Sites Is Now Available! 09/05/2023 Rumor Detection
A New SoTA Model For CQA Tasks That Answers Questions About The Chart Is Now Available! A New SoTA Model For CQA Tasks That Answers Questions About The Chart Is Now Available! 11/04/2023 Chart Question Answering
ByteTrack+ Appearance Features Are The Strongest: SMILETrack ByteTrack+ Appearance Features Are The Strongest: SMILETrack 03/04/2023 Object Tracking
Survey The Latest Transformers For Time Series Survey The Latest Transformers For Time Series 20/02/2023 Time-series
Multimodal Emotion Recognition From Text, Voice And Vision: Sony's Proposed M2FNet! Multimodal Emotion Recognition From Text, Voice And Vision: Sony's Proposed M2FNet! 31/01/2023 Emotion Recognition
Increase Affine Accuracy First! Registration Using ViT: C2FViT Increase Affine Accuracy First! Registration Using ViT: C2FViT 25/01/2023 Medical
Superior To ViT! A New Underlying Model For Large-Scale CNNs! : InternImage Superior To ViT! A New Underlying Model For Large-Scale CNNs! : InternImage 13/01/2023 Deep Learning