Vision GNN, A Computer Vision Model Using Graph Structure 06/06/2023 GNN
The SoTA Model In The Task Of Detecting Fake News On Social Networking Sites Is Now Available! 09/05/2023 Rumor Detection
A New SoTA Model For CQA Tasks That Answers Questions About The Chart Is Now Available! 11/04/2023 Chart Question Answering
ByteTrack+ Appearance Features Are The Strongest: SMILETrack 03/04/2023 Object Tracking
Survey The Latest Transformers For Time Series 20/02/2023 Time-series
Multimodal Emotion Recognition From Text, Voice And Vision: Sony's Proposed M2FNet! 31/01/2023 Emotion Recognition
Increase Affine Accuracy First! Registration Using ViT: C2FViT 25/01/2023 Medical
Superior To ViT! A New Underlying Model For Large-Scale CNNs! : InternImage 13/01/2023 Deep Learning
GRIT, An Image Caption Generation Model That Integrates Two Visual Features And Achieves Significant Accuracy Improvements, Is Now ... 25/10/2022 Image Caption
A Model That Can Predict The Trajectory Of The Eye From The Input Image! 19/10/2022 Transformer
CogVideo, An Open Source Model Capable Of Generating Video From Text, Is Now Available! 11/10/2022 Video Generation
Inverse Synthetic Analysis Model GTA Using Both Graph And SMILES Representations 15/09/2022 Materials Informatics
U-Net And Transformer Combined! Introducing Swin Unet, A New Network For Medical Image Segmentation. 20/05/2022 Medical
Why Are Vision Transformers So High Performance? 16/05/2022 Transformer
We Will Summarize The Results That Transformer Has Brought To Medical Imaging. 11/05/2022 Medical