[SKETCHPAD] Enhanced Inference Of Multimodal Language Models With Intermediate Sketches 18/12/2024 Large Language Models
[Set-of-Mark Visual Prompting] Prompting Technology To Enhance GPT-4V's Image Recognition Capability 18/01/2024 Prompting Method
[PETRv2] Estimates The 3D Position Of An Object Using Only Camera Images. 10/11/2023 Object Detection
A GAN-based Image Generation Method That Revolutionizes The Generation Of Annotated Datasets 12/07/2023 Image Generation
Introduction To Kubric, A Large-scale Data Generation Library 10/07/2023 Dataset
Vision GNN, A Computer Vision Model Using Graph Structure 06/06/2023 GNN
"VoxFormer" Generates 3D Volumes From Images For Use In Automated Driving Technology. 28/04/2023 Object Detection
Superior To ViT! A New Underlying Model For Large-Scale CNNs! : InternImage 13/01/2023 Deep Learning
StrongSORT: DeepSORT Is Back Stronger! Upgraded Tracking Model! 31/12/2022 Object Tracking
GAIA, A Transfer Learning System That Can Handle Any Downstream Task 08/12/2022 Transfer Learning
GIAOTracker: Proposing A Comprehensive Framework For Multi-class, Multi-object Tracking! 30/11/2022 Object Tracking
Can The Robustness Gained From ImageNet Training Be Used For Downstream Tasks In Transfer Learning? 29/08/2022 Robust
Medical Image Analysis Using W&D (Wide And Deep Network Model) 16/07/2022 Transfer Learning
Proposal For An Out-of-distribution Detection Method And A New Benchmark That Allows Models To Identify 24/06/2022 Out-Of-Distribution
Few-shot Object Detection That Does Not Forget The Base Class Either 10/06/2022 Object Detection
The Embarrassingly Simple Vision Transformer 04/01/2022 Transformer
This Model Makes Efficient Real-Time Video Object Segmentation Possible For The First Time! 24/03/2021 Video Object Segmentation
SpineNet, An AI-discovered Backbone Model With Outstanding Detection Accuracy 11/09/2020 Object Detection