[InsectMamba] Classification Of Pests Using State Space Models To Support Smart Agriculture [InsectMamba] Classification Of Pests Using State Space Models To Support Smart Agriculture 04/09/2024 Computer Vision
[CoMat] Resolve The Discrepancy Between Text And Image [CoMat] Resolve The Discrepancy Between Text And Image 28/08/2024 Computer Vision
[OW-VISCap] Look Out For Unseen Objects - A New Approach To Understanding Open World Video [OW-VISCap] Look Out For Unseen Objects - A New Approach To Understanding Open World Video 21/08/2024 Computer Vision
Assessing The Robustness Of Zero-shot Image Understanding Models Through CLIP Assessing The Robustness Of Zero-shot Image Understanding Models Through CLIP 24/06/2024 Contrastive Learning
[VideoAgent] Understanding Long-form Video Using A Large-scale Language Model As An Agent [VideoAgent] Understanding Long-form Video Using A Large-scale Language Model As An Agent 21/06/2024 Computer Vision
[DiffYOLO] Innovative Framework Improves Object Detection With Low Quality Data [DiffYOLO] Innovative Framework Improves Object Detection With Low Quality Data 18/03/2024 Computer Vision
Mobile-Agent: Automation Of Mobile App Operations Through Screenshot Analysis Mobile-Agent: Automation Of Mobile App Operations Through Screenshot Analysis 06/03/2024 Pattern Recognition
InstructPix2Pix: A New Model For Image Editing At The User's Direction InstructPix2Pix: A New Model For Image Editing At The User's Direction 28/02/2024 Computer Vision
[mPLUG-Owl] Developing An LLM That Can Understand Images And Text [mPLUG-Owl] Developing An LLM That Can Understand Images And Text 06/02/2024 Computation And Language
T2I-Adapter: Frontiers In Text-to-Image Conversion Technology T2I-Adapter: Frontiers In Text-to-Image Conversion Technology 25/01/2024 Computer Vision
ImageBind: Bringing All Information Together To Create New Knowledge ImageBind: Bringing All Information Together To Create New Knowledge 24/01/2024 Machine Learning
Multimodal GPT-4 And LLaVA Integration Of Advanced Image Understanding And Natural Language Interaction Multimodal GPT-4 And LLaVA Integration Of Advanced Image Understanding And Natural Language Interact ... 09/01/2024 Computer Vision
Mask R-CNN: Efficient Detection Of Objects In Images Mask R-CNN: Efficient Detection Of Objects In Images 04/01/2024 Computer Vision
U-Net: Convolutional Networks For Biomedical Image Segmentation U-Net: Convolutional Networks For Biomedical Image Segmentation 29/12/2023 Computer Vision
Very Deep Convolutional Networks For Large-scale Image Recognition Very Deep Convolutional Networks For Large-scale Image Recognition 28/12/2023 Image Recognition
Enhanced Diffusion Models Utilizing Constraints Of 3D Perspective Geometry Enhanced Diffusion Models Utilizing Constraints Of 3D Perspective Geometry 27/12/2023 Computer Vision