[InsectMamba] Classification Of Pests Using State Space Models To Support Smart Agriculture [InsectMamba] Classification Of Pests Using State Space Models To Support Smart Agriculture 04/09/2024 Computer Vision
[CoMat] Resolve The Discrepancy Between Text And Image [CoMat] Resolve The Discrepancy Between Text And Image 28/08/2024 Computer Vision
[OW-VISCap] Look Out For Unseen Objects - A New Approach To Understanding Open World Video [OW-VISCap] Look Out For Unseen Objects - A New Approach To Understanding Open World Video 21/08/2024 Computer Vision
Assessing The Robustness Of Zero-shot Image Understanding Models Through CLIP Assessing The Robustness Of Zero-shot Image Understanding Models Through CLIP 24/06/2024 Contrastive Learning
[VideoAgent] Understanding Long-form Video Using A Large-scale Language Model As An Agent [VideoAgent] Understanding Long-form Video Using A Large-scale Language Model As An Agent 21/06/2024 Computer Vision
[Segment Anything] Zero-shot Segmentation Model [Segment Anything] Zero-shot Segmentation Model 18/06/2024 Segmentation
Apple Developed A Large Scale Autoregressive Image Model That Is Scalable Like An LLM. Apple Developed A Large Scale Autoregressive Image Model That Is Scalable Like An LLM. 07/05/2024 Computer Vision
[Swin Transformer] Transformer-based Image Recognition Models To Keep Now! [Swin Transformer] Transformer-based Image Recognition Models To Keep Now! 22/03/2024 Image Recognition
[DiffYOLO] Innovative Framework Improves Object Detection With Low Quality Data [DiffYOLO] Innovative Framework Improves Object Detection With Low Quality Data 18/03/2024 Computer Vision
InstructPix2Pix: A New Model For Image Editing At The User's Direction InstructPix2Pix: A New Model For Image Editing At The User's Direction 28/02/2024 Computer Vision
[mPLUG-Owl] Developing An LLM That Can Understand Images And Text [mPLUG-Owl] Developing An LLM That Can Understand Images And Text 06/02/2024 Computation And Language
T2I-Adapter: Frontiers In Text-to-Image Conversion Technology T2I-Adapter: Frontiers In Text-to-Image Conversion Technology 25/01/2024 Computer Vision
ImageBind: Bringing All Information Together To Create New Knowledge ImageBind: Bringing All Information Together To Create New Knowledge 24/01/2024 Machine Learning
[Set-of-Mark Visual Prompting] Prompting Technology To Enhance GPT-4V's Image Recognition Capability [Set-of-Mark Visual Prompting] Prompting Technology To Enhance GPT-4V's Image Recognition Capability 18/01/2024 Prompting Method
[CoDi] Any-to-any Diffusion Model That Can Handle Almost Any Modality [CoDi] Any-to-any Diffusion Model That Can Handle Almost Any Modality 12/01/2024 Diffusion Model
Multimodal GPT-4 And LLaVA Integration Of Advanced Image Understanding And Natural Language Interaction Multimodal GPT-4 And LLaVA Integration Of Advanced Image Understanding And Natural Language Interact ... 09/01/2024 Computer Vision
Mask R-CNN: Efficient Detection Of Objects In Images Mask R-CNN: Efficient Detection Of Objects In Images 04/01/2024 Computer Vision
U-Net: Convolutional Networks For Biomedical Image Segmentation U-Net: Convolutional Networks For Biomedical Image Segmentation 29/12/2023 Computer Vision