ImageBind: Bringing All Information Together To Create New Knowledge 24/01/2024 Machine Learning
[Set-of-Mark Visual Prompting] Prompting Technology To Enhance GPT-4V's Image Recognition Capability 18/01/2024 Prompting Method
[CoDi] Any-to-any Diffusion Model That Can Handle Almost Any Modality 12/01/2024 Diffusion Model
Multimodal GPT-4 And LLaVA Integration Of Advanced Image Understanding And Natural Language Interaction 09/01/2024 Computer Vision
Mask R-CNN: Efficient Detection Of Objects In Images 04/01/2024 Computer Vision
U-Net: Convolutional Networks For Biomedical Image Segmentation 29/12/2023 Computer Vision
Very Deep Convolutional Networks For Large-scale Image Recognition 28/12/2023 Image Recognition
Enhanced Diffusion Models Utilizing Constraints Of 3D Perspective Geometry 27/12/2023 Computer Vision