LAVE, An Agent-assisted Video Editing Tool That Utilizes LLM LAVE, An Agent-assisted Video Editing Tool That Utilizes LLM 13/12/2024 Large Language Models
YesBut: The Emergence Of A Dataset That Makes The VLM Understand Irony And Caricature! YesBut: The Emergence Of A Dataset That Makes The VLM Understand Irony And Caricature! 22/11/2024 Dataset
From Face Recognition To Age Estimation, Potential Biometric Technologies Using ChatGPT-4 From Face Recognition To Age Estimation, Potential Biometric Technologies Using ChatGPT-4 23/05/2024 Large Language Models
[Set-of-Mark Visual Prompting] Prompting Technology To Enhance GPT-4V's Image Recognition Capability [Set-of-Mark Visual Prompting] Prompting Technology To Enhance GPT-4V's Image Recognition Capability 18/01/2024 Prompting Method
[CoDi] Any-to-any Diffusion Model That Can Handle Almost Any Modality [CoDi] Any-to-any Diffusion Model That Can Handle Almost Any Modality 12/01/2024 Diffusion Model
Generating 3D Objects From Text - DreamFusion Generating 3D Objects From Text - DreamFusion 05/12/2022 3D
Summary Of Image Caption Generation Techniques From Attention To GAN-based Methods Summary Of Image Caption Generation Techniques From Attention To GAN-based Methods 29/06/2022 Image Caption