YesBut: The Emergence Of A Dataset That Makes The VLM Understand Irony And Caricature! 22/11/2024 Dataset
Comprehensive Evaluation Of Generalized Emotion Recognition (GER) Using GPT-4V 06/11/2024 Large Language Models
[MMSEARCH] Multimodal Search System Integrating Image And Text 29/10/2024 Large Language Models
GestaltMML, A Multimodal Model For The Diagnosis Of Rare Genetic Disorders 13/10/2024 Large Language Models
[Qwen2-VL] Latest VLM That Can Process Images And Videos In Different Resolutions 01/10/2024 Large Language Models
TryOnDiffusion: The Most Powerful Model For Generating Fitting Images 30/09/2024 Image Generation
See Finer, See More: Implicit Modality Alignment For Text-Based Person Search 29/09/2024 Deep Learning
[OmniGen] All Image-related Tasks Can Be Performed With Only One Generation Model! 29/09/2024 Image Generation
[LDDGAN] Diffusion Model With The Fastest Inference 29/09/2024 Diffusion Model
[NVLM] Multimodal LLM Outperforms GPT-4o In Image And Language Tasks 27/09/2024 Large Language Models
New Frontier Of Deepfake Detection Using CLIP 30/08/2024 Fake Detection
GenTron: Diffusion Transformers For Image And Video Generation 26/08/2024 Image Generation
Diffusion2GAN: Knowledge Distillation Of Diffusion Models Into Conditional GANs 26/08/2024 Image Generation
How Frame Interpolation AI Technologies RIFE & IFNet Work And How To Use Them 20/08/2024 Image Generation
"AVI-Talking" Generates Natural 3D Talking Faces From Audio 17/08/2024 Face Recognition
Next-generation Deepfake Detection Technology Using Frequency Masks 29/07/2024 Fake Detection
[FreqNet] Generic Deepfake Detection By Learning In Frequency Space 29/07/2024 Fake Detection
[DIFFUSSM] Attention-independent Diffusion Model 29/07/2024 Image Generation