Ferret-UI, A Multimodal Large-scale Language Model For Mobile UI Ferret-UI, A Multimodal Large-scale Language Model For Mobile UI 02/09/2024 Large Language Models
[ClimODE] Weather Forecasting Using Neural ODEs [ClimODE] Weather Forecasting Using Neural ODEs 31/08/2024 Computational Physics
Fusion Of Speech And Image! Does The Multimodal Method "AV-HuBERT" Shine In Speech Recognition For The Dysarthric? Fusion Of Speech And Image! Does The Multimodal Method "AV-HuBERT" Shine In Speech Recognition For T ... 31/08/2024 Speech Recognition For The Dysarthric
New Frontier Of Deep Faking Detection Using CLIP New Frontier Of Deep Faking Detection Using CLIP 30/08/2024 Fake Detection
SkySense: Multimodal Remote Sensing Foundation Model SkySense: Multimodal Remote Sensing Foundation Model 30/08/2024 CVPR
Artificial Intelligence Developed By Meta! How Well Does The "HuBERT" Model, Which Is Different From Conventional Self-supervised ... Artificial Intelligence Developed By Meta! How Well Does The "HuBERT" Model, Which Is Different From ... 29/08/2024 AI For Science
Explainability Techniques To Enhance Predictive Models Of Manufacturing Quality Explainability Techniques To Enhance Predictive Models Of Manufacturing Quality 29/08/2024 Explainable.AI
[CoMat] Resolve The Discrepancy Between Text And Image [CoMat] Resolve The Discrepancy Between Text And Image 28/08/2024 Computer Vision
GMS: Revolutionizing Manufacturing With ChatGPT And Diffusion Models GMS: Revolutionizing Manufacturing With ChatGPT And Diffusion Models 28/08/2024 Manufacturing
[BitNet B1.58] Achieved Accuracy Better Than Llama By Expressing Model Parameters In Three Values! [BitNet B1.58] Achieved Accuracy Better Than Llama By Expressing Model Parameters In Three Values! 27/08/2024 Large Language Models
Google's High-performance LLM That Compresses Very Long Prompt Sentences To Save Memory Google's High-performance LLM That Compresses Very Long Prompt Sentences To Save Memory 27/08/2024 Large Language Models
GenTron: Diffusion Transformers For Image And Video Generation GenTron: Diffusion Transformers For Image And Video Generation 26/08/2024 Image Generation
Diffusion2GAN: Knowledge Distillation Of Diffusion Models Into Conditional GANs Diffusion2GAN: Knowledge Distillation Of Diffusion Models Into Conditional GANs 26/08/2024 Image Generation
FABLES, A Dataset For Book Summarization Consisting Only Of Long Sentences Of 100k Tokens Or More, Is Now Available! FABLES, A Dataset For Book Summarization Consisting Only Of Long Sentences Of 100k Tokens Or More, I ... 23/08/2024 Large Language Models
A Platform For Assessing LLMs' Collaborative Behavior And Ability To Manage Shared Resources Is Now Available! A Platform For Assessing LLMs' Collaborative Behavior And Ability To Manage Shared Resources Is Now ... 22/08/2024 Simulation Platform
GPT-4, Claude 3 Opus, And Gemini 1.0 Ultra Challenge New Frontiers In Control Engineering GPT-4, Claude 3 Opus, And Gemini 1.0 Ultra Challenge New Frontiers In Control Engineering 22/08/2024 Optimization And Control
[OW-VISCap] Look Out For Unseen Objects - A New Approach To Understanding Open World Video [OW-VISCap] Look Out For Unseen Objects - A New Approach To Understanding Open World Video 21/08/2024 Computer Vision
From DNA Analysis To Gene Expression Prediction And Large-scale Language Modeling For Bioinformatics From DNA Analysis To Gene Expression Prediction And Large-scale Language Modeling For Bioinformatics 21/08/2024 Large Language Models
Wavelet Diffusion: The Fastest Diffusion Model Wavelet Diffusion: The Fastest Diffusion Model 16/04/2024 Image Generation
Improved Accuracy And Transparency Of Face Recognition With ChatGPT, New Developments In Soft Biometrics Improved Accuracy And Transparency Of Face Recognition With ChatGPT, New Developments In Soft Biomet ... 08/04/2024 Large Language Models
[RL-GPT] A Framework To Acquire Diamonds Several Times Faster Than Usual With Mincraft Is Now Available [RL-GPT] A Framework To Acquire Diamonds Several Times Faster Than Usual With Mincraft Is Now Availa ... 18/04/2024 Machine Learning
A Framework Is Now Available That Allows LLMs To Assess Human Personality Using The MBTI! A Framework Is Now Available That Allows LLMs To Assess Human Personality Using The MBTI! 22/04/2024 ChatGPT
Seed Diffusion Preview: Next-generation Code Generation Model That Combines Fast Inference And High Performance Seed Diffusion Preview: Next-generation Code Generation Model That Combines Fast Inference And High ... 13/08/2025
Mask R-CNN: Efficient Detection Of Objects In Images Mask R-CNN: Efficient Detection Of Objects In Images 04/01/2024 Computer Vision
Graphs Are So Awesome! Review Of Integration With Deep Learning Graphs Are So Awesome! Review Of Integration With Deep Learning 26/07/2021 GNN
The First Framework To Utilize LLM To Detect Fake News Is Now Available! The First Framework To Utilize LLM To Detect Fake News Is Now Available! 26/05/2024 Fakenews
Democratizing GPT-4o Level Image Generation: The Janus-4o And ShareGPT-4o-Image Challenge Democratizing GPT-4o Level Image Generation: The Janus-4o And ShareGPT-4o-Image Challenge 24/07/2025
LLM Agents Successfully Lead Customers To Purchase 35% Of The Time! LLM Agents Successfully Lead Customers To Purchase 35% Of The Time! 05/02/2025 ChatGPT
Superior To ViT! A New Underlying Model For Large-Scale CNNs! : InternImage Superior To ViT! A New Underlying Model For Large-Scale CNNs! : InternImage 13/01/2023 Deep Learning
ImageReward: A Reward Model That Learns Human Evaluation In Text-to-image ImageReward: A Reward Model That Learns Human Evaluation In Text-to-image 22/09/2023 Alignment
[DPO] A Method For Directly Matching Large-scale Language Models To User Preferences Without Using Reinforcement Learning [DPO] A Method For Directly Matching Large-scale Language Models To User Preferences Without Using R ... 02/02/2024 RLHF
Integration Of Large-scale Language Models In HCI Research And Ethical Issues Integration Of Large-scale Language Models In HCI Research And Ethical Issues 06/09/2024 Large Language Models