Catch up on the latest AI articles

Skywork UniPic: Next-generation Multimodal Model That Integrates Image Understanding, Generation, And Editing With High Efficiency

Seed Diffusion Preview: Next-generation Code Generation Model That Combines Fast Inference And High Performance

Innovations In Outlier-Safe Pre-Training For Large Language Models To Prevent Outliers And Protect Quantization Accuracy

Toward AI That Doesn't Forget Images, CoMemo Pioneers Next-generation Vision And Language Models

PictSure: A New Method To Challenge Few-Shot Classification With The Power Of Visual Embedding

A New Wave Of Multispeaker Speech Recognition! The Challenge Of High Accuracy Systems By DiCoW And DiariZen

Ultra-Sparse Memory Network: A New Method To Change Transformer Memory Efficiency

Hymba, A New Architecture That Pushes The Limits Of Small LLMs

Insight-V: A New Strategy For Multimodal Reasoning Connecting Vision And Thought

Stable Flow: Visualization Of The "Really Important Layers" Behind Image Generation

Open Vocabulary Object Detection Enabled By OWL-ViT

28/02/2025 Neural Network

Classification Tasks - Extremely Difficult! Use The WHFEMD Algorithm To Accurately And Efficiently Capture And Classify Features O ...

14/02/2025 Speech Recognition For The Dysarthric

Giving LLMs A Whiteboard To Write Down Their Reasoning Process Greatly Improves Their Visual Reasoning Ability!

26/12/2024 Prompting Method

Cross-Layer Attention Significantly Reduces Transformer Memory

10/12/2024 Transformer

YesBut: The Emergence Of A Dataset That Makes The VLM Understand Irony And Caricature!

22/11/2024 Dataset

[SCoRe] Reinforcement Learning To Enhance LLM's Ability To Self-correct! Identify And Correct Errors In A Multi-step Process

31/10/2024 Large Language Models

AI To Transform Mathematics Education; Possibilities And Challenges Of Solving Mathematical Problems Using Large-Scale Language Mo ...

16/10/2024 Large Language Models

A Better Attention Mechanism Will Improve The Performance Of LLM's Long-text Processing!

30/09/2024 Large Language Models