Catch up on the latest AI articles

What is AI-SCHOLAR?

Wavelet Diffusion: The Fastest Diffusion Model

Wavelet Diffusion: The Fastest Diffusion Model

Image Generation: 16/04/2024

New Method "USO" By Separate Learning And Reward Learning: The Frontier Of Image Generation Integrating Style And Subject

New Method "USO" By Separate Learning And Reward Learning: The Frontier Of Image Generation Integrat ...

RStar2-Agent: State-of-the-Art Mathematical Reasoning Reached By Efficient Agent-Based Reinforcement Learning With GRPO-RoC

RStar2-Agent: State-of-the-Art Mathematical Reasoning Reached By Efficient Agent-Based Reinforcement ...

Pref-GRPO: A New Method For Stable Reinforcement Learning Of Text Image Generation Using Pairwise Comparison

Pref-GRPO: A New Method For Stable Reinforcement Learning Of Text Image Generation Using Pairwise Co ...

TRACEALIGN: Tracing Causes Of Alignment Drift In Large Language Models And Defensive Measures

TRACEALIGN: Tracing Causes Of Alignment Drift In Large Language Models And Defensive Measures

AlignGuard-LoRA: A New Regularization Method That Combines Efficient Fine-Tuning And Safety Preservation

AlignGuard-LoRA: A New Regularization Method That Combines Efficient Fine-Tuning And Safety Preserva ...

ChartCap: Suppressing Chart Captioning Hallucinations With Large Data Sets And New Evaluation Indexes

ChartCap: Suppressing Chart Captioning Hallucinations With Large Data Sets And New Evaluation Indexe ...

LAMIC: A Learning-free, Layout-controllable, Multi-reference Image Generation Method

LAMIC: A Learning-free, Layout-controllable, Multi-reference Image Generation Method

LiveMCPBench: A New Benchmark For Evaluating LLM Agents In Large Tool Environments

LiveMCPBench: A New Benchmark For Evaluating LLM Agents In Large Tool Environments

Goedel-Prover-V2: New Developments In Efficient Automated Theorem Proving By Self-Correction And Stepwise Data Synthesis

Goedel-Prover-V2: New Developments In Efficient Automated Theorem Proving By Self-Correction And Ste ...

New Developments In Multi-person Conversation Video Generation! MIT Dataset And Baseline Model "CovOG

New Developments In Multi-person Conversation Video Generation! MIT Dataset And Baseline Model "CovO ...

ToolTrain: A New Method For Repository Deep Search And Issue Localization With LLM

ToolTrain: A New Method For Repository Deep Search And Issue Localization With LLM

Mechanism And Effect Of "Representation Shift" Token Compression For FlashAttention

Mechanism And Effect Of "Representation Shift" Token Compression For FlashAttention

CRINN: Automatic Optimization Of Approximate Nearest Neighbor Search Algorithms Using Reinforcement Learning

CRINN: Automatic Optimization Of Approximate Nearest Neighbor Search Algorithms Using Reinforcement ...

CompassVerifier: A New Benchmark And Robust Model To Revolutionize LLM Solution Verification

CompassVerifier: A New Benchmark And Robust Model To Revolutionize LLM Solution Verification

LongVie: A New Era Of 1-minute Ultra-High Quality Video Generation Realized By Multimodal Control

LongVie: A New Era Of 1-minute Ultra-High Quality Video Generation Realized By Multimodal Control

Skywork UniPic: Next-generation Multimodal Model That Integrates Image Understanding, Generation, And Editing With High Efficiency

Skywork UniPic: Next-generation Multimodal Model That Integrates Image Understanding, Generation, An ...

Seed Diffusion Preview: Next-generation Code Generation Model That Combines Fast Inference And High Performance

Seed Diffusion Preview: Next-generation Code Generation Model That Combines Fast Inference And High ...

MATE: Multi-agent Accessibility-specific Modality Transformation Framework

MATE: Multi-agent Accessibility-specific Modality Transformation Framework

Wavelet Diffusion: The Fastest Diffusion Model

Wavelet Diffusion: The Fastest Diffusion Model

16/04/2024 Image Generation

Improved Accuracy And Transparency Of Face Recognition With ChatGPT, New Developments In Soft Biometrics

Improved Accuracy And Transparency Of Face Recognition With ChatGPT, New Developments In Soft Biomet ...

08/04/2024 Large Language Models

[RL-GPT] A Framework To Acquire Diamonds Several Times Faster Than Usual With Mincraft Is Now Available

[RL-GPT] A Framework To Acquire Diamonds Several Times Faster Than Usual With Mincraft Is Now Availa ...

18/04/2024 Machine Learning

A Framework Is Now Available That Allows LLMs To Assess Human Personality Using The MBTI!

A Framework Is Now Available That Allows LLMs To Assess Human Personality Using The MBTI!

22/04/2024 ChatGPT

MMR1: A Multimodal Inference Model That Stabilizes Reinforcement Learning With Sampling Based On Reward Variance

MMR1: A Multimodal Inference Model That Stabilizes Reinforcement Learning With Sampling Based On Rew ...

VCRL: A New Approach To LLM Reinforcement Learning That Controls Learning Difficulty With Reward Variance

VCRL: A New Approach To LLM Reinforcement Learning That Controls Learning Difficulty With Reward Var ...

Qwen2.5-Coder] LLM Specialized For Code Generation, Completion, And Mathematical Reasoning Tasks

Qwen2.5-Coder] LLM Specialized For Code Generation, Completion, And Mathematical Reasoning Tasks

11/10/2024 Large Language Models

Diffusion Policy : Diffusion Models For Robots! When Robots Can Make Pizza!

Diffusion Policy : Diffusion Models For Robots! When Robots Can Make Pizza!

06/11/2023 Diffusion Model

Easy! High Accuracy! Attractiveness Of The Anomaly Detection Model PatchCore

Easy! High Accuracy! Attractiveness Of The Anomaly Detection Model PatchCore

12/07/2023 Object Detection

First Systematic Review Of The "Dataset For Evaluating The Safety Of LLMs"

First Systematic Review Of The "Dataset For Evaluating The Safety Of LLMs"

22/11/2024 Large Language Models

AgentBench, A Comprehensive Benchmark For Evaluating AI Agent Performance, Is Now Available!

AgentBench, A Comprehensive Benchmark For Evaluating AI Agent Performance, Is Now Available!

21/09/2023 Agent Simulation

CogVideo, An Open Source Model Capable Of Generating Video From Text, Is Now Available!

CogVideo, An Open Source Model Capable Of Generating Video From Text, Is Now Available!

11/10/2022 Video Generation