Articles
Dress&Dance: Video Diffusion Model For Highly Accurate Virtual Fitting And Motion Generation
Dress&Dance: Video Diffusion Model For Highly Accurate Virtual Fitting And Motion Generation
ROSE: A New Method And Benchmark For Video Object Removal With Side Effects
ROSE: A New Method And Benchmark For Video Object Removal With Side Effects
LLM From Memory To Retrieval: Theoretical Advantages And Demonstrations Of In-Tool Learning
LLM From Memory To Retrieval: Theoretical Advantages And Demonstrations Of In-Tool Learning
FakeParts: New Benchmark Reveals Partial Deep Fake Threats And Detection Limits
FakeParts: New Benchmark Reveals Partial Deep Fake Threats And Detection Limits
Next Generation VLA Model By CogVLA! Instruction-driven Routing And Efficient Robot Operation Based On Cognitive Science
Next Generation VLA Model By CogVLA! Instruction-driven Routing And Efficient Robot Operation Based ...
Exploring LLM's Persuasion Resistance And Flexibility! New Evaluation And Training Methods With DuET-PD And Holistic DPO
Exploring LLM's Persuasion Resistance And Flexibility! New Evaluation And Training Methods With DuET ...
Seedream 3.0 Fill: Next-generation Mask Editing With OneReward
Seedream 3.0 Fill: Next-generation Mask Editing With OneReward
MVTracker: A Multi-view 3D Point Tracking Method That Achieves High Accuracy With A Small Number Of Cameras
MVTracker: A Multi-view 3D Point Tracking Method That Achieves High Accuracy With A Small Number Of ...
LLM Safety Amplification Achieved By Rank 1 Update! ROSI Mechanism And Experimental Results
LLM Safety Amplification Achieved By Rank 1 Update! ROSI Mechanism And Experimental Results
LLM Learning That Combines Diversity And Task Specialization: TCIA Mechanism And Experimental Results
LLM Learning That Combines Diversity And Task Specialization: TCIA Mechanism And Experimental Result ...
Innovation In Feature Video Generation With Mixture Of Contexts! Efficient Context Preservation And High Precision Generation
Innovation In Feature Video Generation With Mixture Of Contexts! Efficient Context Preservation And ...
AWORLD: Efficient Learning Platform For Agent AI With A Distributed Framework
AWORLD: Efficient Learning Platform For Agent AI With A Distributed Framework
MCP-Bench Opens Up A New Wave Of LLM Agent Evaluation! Challenges For Complex Tasks And Real-World Scenarios
MCP-Bench Opens Up A New Wave Of LLM Agent Evaluation! Challenges For Complex Tasks And Real-World S ...
New Method "USO" By Separate Learning And Reward Learning: The Frontier Of Image Generation Integrating Style And Subject
New Method "USO" By Separate Learning And Reward Learning: The Frontier Of Image Generation Integrat ...
RStar2-Agent: State-of-the-Art Mathematical Reasoning Reached By Efficient Agent-Based Reinforcement Learning With GRPO-RoC
RStar2-Agent: State-of-the-Art Mathematical Reasoning Reached By Efficient Agent-Based Reinforcement ...
Pref-GRPO: A New Method For Stable Reinforcement Learning Of Text Image Generation Using Pairwise Comparison
Pref-GRPO: A New Method For Stable Reinforcement Learning Of Text Image Generation Using Pairwise Co ...
TRACEALIGN: Tracing Causes Of Alignment Drift In Large Language Models And Defensive Measures
TRACEALIGN: Tracing Causes Of Alignment Drift In Large Language Models And Defensive Measures
Wavelet Diffusion: The Fastest Diffusion Model
Wavelet Diffusion: The Fastest Diffusion Model
Image Generation
Improved Accuracy And Transparency Of Face Recognition With ChatGPT, New Developments In Soft Biometrics
Improved Accuracy And Transparency Of Face Recognition With ChatGPT, New Developments In Soft Biomet ...
Large Language Models
[RL-GPT] A Framework To Acquire Diamonds Several Times Faster Than Usual With Mincraft Is Now Available
[RL-GPT] A Framework To Acquire Diamonds Several Times Faster Than Usual With Mincraft Is Now Availa ...
Machine Learning
A Framework Is Now Available That Allows LLMs To Assess Human Personality Using The MBTI!
A Framework Is Now Available That Allows LLMs To Assess Human Personality Using The MBTI!
ChatGPT
Mask R-CNN: Efficient Detection Of Objects In Images
Mask R-CNN: Efficient Detection Of Objects In Images
Computer Vision
Seed Diffusion Preview: Next-generation Code Generation Model That Combines Fast Inference And High Performance
Seed Diffusion Preview: Next-generation Code Generation Model That Combines Fast Inference And High ...
The First Framework To Utilize LLM To Detect Fake News Is Now Available!
The First Framework To Utilize LLM To Detect Fake News Is Now Available!
Fakenews
Democratizing GPT-4o Level Image Generation: The Janus-4o And ShareGPT-4o-Image Challenge
Democratizing GPT-4o Level Image Generation: The Janus-4o And ShareGPT-4o-Image Challenge
Graphs Are So Awesome! Review Of Integration With Deep Learning
Graphs Are So Awesome! Review Of Integration With Deep Learning
GNN
LLM Agents Successfully Lead Customers To Purchase 35% Of The Time!
LLM Agents Successfully Lead Customers To Purchase 35% Of The Time!
ChatGPT
Superior To ViT! A New Underlying Model For Large-Scale CNNs! : InternImage
Superior To ViT! A New Underlying Model For Large-Scale CNNs! : InternImage
Deep Learning
[DPO] A Method For Directly Matching Large-scale Language Models To User Preferences Without Using Reinforcement Learning
[DPO] A Method For Directly Matching Large-scale Language Models To User Preferences Without Using R ...
RLHF
RStar2-Agent: State-of-the-Art Mathematical Reasoning Reached By Efficient Agent-Based Reinforcement Learning With GRPO-RoC
RStar2-Agent: State-of-the-Art Mathematical Reasoning Reached By Efficient Agent-Based Reinforcement ...
Easy! High Accuracy! Attractiveness Of The Anomaly Detection Model PatchCore
Easy! High Accuracy! Attractiveness Of The Anomaly Detection Model PatchCore
Object Detection