Articles
VCRL: A New Approach To LLM Reinforcement Learning That Controls Learning Difficulty With Reward Variance
VCRL: A New Approach To LLM Reinforcement Learning That Controls Learning Difficulty With Reward Var ...
The Challenge Of Social-MAE, A Social AI That Uses Self-supervised Learning To Decipher Emotions, Laughter, And Personality
The Challenge Of Social-MAE, A Social AI That Uses Self-supervised Learning To Decipher Emotions, La ...
OnGoal: New Chat Interface To Visualize The Goals Of LLM Dialogue
OnGoal: New Chat Interface To Visualize The Goals Of LLM Dialogue
TriMM: Collaborative Multimodal Coding For High-quality 3D Generation
TriMM: Collaborative Multimodal Coding For High-quality 3D Generation
Dress&Dance: Video Diffusion Model For Highly Accurate Virtual Fitting And Motion Generation
Dress&Dance: Video Diffusion Model For Highly Accurate Virtual Fitting And Motion Generation
ROSE: A New Method And Benchmark For Video Object Removal With Side Effects
ROSE: A New Method And Benchmark For Video Object Removal With Side Effects
LLM From Memory To Retrieval: Theoretical Advantages And Demonstrations Of In-Tool Learning
LLM From Memory To Retrieval: Theoretical Advantages And Demonstrations Of In-Tool Learning
FakeParts: New Benchmark Reveals Partial Deep Fake Threats And Detection Limits
FakeParts: New Benchmark Reveals Partial Deep Fake Threats And Detection Limits
Next Generation VLA Model By CogVLA! Instruction-driven Routing And Efficient Robot Operation Based On Cognitive Science
Next Generation VLA Model By CogVLA! Instruction-driven Routing And Efficient Robot Operation Based ...
Exploring LLM's Persuasion Resistance And Flexibility! New Evaluation And Training Methods With DuET-PD And Holistic DPO
Exploring LLM's Persuasion Resistance And Flexibility! New Evaluation And Training Methods With DuET ...
Seedream 3.0 Fill: Next-generation Mask Editing With OneReward
Seedream 3.0 Fill: Next-generation Mask Editing With OneReward
MVTracker: A Multi-view 3D Point Tracking Method That Achieves High Accuracy With A Small Number Of Cameras
MVTracker: A Multi-view 3D Point Tracking Method That Achieves High Accuracy With A Small Number Of ...
LLM Safety Amplification Achieved By Rank 1 Update! ROSI Mechanism And Experimental Results
LLM Safety Amplification Achieved By Rank 1 Update! ROSI Mechanism And Experimental Results
LLM Learning That Combines Diversity And Task Specialization: TCIA Mechanism And Experimental Results
LLM Learning That Combines Diversity And Task Specialization: TCIA Mechanism And Experimental Result ...
Innovation In Feature Video Generation With Mixture Of Contexts! Efficient Context Preservation And High Precision Generation
Innovation In Feature Video Generation With Mixture Of Contexts! Efficient Context Preservation And ...
AWORLD: Efficient Learning Platform For Agent AI With A Distributed Framework
AWORLD: Efficient Learning Platform For Agent AI With A Distributed Framework
MCP-Bench Opens Up A New Wave Of LLM Agent Evaluation! Challenges For Complex Tasks And Real-World Scenarios
MCP-Bench Opens Up A New Wave Of LLM Agent Evaluation! Challenges For Complex Tasks And Real-World S ...
Wavelet Diffusion: The Fastest Diffusion Model
Wavelet Diffusion: The Fastest Diffusion Model
Image Generation
Improved Accuracy And Transparency Of Face Recognition With ChatGPT, New Developments In Soft Biometrics
Improved Accuracy And Transparency Of Face Recognition With ChatGPT, New Developments In Soft Biomet ...
Large Language Models
[RL-GPT] A Framework To Acquire Diamonds Several Times Faster Than Usual With Mincraft Is Now Available
[RL-GPT] A Framework To Acquire Diamonds Several Times Faster Than Usual With Mincraft Is Now Availa ...
Machine Learning
A Framework Is Now Available That Allows LLMs To Assess Human Personality Using The MBTI!
A Framework Is Now Available That Allows LLMs To Assess Human Personality Using The MBTI!
ChatGPT
Easy! High Accuracy! Attractiveness Of The Anomaly Detection Model PatchCore
Easy! High Accuracy! Attractiveness Of The Anomaly Detection Model PatchCore
Object Detection
[DPO] A Method For Directly Matching Large-scale Language Models To User Preferences Without Using Reinforcement Learning
[DPO] A Method For Directly Matching Large-scale Language Models To User Preferences Without Using R ...
RLHF
Machine Learning Model Tackles Soccer Match Prediction In Sports Betting
Machine Learning Model Tackles Soccer Match Prediction In Sports Betting
Sports Analytics
Qwen2-VL] Latest VLM That Can Process Images And Videos In Different Resolutions
Qwen2-VL] Latest VLM That Can Process Images And Videos In Different Resolutions
Large Language Models
[Swin Transformer] Transformer-based Image Recognition Models To Keep Now!
[Swin Transformer] Transformer-based Image Recognition Models To Keep Now!
Image Recognition
Superior To ViT! A New Underlying Model For Large-Scale CNNs! : InternImage
Superior To ViT! A New Underlying Model For Large-Scale CNNs! : InternImage
Deep Learning
The Latest Comprehensive Review Of Activation Functions!
The Latest Comprehensive Review Of Activation Functions!
Survey
Simulate The Behavior Of 25 AI Agents In A Virtual Space City!
Simulate The Behavior Of 25 AI Agents In A Virtual Space City!
Large Language Models
Qwen2.5-Coder] LLM Specialized For Code Generation, Completion, And Mathematical Reasoning Tasks
Qwen2.5-Coder] LLM Specialized For Code Generation, Completion, And Mathematical Reasoning Tasks
Large Language Models
U-Net And Transformer Combined! Introducing Swin Unet, A New Network For Medical Image Segmentation.
U-Net And Transformer Combined! Introducing Swin Unet, A New Network For Medical Image Segmentation.
Medical