Articles
Skywork UniPic: Next-generation Multimodal Model That Integrates Image Understanding, Generation, And Editing With High Efficiency
Skywork UniPic: Next-generation Multimodal Model That Integrates Image Understanding, Generation, An ...
Seed Diffusion Preview: Next-generation Code Generation Model That Combines Fast Inference And High Performance
Seed Diffusion Preview: Next-generation Code Generation Model That Combines Fast Inference And High ...
MATE: Multi-agent Accessibility-specific Modality Transformation Framework
MATE: Multi-agent Accessibility-specific Modality Transformation Framework
Biomed-Enriched: Large Biomedical Dataset With LLM Annotation For Clinical And Educational Value
Biomed-Enriched: Large Biomedical Dataset With LLM Annotation For Clinical And Educational Value
How Many Times Is Debugging LLM Effective? What Is The New Indicator "DDI" To Detect The Decay Of Effectiveness?
How Many Times Is Debugging LLM Effective? What Is The New Indicator "DDI" To Detect The Decay Of Ef ...
Combining Speed And Accuracy: Quantization-aware LLM Pre-training "QAP
Combining Speed And Accuracy: Quantization-aware LLM Pre-training "QAP
HiWave: Innovation In Wavelet Diffusion Generation For 4K Images Without Additional Learning
HiWave: Innovation In Wavelet Diffusion Generation For 4K Images Without Additional Learning
Forget-Me-Not: A Proposal For A Simple Prompting Technique To Prevent Forgetting Information In Long Prompts
Forget-Me-Not: A Proposal For A Simple Prompting Technique To Prevent Forgetting Information In Long ...
Potential Of The Conversation Optimization Tokenizer: A Method To Improve LLM Inference Efficiency By 10%
Potential Of The Conversation Optimization Tokenizer: A Method To Improve LLM Inference Efficiency B ...
RoboTwin 2.0: Scalable Synthetic Data Generation And Benchmark Design For Dual-Arm Manipulation Robots
RoboTwin 2.0: Scalable Synthetic Data Generation And Benchmark Design For Dual-Arm Manipulation Robo ...
Enhanced LLM Code Generation With Property-based Testing! New Framework PGS To Break Self-Deception
Enhanced LLM Code Generation With Property-based Testing! New Framework PGS To Break Self-Deception
Evolution Of Llama To Support Reinforcement Learning, OctoThinker Shows The Power Of Intermediate Learning
Evolution Of Llama To Support Reinforcement Learning, OctoThinker Shows The Power Of Intermediate Le ...
What Is DualTHOR? Next Generation Simulator For Dual-Arm Robots' Adaptability To Reality
What Is DualTHOR? Next Generation Simulator For Dual-Arm Robots' Adaptability To Reality
Innovations In Outlier-Safe Pre-Training For Large Language Models To Prevent Outliers And Protect Quantization Accuracy
Innovations In Outlier-Safe Pre-Training For Large Language Models To Prevent Outliers And Protect Q ...
Democratizing GPT-4o Level Image Generation: The Janus-4o And ShareGPT-4o-Image Challenge
Democratizing GPT-4o Level Image Generation: The Janus-4o And ShareGPT-4o-Image Challenge
FedNano: Lightweight And Efficient Distributed Learning Of Large-scale Multimodal Models
FedNano: Lightweight And Efficient Distributed Learning Of Large-scale Multimodal Models
ImmerseGen: Agent-guided, Lightweight X Highly Realistic Next-generation VR Scene Generation
ImmerseGen: Agent-guided, Lightweight X Highly Realistic Next-generation VR Scene Generation
Wavelet Diffusion: The Fastest Diffusion Model
Wavelet Diffusion: The Fastest Diffusion Model
Image Generation
Improved Accuracy And Transparency Of Face Recognition With ChatGPT, New Developments In Soft Biometrics
Improved Accuracy And Transparency Of Face Recognition With ChatGPT, New Developments In Soft Biomet ...
Large Language Models
[RL-GPT] A Framework To Acquire Diamonds Several Times Faster Than Usual With Mincraft Is Now Available
[RL-GPT] A Framework To Acquire Diamonds Several Times Faster Than Usual With Mincraft Is Now Availa ...
Machine Learning
A Framework Is Now Available That Allows LLMs To Assess Human Personality Using The MBTI!
A Framework Is Now Available That Allows LLMs To Assess Human Personality Using The MBTI!
ChatGPT
Mask R-CNN: Efficient Detection Of Objects In Images
Mask R-CNN: Efficient Detection Of Objects In Images
Computer Vision
Graphs Are So Awesome! Review Of Integration With Deep Learning
Graphs Are So Awesome! Review Of Integration With Deep Learning
GNN
The First Framework To Utilize LLM To Detect Fake News Is Now Available!
The First Framework To Utilize LLM To Detect Fake News Is Now Available!
Fakenews
Democratizing GPT-4o Level Image Generation: The Janus-4o And ShareGPT-4o-Image Challenge
Democratizing GPT-4o Level Image Generation: The Janus-4o And ShareGPT-4o-Image Challenge
LLM Agents Successfully Lead Customers To Purchase 35% Of The Time!
LLM Agents Successfully Lead Customers To Purchase 35% Of The Time!
ChatGPT
Seed Diffusion Preview: Next-generation Code Generation Model That Combines Fast Inference And High Performance
Seed Diffusion Preview: Next-generation Code Generation Model That Combines Fast Inference And High ...
Superior To ViT! A New Underlying Model For Large-Scale CNNs! : InternImage
Superior To ViT! A New Underlying Model For Large-Scale CNNs! : InternImage
Deep Learning
ImageReward: A Reward Model That Learns Human Evaluation In Text-to-image
ImageReward: A Reward Model That Learns Human Evaluation In Text-to-image
Alignment
[DPO] A Method For Directly Matching Large-scale Language Models To User Preferences Without Using Reinforcement Learning
[DPO] A Method For Directly Matching Large-scale Language Models To User Preferences Without Using R ...
RLHF
RLHF: How To Train Reinforcement Learning Agents Using Human Evaluation
RLHF: How To Train Reinforcement Learning Agents Using Human Evaluation
Alignment