Meta Achieves Unexpected Improvements In Bayesian Optimization Meta Achieves Unexpected Improvements In Bayesian Optimization 19/02/2024 Bayesian Optimization
[DPO] A Method For Directly Matching Large-scale Language Models To User Preferences Without Using Reinforcement Learning [DPO] A Method For Directly Matching Large-scale Language Models To User Preferences Without Using R ... 02/02/2024 RLHF
Open X-Embodiment: Towards A Generic Robot Learning Open X-Embodiment: Towards A Generic Robot Learning 10/01/2024 Robot
Mask R-CNN: Efficient Detection Of Objects In Images Mask R-CNN: Efficient Detection Of Objects In Images 04/01/2024 Computer Vision
Machine Suggestion Of Optimal Strategies: A System That Recommends Strategies That Meet Advertisers' Objectives Is Now Available Machine Suggestion Of Optimal Strategies: A System That Recommends Strategies That Meet Advertisers' ... 26/12/2023 Reinforcement Learning
How To Make A Machine Learn Intuitive Human Understanding? How To Make A Machine Learn Intuitive Human Understanding? 25/12/2023 Machine Learning
EUREKA: Automated Compensation Design With LLM EUREKA: Automated Compensation Design With LLM 04/12/2023 RLHF
Diffusion Policy : Diffusion Models For Robots! When Robots Can Make Pizza! Diffusion Policy : Diffusion Models For Robots! When Robots Can Make Pizza! 06/11/2023 Diffusion Model
Implicit Behaviral Cloning : A New Formulation Of Imitation Learning! Robot Complex Behavior! Implicit Behaviral Cloning : A New Formulation Of Imitation Learning! Robot Complex Behavior! 30/10/2023 Robot
Can Wikipedia Assist Offline Reinforcement Learning? Introducing Pre-training In Language Tasks To Offline Reinforcement Learning! Can Wikipedia Assist Offline Reinforcement Learning? Introducing Pre-training In Language Tasks To O ... 11/10/2023 Offline Reinforcement Learning
I Would Like To Run Decision Transformer In A Stochastic Environment As Well! I Would Like To Run Decision Transformer In A Stochastic Environment As Well! 05/10/2023 RvS
Jump-Start RL: Streamlines Search By "guiding" With Pre-learned Strategies! Jump-Start RL: Streamlines Search By "guiding" With Pre-learned Strategies! 05/10/2023 Offline Pre-Training And Online Finetuning
Cal-QL: Offline Reinforcement Learning Specialized For Prior Learning, For Efficient Online Fine Tuning Cal-QL: Offline Reinforcement Learning Specialized For Prior Learning, For Efficient Online Fine Tun ... 28/09/2023 Offline Reinforcement Learning
GLAM: LLM As A Reinforcement Learning Agent GLAM: LLM As A Reinforcement Learning Agent 19/09/2023 Large Language Models
Success In Generating Various Robot Motions With LLM Success In Generating Various Robot Motions With LLM 11/09/2023 Large Language Models
RLHF: How To Train Reinforcement Learning Agents Using Human Evaluation RLHF: How To Train Reinforcement Learning Agents Using Human Evaluation 01/09/2023 Alignment
Autonomous Drone-controlled Reforestation Approach Using MA Reinforcement Learning Autonomous Drone-controlled Reforestation Approach Using MA Reinforcement Learning 23/05/2023 Reinforcement Learning
DeepFoids: Simulation Of Fish School Behavior Using Deep Reinforcement Learning DeepFoids: Simulation Of Fish School Behavior Using Deep Reinforcement Learning 01/05/2023 Reinforcement Learning