Featuring Offline Reinforcement Learning! Part 1

Reinforcement Learning 15/09/2020

3 main points
✔️ Offline RL learns policies using only previously collected data.
✔️ Offline RL is expected to have a variety of applications in healthcare, robotics, and other fields.
✔️ The main problem with offline RL is distribution shift.

Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
written by Sergey Levine, Aviral Kumar, George Tucker, Justin Fu
(Submitted on 4 May 2020)
Subjects: Machine Learning (cs.LG), Artificial Intelligence (cs.AI), Machine Learning (stat.ML)

Introduction

In recent years, research attention has been shifting from online reinforcement learning, where data is collected at the same time as learning, toward reinforcement learning that uses only previously collected data. This setting is called offline reinforcement learning (offline RL), and it has become an active research topic. Online methods must act in the environment and collect data while they learn, which can be very time-consuming. Offline RL, by contrast, can reuse large datasets collected in the past, so it has the potential to learn more efficiently. It is gaining attention because it may be effective in a variety of fields, including healthcare, education, and robotics.
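To make the difference between the two settings concrete, here is a minimal sketch of the offline setting in Python. It is not taken from the paper: the 5-state chain environment, the uniformly random behavior policy, and every name in it (`step`, `collect_dataset`, `offline_q_learning`) are assumptions made purely for illustration. The key point is that the learning function only reads the logged transitions and never calls the environment.

```python
import random

# Hypothetical 5-state chain: action 1 moves right, action 0 moves left.
# Reaching the last state yields reward 1 and ends the episode.
N_STATES, N_ACTIONS, GAMMA = 5, 2, 0.9

def step(state, action):
    """Toy environment dynamics (an assumption made for this sketch)."""
    next_state = min(state + 1, N_STATES - 1) if action == 1 else max(state - 1, 0)
    done = next_state == N_STATES - 1
    reward = 1.0 if done else 0.0
    return next_state, reward, done

def collect_dataset(n_episodes, rng, max_steps=50):
    """Log transitions with a uniformly random behavior policy."""
    dataset = []
    for _ in range(n_episodes):
        state = 0
        for _ in range(max_steps):
            action = rng.randrange(N_ACTIONS)
            next_state, reward, done = step(state, action)
            dataset.append((state, action, reward, next_state, done))
            state = next_state
            if done:
                break
    return dataset

def offline_q_learning(dataset, n_sweeps=200, lr=0.5):
    """Fit Q-values from the fixed dataset only; the environment is never called."""
    q = [[0.0] * N_ACTIONS for _ in range(N_STATES)]
    for _ in range(n_sweeps):
        for state, action, reward, next_state, done in dataset:
            target = reward if done else reward + GAMMA * max(q[next_state])
            q[state][action] += lr * (target - q[state][action])
    return q

# Data collection happens once, up front; learning then reuses the same log.
rng = random.Random(0)
dataset = collect_dataset(n_episodes=200, rng=rng)
q_table = offline_q_learning(dataset)

# Greedy policy for the non-terminal states.
greedy_policy = [
    max(range(N_ACTIONS), key=lambda a: q_table[s][a])
    for s in range(N_STATES - 1)
]
print(greedy_policy)
```

On this toy problem the random behavior policy covers every state-action pair, so the greedy policy should choose action 1 (move right) in every non-terminal state. In realistic problems the dataset covers only part of the state-action space, and the learned policy may prefer actions the data says little about. That mismatch is the distribution shift named in the main points above.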

However, because of several unresolved problems, offline RL has not yet realized this potential, and research to address them is ongoing. In this series on offline RL, we discuss what the problems are, what research has been done so far, and the future prospects for offline RL. In this first article, we explain in detail what offline RL is, how it is applied, and why it is so difficult.


山田
