Learning human objectives by evaluating hypothetical behaviours
We present a new method for training reinforcement learning agents from human feedback in the presence of unknown unsafe states.Read More
Related Google News:
- Databricks on Google Cloud: an open integrated platform for data, analytics and machine learning February 17, 2021
- Uncovering Unknown Unknowns in Machine Learning February 11, 2021
- Can machine learning make you a better athlete? February 4, 2021
- Machine Learning for Computer Architecture February 4, 2021
- Evaluating Design Trade-offs in Visual Model-Based Reinforcement Learning February 3, 2021
- Learning to Reason Over Tables from Less Data January 29, 2021
- Using machine learning to improve road maintenance January 13, 2021
- How to automatically scale your machine learning predictions December 17, 2020
December 13, 2019
AI / Google
Copyright 2020