3 skills to master before reinforcement learning (RL)

Category: IT Technology · Published: 5 years ago

1. Supervised learning

Modern reinforcement learning is almost entirely focused on deep reinforcement learning. The word “deep” in the phrase deep reinforcement learning implies the use of a neural network in a core aspect of the algorithm. The neural network performs some high-dimensional approximation in the learning process. That said, the model does not need to have many layers and features; it is a common misconception that “deep” implies many layers.

Almost all courses and tutorials assume you can fine-tune simple neural networks to approximate state values or to produce a final policy. These models have historically been highly sensitive to training parameters such as learning rate, batch size, model parameters, and data normalization. Combined with tasks that are difficult to solve, this makes debugging RL very hard, and the outcome can seem binary: it works or it doesn’t. You can eliminate much of this confusion by knowing that all the sub-approximations are up to par. The best way to do this is to learn supervised learning, then let an AutoML tool finish the job for you.
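To make the supervised sub-problem concrete, here is a minimal sketch of fitting a small network to state values, written in plain NumPy. Everything specific in it is an assumption chosen for illustration and does not come from a particular RL library: the 1-D state, the synthetic target `gamma ** (horizon * (1 - s))`, the helper names `make_value_dataset` and `train_value_net`, and the hyperparameter values. Note that the network has a single hidden layer, which is still "deep" RL in the sense described above.

```python
import numpy as np


def make_value_dataset(n=256, gamma=0.9, horizon=10, seed=0):
    """Hypothetical toy data: states in [0, 1] with a discounted-return target."""
    rng = np.random.default_rng(seed)
    states = rng.uniform(0.0, 1.0, size=(n, 1))
    # Assumed target: value decays with distance from the goal at s = 1.
    values = gamma ** (horizon * (1.0 - states))
    return states, values


def train_value_net(states, values, hidden=16, lr=0.02, steps=3000, seed=0):
    """Fit a one-hidden-layer tanh network with full-batch gradient descent."""
    rng = np.random.default_rng(seed)

    # Data normalization: one of the sensitive knobs mentioned in the text.
    mean, std = states.mean(axis=0), states.std(axis=0) + 1e-8
    x = (states - mean) / std

    w1 = rng.normal(0.0, 1.0, size=(1, hidden))
    b1 = np.zeros(hidden)
    w2 = rng.normal(0.0, 0.01, size=(hidden, 1))
    b2 = np.zeros(1)

    n = len(x)
    losses = []
    for _ in range(steps):
        # Forward pass.
        h = np.tanh(x @ w1 + b1)
        pred = h @ w2 + b2
        err = pred - values
        losses.append(float(np.mean(err ** 2)))

        # Backward pass for the mean squared error.
        d_pred = 2.0 * err / n
        d_w2 = h.T @ d_pred
        d_b2 = d_pred.sum(axis=0)
        d_z = (d_pred @ w2.T) * (1.0 - h ** 2)
        d_w1 = x.T @ d_z
        d_b1 = d_z.sum(axis=0)

        # Gradient descent step; lr is the learning rate.
        w1 -= lr * d_w1
        b1 -= lr * d_b1
        w2 -= lr * d_w2
        b2 -= lr * d_b2
    return losses


states, values = make_value_dataset()
losses = train_value_net(states, values)
print(f"MSE before training: {losses[0]:.4f}, after: {losses[-1]:.4f}")
```

Varying `lr`, `hidden`, or removing the normalization step in this sketch is a cheap way to see how sensitive even a tiny regression problem can be, before the same approximator is placed inside an RL loop.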

