3 skills to master before reinforcement learning (RL)

栏目: IT技术 · 发布时间: 5年前

内容简介:Modern reinforcement learning is almost entirely focused onAlmost all of the courses and tutorials will assume you

1. Supervised learning

Modern reinforcement learning is almost entirely focused on deep reinforcement learning . The word in the “ deep ” in the phrase deep reinforcement learning implies the use of a neural network in a core aspect of the algorithm. The neural network does some high-dimensional approximation in the learning process. That being said, the model does not need to have many layers and features, which is a common misconception that deep implies many layers.

Almost all of the courses and tutorials will assume you can fine-tune simple neural networks to approximate state values or create a final policy . These models are historically highly sensitive to all of the following training parameters: learning rate, batch size, model parameters, data normalization, and more. Doubled with tasks that are difficult to solve, debugging RL can be very difficult, and just seem like a binary it works or it doesn’t . Eliminating tails of confusing by knowing that all the sub approximations made are up to par. The best way to do this would be to learn supervised learning, then let an AutoML tool finish the job for you.


以上就是本文的全部内容,希望本文的内容对大家的学习或者工作能带来一定的帮助,也希望大家多多支持 码农网

查看所有标签

猜你喜欢:

本站部分资源来源于网络,本站转载出于传递更多信息之目的,版权归原作者或者来源机构所有,如转载稿涉及版权问题,请联系我们

软件测试的艺术

软件测试的艺术

梅尔斯 / 机械工业出版社 / 2006年01月 / 22.0

《软件测试的艺术》(原书第2版)成功、有效地进行软件测试的实用策略和技术:    基本的测试原理和策略      验收测试    程序检查和走查         安装测试    代码检查            模块(单元)测试    错误列表            测试规划与控制    同行评分            独立测试机构    黑盒、白盒测试    ......一起来看看 《软件测试的艺术》 这本书的介绍吧!

图片转BASE64编码
图片转BASE64编码

在线图片转Base64编码工具

HTML 编码/解码
HTML 编码/解码

HTML 编码/解码

MD5 加密
MD5 加密

MD5 加密工具