Where to start learning RL

栏目: IT技术 · 发布时间: 5年前

内容简介:We learn new skills in computer science to make things. Taking ideas to code is where value is made (except for a few theoretical professors). The crux of this post is that you need toThere is a long list of resources to learn about RL theory at the end of

I would say I am competent in robot learning (robotics + reinforcement learning). I had the privilege of being pushed to this in my Ph.D., but you can too. The themes are repeatable and effective.

Learn by doing

We learn new skills in computer science to make things. Taking ideas to code is where value is made (except for a few theoretical professors). The crux of this post is that you need to find your problem space .

There is a long list of resources to learn about RL theory at the end of this, but with how broadly applicable AI methods are — you have to choose where. This comes down to an overlay of three motives:

  1. Problems you enjoy working on.
  2. Problems that have global impact.
  3. Problems that will get you a job and stability.

Decide on a problem space for RL where you like what you’re doing, it’ll do something to help the world, and hopefully, other people will catch on and give you a bigger platform to make change.

Where to start learning RL

An example simulated robot arm task — called Reacher3d. Uses Mujoco and Gym .

What have I built? I work with robots. I want robots to do many simple tasks, all over the place. They can move our furniture, drive our cars, deliver our boxes, and more . All this should come within a decade . A decade out, this looks like learning low-level locomotion controllers. The core repository for learning robot dynamics and control is found here . ( Most of the research is still on private before publication. )

Build fundamentals and depth with writing or reflection

I’ve written about 20 posts on Medium, and it’s an amazing compliment to any education program. It’s time to reflect on what you built and how it fits into a bigger picture. It’s a time to make sure others can comprehend your results. A common weakness of the best graduate students I meet — an inability to clearly break down their ideas. As a senior graduate student, I am focused on making my work last and be reused after I finish my degree.

Research papers, blog posts, etc are all writing forms that act as permanent recreations of your mind and self . There’s little that lets individuals continue to be of use and interacted with after their career, but high quality writing may be the most accessible tool we have for now.

Posts I have written on RL to date. It’s a wonderful subject and there’s always more to explore.

  1. 3 skills to master before RL .
  2. What is a Markov Decision Process anyways?
  3. The hidden linear algebra of reinforcement learning.
  4. Fundamentals iterative methods of reinforcement learning.
  5. Convergence of reinforcement learning algorithms

Learn PyTorch

PyTorch is becoming dominant in the are of machine learning research, and because reinforcement learning is young, it’s mostly research. You can find the statistics here . PyTorch is very fluid and pythonic, so don’t worry about getting too bogged down in learning it, it can happen along the way.


以上就是本文的全部内容,希望对大家的学习有所帮助,也希望大家多多支持 码农网

查看所有标签

猜你喜欢:

本站部分资源来源于网络,本站转载出于传递更多信息之目的,版权归原作者或者来源机构所有,如转载稿涉及版权问题,请联系我们

HTML5经典实例

HTML5经典实例

Christopher Schmitt、Kyle Simpson / 李强 / 中国电力出版社 / 2013-7 / 48.00元

《HTML5经典实例》对于从中级到高级的Web和移动Web开发者来说是绝佳之选,它帮助你选择对你有用的HTML5功能,并且帮助你体验其他的功能。个技巧的信息十分丰富,都包含了示例代码,并详细讨论了解决方案为何有效以及如何工作。一起来看看 《HTML5经典实例》 这本书的介绍吧!

SHA 加密
SHA 加密

SHA 加密工具

Markdown 在线编辑器
Markdown 在线编辑器

Markdown 在线编辑器

HEX HSV 转换工具
HEX HSV 转换工具

HEX HSV 互换工具