Programming your own Reinforcement Learning implementation from scratch can be a lot of work, but you don’t need to do that. There are lots of great, easy and free frameworks to get you started in a few minutes.
There are lots of standard libraries for supervised and unsupervised machine learning, like Scikit-learn, XGBoost or even Tensorflow, that can get you started in no time, and you can find loads of support online. Sadly, for Reinforcement Learning (RL) this is not the case.
It is not that there are no frameworks; as a matter of fact, there are many frameworks for RL out there. The problem is that there is no standard yet, so support for getting started, fixing a problem or customizing a solution is hard to find online. This is probably because, while RL is a very popular research topic, it is still in its early days of being implemented and used in industry.
But this doesn’t mean there are no great frameworks out there that can help you start using RL to solve any problem you like. I have made a list here of some frameworks I have come to know and use over time, with their pros and cons. I hope this gives you a quick overview of some of the RL frameworks currently available, so you can choose the one that best fits your needs.
Keras-RL
I have to admit that, of the whole list, this is my favorite. I believe it is by far the simplest-to-understand code implementation of several RL algorithms, including Deep Q-Learning (DQN), Double DQN, Deep Deterministic Policy Gradient (DDPG), Continuous DQN (CDQN or NAF), the Cross-Entropy Method (CEM), Dueling DQN and SARSA. When I say “simplest-to-understand code” I refer not to using it, but to customizing it and utilizing it as a building block for your own project*. The Keras-RL github also contains some examples that you can use to get started in no time. It uses Keras, of course, with either a Tensorflow or Theano backend.
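To give a feel for the API, here is a minimal CartPole sketch along the lines of the dqn_cartpole example in the Keras-RL repo (the hyperparameters are illustrative, not tuned):

import gym
from keras.models import Sequential
from keras.layers import Dense, Flatten
from keras.optimizers import Adam
from rl.agents.dqn import DQNAgent
from rl.policy import EpsGreedyQPolicy
from rl.memory import SequentialMemory

# Build a small Q-network mapping observations to one Q-value per action
env = gym.make('CartPole-v1')
nb_actions = env.action_space.n
model = Sequential()
model.add(Flatten(input_shape=(1,) + env.observation_space.shape))
model.add(Dense(16, activation='relu'))
model.add(Dense(nb_actions, activation='linear'))

# Wire the network into a DQN agent with a replay buffer and an eps-greedy policy
memory = SequentialMemory(limit=50000, window_length=1)
dqn = DQNAgent(model=model, nb_actions=nb_actions, memory=memory,
               nb_steps_warmup=10, target_model_update=1e-2,
               policy=EpsGreedyQPolicy())
dqn.compile(Adam(lr=1e-3), metrics=['mae'])
dqn.fit(env, nb_steps=10000, visualize=False, verbose=1)

Every piece here (the policy, the memory, the agent itself) is a class you can subclass, which is exactly what makes the library such a convenient building block.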
Unfortunately, Keras-RL has not been well maintained for a while now and its official documentation is not the best. This has given rise to a fork of the project called Keras-RL2.
(*) What did I use this framework for? Well, I’m glad you asked (or was it me?). I have used this framework to create a customized Tutored DQN agent; you can read more about it here.
Keras-RL2
Keras-RL2 is a fork of Keras-RL, and as such it supports the same agents as Keras-RL and is just as easily customizable. The big change is that Keras-RL2 is better maintained and uses Tensorflow 2.1.0. Unfortunately, there is no documentation for this library, although the documentation for Keras-RL can easily be used for this fork too.
OpenAI Baselines
OpenAI Baselines is a set of high-quality implementations of RL algorithms by OpenAI, one of the leading companies in AI research and development, particularly in RL. It was conceived so that researchers could easily compare their RL algorithms, using the state-of-the-art implementations from OpenAI as a baseline (thus the name). The framework contains implementations of many popular agents such as A2C, DDPG, DQN, PPO2 and TRPO.
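Each algorithm exposes a learn function that trains an agent and returns the resulting policy. As a rough sketch, mirroring the train_cartpole example shipped with the repo (parameter values are illustrative):

import gym
from baselines import deepq

# Train a DQN agent on CartPole; deepq.learn returns the trained policy
env = gym.make('CartPole-v0')
act = deepq.learn(
    env,
    network='mlp',              # a small fully-connected Q-network
    lr=1e-3,
    total_timesteps=100000,
    buffer_size=50000,
    exploration_fraction=0.1,   # fraction of training spent annealing epsilon
    exploration_final_eps=0.02,
    print_freq=10,
)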
On the downside, OpenAI Baselines is not well documented, even though there are lots of useful comments in the code. In addition, since it was developed to be used as a baseline and not as a building block, the code is not so friendly if you want to customize or modify some of the agents for your own projects. In fact, the next framework is a fork of it that solves most of these issues.
Stable Baselines
Stable Baselines is a fork of OpenAI Baselines, with a major structural refactoring and code cleanups. The changes listed on their official documentation site are the following:
- Unified structure for all algorithms
- PEP8 compliant (unified code style)
- Documented functions and classes
- More tests & more code coverage
- Additional algorithms: SAC and TD3 (+ HER support for DQN, DDPG, SAC and TD3)
I have personally used Stable Baselines in the past and I can confirm that it is really well documented and easy to use. It is even possible to train an agent on an OpenAI Gym environment with a one-liner:
from stable_baselines import PPO2

model = PPO2('MlpPolicy', 'CartPole-v1').learn(10000)
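If you need more control than the one-liner gives you, the pattern from the Stable Baselines documentation is to build the environment and model explicitly (a short sketch; the timestep budget is illustrative):

import gym
from stable_baselines import PPO2

# Build the environment and the model explicitly instead of via the one-liner
env = gym.make('CartPole-v1')
model = PPO2('MlpPolicy', env, verbose=1)
model.learn(total_timesteps=10000)

# Run the trained policy for a few episodes
obs = env.reset()
for _ in range(1000):
    action, _states = model.predict(obs)
    obs, reward, done, info = env.step(action)
    if done:
        obs = env.reset()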
Acme
Acme comes from DeepMind, probably the most well-known company working on RL research. As such, it has been developed for building readable, efficient, research-oriented RL algorithms, and it contains implementations of several state-of-the-art agents such as D4PG, DQN, R2D2, R2D3 and more. Acme uses Tensorflow as its backend, and some of the agent implementations use a combination of JAX and Tensorflow.
Acme was developed with code reusability in mind, so its design is modular and easy to customize. Its documentation is not abundant, but it is enough to give you a nice introduction to the library, and there are also some Jupyter notebook examples to get you started.
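The core abstraction is an EnvironmentLoop that runs an agent against an environment. Here is a rough sketch based on Acme’s quickstart notebook (module paths and signatures may have shifted between releases, so treat this as orientation rather than gospel):

import gym
import sonnet as snt
import acme
from acme import specs, wrappers
from acme.agents.tf import dqn

# Wrap a Gym environment so it exposes the dm_env interface Acme expects
environment = wrappers.GymWrapper(gym.make('CartPole-v1'))
environment = wrappers.SinglePrecisionWrapper(environment)
environment_spec = specs.make_environment_spec(environment)

# A small Sonnet Q-network: observation -> one Q-value per action
network = snt.Sequential([
    snt.Flatten(),
    snt.nets.MLP([64, 64, environment_spec.actions.num_values]),
])

# Run the agent against the environment for a fixed number of episodes
agent = dqn.DQN(environment_spec=environment_spec, network=network)
loop = acme.EnvironmentLoop(environment, agent)
loop.run(num_episodes=100)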
Takeaways
All of the frameworks listed here are solid options for any RL project; deciding which one to use depends on your preferences and exactly what you want to do with it. To better compare each framework’s pros and cons, I have made the following visual summary:
Keras-RL — Github
Choices of RL algorithms: ☆☆☆
Documentation: ☆☆☆
Customization: ☆☆☆☆☆
Maintenance: ☆
Backend: Keras and Tensorflow 1.14.
Keras-RL2 — Github
Choices of RL algorithms: ☆☆☆
Documentation: Not available
Customization: ☆☆☆☆☆
Maintenance: ☆☆☆
Backend: Keras and Tensorflow 2.1.0.
OpenAI Baselines — Github
Choices of RL algorithms: ☆☆☆
Documentation: ☆☆
Customization: ☆☆
Maintenance: ☆☆☆
Backend: Tensorflow 1.14.
Stable Baselines — Github
Choices of RL algorithms: ☆☆☆☆
Documentation: ☆☆☆☆☆
Customization: ☆☆☆
Maintenance: ☆☆☆☆☆
Backend: Tensorflow 1.14.
Acme — Github
Choices of RL algorithms: ☆☆☆☆
Documentation: ☆☆☆
Customization: ☆☆☆☆
Maintenance: ☆☆☆☆☆
Backend: Tensorflow v2+ and JAX.
If you have already decided which framework to use, all you need now is an environment. You can start with OpenAI Gym, which is already used in most of these frameworks’ examples, but if you want to try RL on other tasks, such as trading stocks, networking or producing recommendations, you can find a comprehensive list of ready-to-use environments here:
If you know of any other good RL frameworks, please let me know in the responses below! Thanks for reading! :)