3 Reasons to Use Random Forest Over a Neural Network–Comparing Machine Learning versus Deep…

栏目: IT技术 · 发布时间: 5年前

内容简介：3 Reasons to Use Random Forest Over a Neural Network–Comparing Machine Learning versus Deep LearningRandom Forest is a better choice than neural networks because of a few main reasons. Here’s what you need to know comparing machine learning to deep learnin

3 Reasons to Use Random Forest Over a Neural Network–Comparing Machine Learning versus Deep Learning

Random Forest is a better choice than neural networks because of a few main reasons. Here’s what you need to know comparing machine learning to deep learning.

James Montantes

Feb 4 ·5min read

Neural networks have been shown to outperform a number of machine learning algorithms in many industry domains. They keep learning until it comes out with the best set of features to obtain a satisfying predictive performance. However, a neural network will scale your variables into a series of numbers that once the neural network finishes the learning stage, the features become indistinguishable to us.

3 Reasons to Use Random Forest Over a Neural Network–Comparing Machine Learning versus Deep… — Image Source

If all we cared about was the prediction, a neural net would be the de-facto algorithm used all the time. But in an industry setting, we need a model that can give meaning to a feature/variable to stakeholders. And these stakeholders will likely be anyone other than someone with a knowledge of deep learning or machine learning.

What’s the Main Difference Between Random Forest and Neural Networks?

Both the Random Forest and Neural Networks are different techniques that learn differently but can be used in similar domains. Random Forest is a technique of Machine Learning while Neural Networks are exclusive to Deep Learning.

What are Neural Networks?

A Neural Network is a computational model loosely based on the functioning cerebral cortex of a human to replicate the same style of thinking and perception. Neural Networks are organized in layers made up of interconnected nodes which contain an activation function that computes the output of the network.

Neural nets are another means of machine learning in which a computer learns to perform a task by analyzing training examples. As the neural net is loosely based on the human brain, it will consist of thousands or millions of nodes that are interconnected. A node can be connected to several nodes in the layer beneath it, from which it receives data, and several nodes above it which receive data. Each incoming data point receives a weight and is multiplied and added. A bias is added if the weighted sum equates to zero and then passed to the activation function.

The Architecture of Neural Networks

A Neural Network has 3 basic architectures:

Single Layer Feedforward Networks

It is the simplest network that is an extended version of the perceptron. It has additional hidden nodes between the input layer and output layer.

2. Multi Layer Feedforward Networks

This type of network has one or more hidden layers except for the input and output. Its role is to intervene in data transfer between the input and output layer.

3. Recurrent Networks

Recurrent neural networks are similar to the above but are widely adopted to predict sequential data such as text and time series. The most famous Recurrent Neural Network is the ‘ Long — Short Term Memory’ Model (LSTM) .

What is Random Forest?

Random Forestis an ensemble of Decision Trees whereby the final/leaf node will be either the majority class for classification problems or the average for regression problems.

A random forest will grow many Classification trees and for each output from that tree, we say the tree ‘ votes’ for that class. A tree is grown using the following steps:

A random sample of rows from the training data will be taken for each tree.
From the sample taken in Step (1), a subset of features will be taken to be used for splitting on each tree.
Each tree is grown to the largest extent specified by the parameters until it reaches a vote for the class.

Why Should You Use Random Forest?

The fundamental reason to use a random forest instead of a decision tree is to combine the predictions of many decision trees into a single model. The logic is that a single even made up of many mediocre models will still be better than one good model. There is truth to this given the mainstream performance of random forests. Random forests are less prone to overfitting because of this.

Over-fitting can occur with a flexible model like decision trees where the model with memorize the training data and learn any noise in the data as well. This will make it unable to predict the test data.

A random forest can reduce the high variance from a flexible model like a decision tree by combining many trees into one ensemble model.

When Should You Use Random Forest Versus a Neural Network?

Random Forest is less computationally expensive and does not require a GPU to finish training. A random forest can give you a different interpretation of a decision tree but with better performance. Neural Networks will require much more data than an everyday person might have on hand to actually be effective. The neural network will simply decimate the interpretability of your features to the point where it becomes meaningless for the sake of performance. While that may sound reasonable to some, it is dependent on each project.

If the goal is to create a prediction model without care for the variables at play, by all means use a neural network, but you’ll need the resources to do so. If an understanding of the variables are required, then whether we like it or not, typically what happens in this situation is that performance will have to take a slight hit to make sure that we can still understand how each variable is contributing to the prediction model.

Anything I’m missing here?

Please let me know and I’ll be glad to add it in.

以上所述就是小编给大家介绍的《3 Reasons to Use Random Forest Over a Neural Network–Comparing Machine Learning versus Deep…》，希望对大家有所帮助，如果大家有任何疑问请给我留言，小编会及时回复大家的。在此也非常感谢大家对码农网的支持！

查看所有标签

猜你喜欢:

3 Reasons to Use Random Forest Over a Neural Network–Comparing Machine Learning versus Deep…

本站部分资源来源于网络，本站转载出于传递更多信息之目的，版权归原作者或者来源机构所有，如转载稿涉及版权问题，请联系我们。

码农书籍

互联网金融手册

谢平、邹传伟、刘海二 / 中国人民大学出版社 / 2014-4-1 / 68.00元

本书作者在2012年4月7日“金融四十人年会”上首次公开提出了“互联网金融”概念。在短短两年中，互联网金融已经成为中国金融界和IT界最热门的词汇之一，相关创业活动也非常活跃。本书是作者两年来深入研究、思考的结晶，畅想了金融与IT结合的未来图景，将理论与实践高度融合，与读者分享了许多深具洞察力的观点。本书力图规范互联网金融的定义，完善互联网金融的理论体系，分析互联网金融目前的六种主要类型——金融互联......一起来看看《互联网金融手册》这本书的介绍吧!

码农工具