Clearly Explained: Ensemble learning methods, the heart of Machine Learning



Machine Learning Fundamentals


Bagging and Boosting demystified, explained unconventionally. It'll be worth your time.

May 28 · 8 min read


Photo by Owen Beard on Unsplash

I previously published a post on “Clearly Explained: What is Bias-Variance tradeoff, Overfitting & Underfitting”. That comprehensive article serves as an important prequel to this post if you are a newbie, or if you would just like to brush up on the concepts of bias and variance before diving into the sea of ensemble modeling. Having said that, let's move on to learn more about ensemble modeling. I will use some real-life examples to simplify the what, why, and how of ensemble models, with a focus on bagging and boosting techniques.


Source: Giphy

Scenario 1: You need a new pair of headphones. Now, are you likely to just walk into a store and buy whatever headphones the salespeople show you? In this day and age, I am sure the answer is NO, because we rely heavily on our “research” before buying anything. You would browse a few technology portals, check the user reviews, and compare the different models that interest you on features and price. You would probably also ask your friends and colleagues for their opinions. In short, you wouldn't jump straight to a conclusion, but would instead make an informed decision after thoroughly researching your options.

Now, we can take a look at the technical definition of Ensemble learning methods.


Source: Inspired by a diagram from SAS Institute

What is an Ensemble method?

Ensemble models in machine learning combine the decisions from multiple models to improve the overall performance. They operate on the same idea you employed while buying the headphones.

The main causes of error in learning models are noise, bias, and variance.

Ensemble methods help to minimize these factors. These methods are designed to improve the stability and accuracy of Machine Learning algorithms.

Scenario 2: Let's suppose that you have developed a health and fitness app. Before making it public, you wish to receive critical feedback to close any potential loopholes. You can resort to one of the following methods; read them and decide which one is best:

  1. You can take the opinion of your spouse or your closest friends.
  2. You can ask a bunch of your friends and office colleagues.
  3. You can launch a beta version of the app and receive feedback from the web development community and unbiased users.

No brownie points for guessing the answer :D Yes, of course, we will roll with the third option.

Now, pause and think about what you just did. You took multiple opinions from a large enough bunch of people and then made an informed decision based on them. This is what Ensemble methods also do.

Ensemble models in machine learning combine the decisions from multiple models to improve the overall performance.

Scenario 3: Take a look at the following picture; we can see a group of blindfolded children playing the game of “Touch and tell” while examining an elephant that none of them has ever seen before. Each of them will have a different idea of what an elephant looks like, because each is exposed to a different part of the elephant. Now, if we give them the task of submitting a report describing the elephant, their individual reports will each describe only one part accurately, as per their experience, but collectively they can combine their observations to give a very accurate description of an elephant.

Similarly, ensemble learning methods employ a group of models whose combined output is almost always better in terms of prediction accuracy than that of any single model.

Ensembles are a divide and conquer approach used to improve performance.


Source: Pixabay

Now, let’s dive into some of the important Ensemble techniques.


Crack up a little :D You are doing great :)

Simple Ensemble Techniques

  1. Taking the mode of the results

MODE: The mode is a statistical term that refers to the most frequently occurring number found in a set of numbers.

In this technique, multiple models are used to make predictions for each data point. The predictions by each model are considered as a separate vote. The prediction which we get from the majority of the models is used as the final prediction.

For instance, we can understand this by referring back to Scenario 2 above. The chart below shows the ratings that the beta version of our health and fitness app received from the user community. (Consider each person as a different model.)

Output = MODE = 3, since the majority of people (45 out of 72) gave this rating.

| Rating | Number of people |
| --- | --- |
| 1 | 5 |
| 2 | 13 |
| 3 | 45 |
| 4 | 7 |
| 5 | 2 |
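As a quick illustration, here is a minimal sketch of majority voting in plain Python; the three models and their predictions are hypothetical, made up just for this example:

```python
from collections import Counter

# Hypothetical class predictions from three different models for five data points
preds_model_1 = [1, 0, 1, 1, 0]
preds_model_2 = [1, 1, 1, 0, 0]
preds_model_3 = [0, 0, 1, 1, 1]

# For each data point, the final prediction is the mode (the majority vote)
final_preds = [
    Counter(votes).most_common(1)[0][0]
    for votes in zip(preds_model_1, preds_model_2, preds_model_3)
]
print(final_preds)  # [1, 0, 1, 1, 0]
```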

2. Taking the average of the results

In this technique, we take the average of the predictions from all the models and use that average as the final prediction.

AVERAGE = sum(Rating × Number of people) / Total number of people = ((1×5) + (2×13) + (3×45) + (4×7) + (5×2)) / 72 = 204 / 72 ≈ 2.833, which rounds to 3.
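The same calculation can be sanity-checked in a few lines of Python, using the rating counts from the chart above:

```python
ratings = [1, 2, 3, 4, 5]
counts = [5, 13, 45, 7, 2]  # number of people who gave each rating

total_people = sum(counts)  # 72
average = sum(r * c for r, c in zip(ratings, counts)) / total_people
print(round(average, 3))  # 2.833
print(round(average))     # 3
```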

3. Taking the weighted average of the results

This is an extension of the averaging method. All models are assigned different weights defining the importance of each model for the prediction. For instance, if about 25 of your respondents are professional app developers, while the others have no prior experience in this field, then the answers of these 25 people are given more importance than those of the other people.

For example, for simplicity, I am trimming the scale of the example down to 5 people:

WEIGHTED AVERAGE = ((0.3×3) + (0.3×2) + (0.3×2) + (0.15×4) + (0.15×3)) / (0.3 + 0.3 + 0.3 + 0.15 + 0.15) = 3.15 / 1.2 = 2.625, which rounds to 3. (Note that these weights sum to 1.2 rather than 1, so we must divide by their sum.)
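Here is the same weighted average as a short Python sketch; NumPy's np.average conveniently divides by the sum of the weights for us:

```python
import numpy as np

ratings = np.array([3, 2, 2, 4, 3])              # ratings from the 5 people
weights = np.array([0.3, 0.3, 0.3, 0.15, 0.15])  # higher weights for the 3 experts

# np.average normalizes by the sum of the weights (1.2 here)
weighted_avg = np.average(ratings, weights=weights)
print(round(weighted_avg, 3))  # 2.625, which rounds to 3
```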


Advanced Ensemble techniques

We'll learn about the Bagging and Boosting techniques now. But to use them, you must select a base learner algorithm. For example, if we choose a classification tree, Bagging and Boosting would consist of a pool of trees as big as we want.

1. Bagging (Bootstrap AGGregatING)

Bootstrap Aggregating is an ensemble method. First, we create random samples of the training data set with replacement (subsets of the training data set). Then, we build a model (a classifier or a decision tree) on each sample. Finally, the results of these multiple models are combined using averaging or majority voting.

Since each model is exposed to a different subset of the data and we use their collective output at the end, we ensure that the ensemble does not cling too closely to the training data set, which takes care of the problem of overfitting. Thus, Bagging helps us to reduce the variance error.
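For the hands-on readers, here is a minimal bagging sketch using scikit-learn; the toy dataset and the hyperparameters are arbitrary choices for illustration:

```python
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import BaggingClassifier
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = load_breast_cancer(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

# 50 decision trees, each trained on a bootstrap sample of the training set;
# their majority vote is the ensemble's prediction
bagging = BaggingClassifier(DecisionTreeClassifier(), n_estimators=50,
                            bootstrap=True, random_state=42)
bagging.fit(X_train, y_train)
print(bagging.score(X_test, y_test))  # accuracy of the combined model
```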

Combinations of multiple models decrease variance, especially in the case of unstable models, and may produce a more reliable prediction than a single model.

The Random Forest technique actually uses this concept, but it goes a step further to reduce the variance even more: for each bootstrapped sample, it also randomly chooses a subset of features to consider when making the splits during training.
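A Random Forest sketch under the same assumptions looks almost identical; the extra ingredient is the max_features parameter, which controls the random subset of features considered at each split:

```python
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

X, y = load_breast_cancer(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

# Bagging plus feature subsampling: each split considers only a random
# subset of the features (here sqrt of the feature count), which
# decorrelates the trees and further reduces variance
forest = RandomForestClassifier(n_estimators=50, max_features="sqrt",
                                random_state=42)
forest.fit(X_train, y_train)
print(forest.score(X_test, y_test))
```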


2. Boosting


Source: Giphy

Boosting is an iterative technique that adjusts the weight of an observation based on the last classification. If an observation was classified incorrectly, it tries to increase the weight of this observation and vice versa.

Boosting generally decreases the bias error and builds strong predictive models. Boosting has shown better predictive accuracy than bagging, but it also tends to overfit the training data. Thus, parameter tuning becomes a crucial part of boosting algorithms, to make them avoid overfitting.

Boosting is a sequential technique in which the first algorithm is trained on the entire data set and each subsequent algorithm is built by fitting the residuals of the previous one, thus giving higher weight to the observations that were poorly predicted by the previous model.

It relies on creating a series of weak learners, each of which might not perform well on the entire data set but does perform well on some part of it. Thus, each model actually boosts the performance of the ensemble.
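As one concrete example of a boosting algorithm, here is a minimal AdaBoost sketch with scikit-learn; AdaBoost reweights misclassified observations as described above, and the dataset and hyperparameters are again just illustrative:

```python
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import AdaBoostClassifier
from sklearn.model_selection import train_test_split

X, y = load_breast_cancer(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

# Each weak learner (a one-level decision tree by default) is fitted on a
# reweighted version of the data, with misclassified points getting higher weight
boosting = AdaBoostClassifier(n_estimators=100, random_state=42)
boosting.fit(X_train, y_train)
print(boosting.score(X_test, y_test))
```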

Summary of differences between Bagging and Boosting

|  | Bagging | Boosting |
| --- | --- | --- |
| How models are built | Independently, each on a random bootstrap sample | Sequentially, each focusing on the errors of the previous model |
| Main error reduced | Variance | Bias |
| How results are combined | Averaging or majority voting | Weighted combination of the weak learners |
| Overfitting risk | Low (helps prevent overfitting) | Higher; needs careful parameter tuning |

Advantages/Benefits of ensemble methods

Ensemble methods are used in almost all ML hackathons to enhance the prediction abilities of the models.

Let’s take a look at the advantages of using ensemble methods:

  1. More accurate prediction results - We can compare the working of ensemble methods to the diversification of a financial portfolio. You are advised to keep a mixed portfolio across debt and equity to reduce the variability and hence minimize the risk. Similarly, an ensemble of models will, in most cases, give better performance on unseen test data than the individual models.
  2. Stable and more robust model - The aggregate result of multiple models is generally less noisy than that of the individual models. This leads to model stability and robustness.
  3. Ensemble models can be used to capture both the linear and the non-linear relationships in the data. This can be accomplished by using two different models and forming an ensemble of the two (see the sketch right after this list).
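For illustration, the sketch below combines a linear model (logistic regression) and a non-linear model (a decision tree) using scikit-learn's VotingClassifier; the dataset and model choices are arbitrary:

```python
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import VotingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.tree import DecisionTreeClassifier

X, y = load_breast_cancer(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

# A linear model and a non-linear model voting together
ensemble = VotingClassifier(
    estimators=[
        ("linear", make_pipeline(StandardScaler(), LogisticRegression())),
        ("tree", DecisionTreeClassifier(random_state=42)),
    ],
    voting="hard",  # majority vote over the models' class predictions
)
ensemble.fit(X_train, y_train)
print(ensemble.score(X_test, y_test))
```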

Disadvantages of ensemble methods

  1. Reduced model interpretability - Using ensemble methods reduces model interpretability due to the increased complexity, and makes it very difficult to draw crucial business insights at the end.
  2. High computation and design time - This makes ensembles a poor fit for real-time applications.
  3. The selection of models for creating an ensemble is an art that is really hard to master.

So, now that we have covered all the fundamentals of ensemble methods, you can proceed to try them hands-on to enhance your understanding even further.

You can also check out my other posts on Machine Learning, which were well-received.

Watch this space for regular posts on Machine Learning, Data Science, and Statistics.

Thank you for reading!:)

Happy Learning:)

