Traffic Sign Classification using Residual Networks(ResNet)

栏目: IT技术 · 发布时间: 4年前

内容简介:Deep Convolutional Neural Networks(CNNs)are widely used to solve various computer vision tasks in the field of Artificial Intelligence. This article focuses on developing a deep learning model in order to recognize traffic signs. :x::no_entry_sign::no_pede

Deep Residual Learning to Classify Traffic Signs

Jun 6 ·4min read

Traffic Sign Classification using Residual Networks(ResNet)

Photo by Hanns Adrian Böhme on Unsplash

Deep Convolutional Neural Networks(CNNs)are widely used to solve various computer vision tasks in the field of Artificial Intelligence. This article focuses on developing a deep learning model in order to recognize traffic signs. :x::no_entry_sign::no_pedestrians::no_bicycles:

Table of Contents

  • Data Analysis
  • Create a ResNet Model
  • Model Training
  • Model Evaluation
  • Predictions
  • References

First of all, we need a dataset to train the deep learning model to recognize traffic signs. Kaggle Datasets is the best platform to find datasets for different tasks. Such as Machine Learning(ML), Deep Learning(DL), and Data Science.

Here is one of the datasets contains nearly 73,139 diverse images of traffic signs of 43 classes.

Data Analysis

In this section, we are going to use a simple way to analyze the dataset.

Here is a simple count plot to analyze the spread of data in classes. The below code is used to plot the graph:

Traffic Sign Classification using Residual Networks(ResNet)

Countplot w.r.t to classes

Let’s visualize some of the samples from the dataset. This will help us to understand the data. The below code serves the purpose by plotting 100 images from the dataset.

Traffic Sign Classification using Residual Networks(ResNet)

Images from the Dataset

Create a ResNet Model

In this section, we are going to create a deep learning model to recognize traffic signs.

Residual Network(ResNet)

Microsoft introduced the deep residual learning framework to overcome the ‘degradation’ problem which is a hard optimization task. The shortcut connections i.e., skipping one or more layers as shown in the below figure.

Traffic Sign Classification using Residual Networks(ResNet)

Skip Connection in Residual Networks

These shortcut connections perform identity mapping and the outputs are added to the outputs of stacked layers. This has solved many problems such as :

  • Easy to optimize
  • It gains accuracy from greatly increased depth, producing results that are better than previous network architectures.

For a better understanding of deep residual learning. Use the research paper entitled ‘Deep Residual Learning for Image Recognition’ which is freely available on arxiv.

We are going to use the TensorFlow applications module which provides different popular deep learning models with pretrained weights to use.

We are going to use ResNet50 architecture without pretrained weights. We add the dense layer with softmax activation at the end to predict the classes. Below is used to create the model.

You can see the visualization of the model created using the plot_model method.

Model Training

These are the parameters used during the training process. The batch size as 32, epochs 50, learning rate as 0.001, loss metric ‘Categorical Cross Entropy’, optimizer as ‘Adam’. The callbacks ModelCheckpoint, EarlyStopping, ReduceLROnPlateau, and CSVLogger are used in the training of the ResNet50 model. You can use the below link for understanding the nuts and bolts of callbacks.

The below code is used to compile and fit the model.

The graphs between accuracy over epochs on training and validation data.

Traffic Sign Classification using Residual Networks(ResNet)

Accuracy Graph

The graph between loss over epochs on training and validation data.

Traffic Sign Classification using Residual Networks(ResNet)

Loss Graph

You can see the loss and accuracy converges after 20 epochs.

Model Evaluation

Classification Report

Let’s see the classification report which helps to evaluate the model.

The output results in the form precision, recall, F1 score with respect to each class.

Confusion Matrix

The confusion matrix is used to describe the performance of the classification model. The below code is used to generate confusion matrix:

The resultant confusion matrix is shown below:

Traffic Sign Classification using Residual Networks(ResNet)

Confusion Matrix

Classwise Accuracy

The classwise accuracy can be derived using the below code:

Predictions

Few samples from the unseen data are used for predicting the class labels using the trained ResNet50 model. The below code is used for this purpose:

The prediction of the unseen data is shown below:

Traffic Sign Classification using Residual Networks(ResNet)

Predictions

The code that I have written for the task is available as Kaggle Notebook. Feel free to use it. Here is the link:

References


以上就是本文的全部内容,希望对大家的学习有所帮助,也希望大家多多支持 码农网

查看所有标签

猜你喜欢:

本站部分资源来源于网络,本站转载出于传递更多信息之目的,版权归原作者或者来源机构所有,如转载稿涉及版权问题,请联系我们

计算机视觉

计算机视觉

Richard Szeliski / 艾海舟、兴军亮 / 清华大学出版社 / 2012-1 / 109.00元

《计算机视觉——算法与应用》探索了用于分析和解释图像的各种常用技术,描述了具有一定挑战性的视觉应用方面的成功实例,兼顾专业的医学成像和图像编辑与交织之类有趣的大众应用,以便学生能够将其应用于自己的照片和视频,从中获得成就感和乐趣。本书从科学的角度介绍基本的视觉问题,将成像过程的物理模型公式化,然后在此基础上生成对场景的逼真描述。作者还运用统计模型来分析和运用严格的工程方法来解决这些问题。 本......一起来看看 《计算机视觉》 这本书的介绍吧!

在线进制转换器
在线进制转换器

各进制数互转换器

UNIX 时间戳转换
UNIX 时间戳转换

UNIX 时间戳转换