Food for Thought — Paper Tuesday

栏目: IT技术 · 发布时间: 6年前

内容简介:Every Tuesday I highlight an interesting paper that I came across in research or work. I hope that my review can help you get the juiciest part of the paper under 2 minutes!It is widely accepted that CNNs learn images by extracting shape features like curv

Food for Thought — Paper Tuesday

Use Shape-biased Data Improves Accuracy and Robustness

Every Tuesday I highlight an interesting paper that I came across in research or work. I hope that my review can help you get the juiciest part of the paper under 2 minutes!

Basic Ideas

It is widely accepted that CNNs learn images by extracting shape features like curves and edges. However, a group of researchers from University of Tuebingen and Edinburgh challenged this belief in their ICLR 2019 paper ImageNet-Trained CNNs Are Biased Towards Texture; Increasing Shape Bias Improves Accuracy and Robustness .

From the paper

Here’s the link: https://openreview.net/pdf?id=Bygh9j09KX

By cleverly crafting several experiments, the researchers demonstrated CNNs are more biased toward image textures than people would expect. From that, they further found that shape-enhanced dataset can serve as an effective data augmentation method that improves model accuracy and robustness.

The researchers argued that CNNs are heavily biased toward local features, perhaps due to the small perception field of convolution filters. Their argument is supported by CNN’s surprisingly low performance on texture-free images demonstrated in the image below

From the paper

As demonstrated in the figure, all major-league architectures like AlexNet, GoogleNet, and VGG16 experience significant performance drops when texture information is removed (silhouette and edge). Meanwhile, CNNs yield high confidence even when shape information is removed as long as texture is present (texture).

Results

In order to further test their hypothesis, the researchers generated a new dataset called Stylized-ImageNet (SIN), whose images’ local texture features are replaced by uninformative random features.

From the paper

If CNNs are biased toward local texture features, we would expect CNNs trained on the original ImageNet dataset to perform poorly on SIN. This is indeed the case, as demonstrated in the following table

From the paper

Okay, now we are convinced that CNNs are biased toward local textures. But how can we use this inforamtion to our advantage? The researchers demonstrated that models jointly trained on SIN and IN is more robust to image distortion (noise, cropping, filtering…) and achieve outstanding accuracies in image classification and object detection.

From the paper
From the paper

Some Thoughts

For a long time I was convinced that CNNs are capable of image classification because of its powerful edge detectors. This paper opened a new door for us — there are some many handwaving explaination and understanding of neural networks. There are still a lot of theoretical work ahead to understand even one of the simplest forms of neural networks!


以上就是本文的全部内容,希望本文的内容对大家的学习或者工作能带来一定的帮助,也希望大家多多支持 码农网

查看所有标签

猜你喜欢:

本站部分资源来源于网络,本站转载出于传递更多信息之目的,版权归原作者或者来源机构所有,如转载稿涉及版权问题,请联系我们

数学建模(原书第5版)

数学建模(原书第5版)

[美] Frank R. Giordano、[美] William P.Fox、[美] Steven B.Horton / 叶其孝、姜启源 / 机械工业出版社 / 2014-10-1 / 99.00元

《华章数学译丛:数学建模(原书第5版)》旨在指导学生初步掌握数学建模的思想和方法,共分两大部分:离散建模和连续建模,通过本书的学习,学生将有机会在创造性模型和经验模型的构建、模型分析以及模型研究方面进行实践,增强解决问题的能力。 《华章数学译丛:数学建模(原书第5版)》对于用到的数学知识力求深入浅出,涉及的应用领域相当广泛,适合作为高等院校相关专业的数学建模教材和参考书,也可作为参加国内外数......一起来看看 《数学建模(原书第5版)》 这本书的介绍吧!

图片转BASE64编码
图片转BASE64编码

在线图片转Base64编码工具

XML 在线格式化
XML 在线格式化

在线 XML 格式化压缩工具

html转js在线工具
html转js在线工具

html转js在线工具