Doesn't CNN Know The Location?
3 main points
✔️ CNN encodes location information
✔️ 0 padding to learn location information
✔️ The location information is stored in deeper layers.
How Much Position Information Do Convolutional Neural Networks Encode?
written by Md Amirul Islam, Sen Jia, Neil D. B. Bruce
(Submitted on 22 Jan 2020)
Comments: Accepted to ICLR2020
Subjects: Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Paper Official Code COMM Code
Introduction
CNN has been successful in many computer vision tasks such as classification and detection tasks. Basically, CNNs are mainly used as a feature extractor for objects. This is because they are trained by local filters and are basically considered to have no location information. Therefore, this point has been pointed out, and models that mainly acquire location information, such as capsule nets, have been proposed. So many of you think that CNN has very little location information?
However, this study may change that implicit idea.
See Figure 1.
Figure 1: Prominent areas of the image.
Look at the top row. On the left, there is a pronounced area on the man in the middle. Similarly, you can see that most of the images have a pronounced area around the middle. The next image is shown in the bottom row with the image cropped by the blue line in the top row. Notice something strange here. Since we have only cropped the image, there is no change in the texture and so on, so there should be no change in the noticeable area. However, the salient area of the cropped image is clearly shifted to the left (the center region). (I think this is because we learned that the salient region is in the center of the image in terms of the training image, which is why this happened. ) This is contrary to our implicit belief that CNNs do not have location information.
From this result, the authors hypothesize that CNN may be encoding (acquiring) location information that humans just don't understand. This article is about CNN's location information.
This paper was accepted into ICLR2020 and all of the reviewers gave it a perfect score (8 points) (only 34 out of 2594 papers received a perfect score). If you are interested in the scores of the other papers, see here.
To read more,
Please register with AI-SCHOLAR.
ORCategories related to this article