Abstract
In this paper, a Modified Capsule Neural Network (Mod-CapsNet) with a pooling layer but without the squash function is used for recognition of indoor home scenes which are represented in grayscale. This Mod-CapsNet produced an accuracy of 70% compared to the 17.2% accuracy produced by a standard CapsNet. Since there is a lack of larger datasets related to indoor home scenes, to obtain better accuracy with smaller datasets is also one of the important aims in the paper. The number of images used for training and testing is 20,000 and 5000 respectively, all of dimension 128X128. The analysis proves that in the indoor home scene recognition task the combination of the capsule without a squash function and with max-pooling layers works better than by using capsules with convolutional layers. Indoor home scenes are specifically focused towards analysing capsules performance on datasets whose images have similarities but are, nonetheless, quite different. For example, tables may be present in living rooms and dining rooms even though these are quite different rooms.
Original language | English |
---|---|
Title of host publication | 2020 International Joint Conference on Neural Networks (IJCNN) |
Place of Publication | Piscataway, NJ. |
Publisher | IEEE |
Number of pages | 6 |
ISBN (Print) | 9781728169279 |
DOIs | |
Publication status | Published - 28 Aug 2020 |
Event | 2020 International Joint Conference on Neural Networks (IJCNN)- IEEE World congress on computational intelligence(WCCI) 2020: IJCNN - glasgow, Glasgow, United Kingdom Duration: 19 Jul 2020 → 24 Jul 2020 Conference number: 48605X https://wcci2020.org/ijcnn-sessions/ https://wcci2020.org/ |
Conference
Conference | 2020 International Joint Conference on Neural Networks (IJCNN)- IEEE World congress on computational intelligence(WCCI) 2020 |
---|---|
Abbreviated title | IJCNN |
Country/Territory | United Kingdom |
City | Glasgow |
Period | 19/07/20 → 24/07/20 |
Internet address |
Keywords
- Capsule Neural Network
- Modified Capsule Neural Network
- capsules
- pooling layer
- scene recognition