A Comparative Study of Different CNN Models in City Detection Using Landmark Images

Junayed, Masum Shah; Jeny, Afsana Ahsan; Neehal, Nafis; Atik, Syeda Tanjila; Hossain, Syed Akhter

doi:10.1007/978-981-13-9181-1_48

Masum Shah Junayed⁹,
Afsana Ahsan Jeny⁹,
Nafis Neehal⁹,
Syeda Tanjila Atik⁹ &
…
Syed Akhter Hossain⁹

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1035))

Included in the following conference series:

International Conference on Recent Trends in Image Processing and Pattern Recognition

720 Accesses
8 Citations

Abstract

Navigation assistance using different local Landmarks is an emerging research field now-a-days. Landmark images taken from different camera angles are being vividly used alongside the GPS (Global Positioning System) data to determine the location of the user and help user with navigation. However, determining the location of the user by recognizing the landmarks from different images, without the help of GPS, can be a worthy research trend to explore. Hence, in this paper, we have conducted a comparative study of 3 different popular CNN models, namely - Inception V3, MobileNet and ResNet50, and they have achieved an overall accuracy of 99.7%, 99.5% and 99.7% respectively while determining cities using landmark images.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

https://en.wikipedia.org/wiki/Keras
https://arxiv.org/pdf/1512.00567.pdf
https://www.kaggle.com/keras/resnet50/home
https://arxiv.org/abs/1603.04467
Xia, X., Xu, C.: Inception-v3 for flower classification. In: 2017 2nd International Conference on Image, Vision and Computing (2017)
Google Scholar
http://wiki.ubc.ca/Course:CPSC522/Convolutional_Neural_Networks#cite_note-wiki-3
https://arxiv.org/abs/1704.04861
https://en.wikipedia.org/wiki/Confusion_matrix
https://www.pyimagesearch.com/2017/03/20/imagenet-vggnet-resnet-inception-xception-keras/
Mata, M., Armingol, J.M., de la Escalera, A., Salichs, M.A.: A visual landmark recognition system for topological navigation of mobile robots. In: Proceedings 2001 ICRA. IEEE International Conference on Robotics and Automation (Cat. No. 01CH37164, 21–26 May 2001
Google Scholar
Li, Y., Crandall, D.J., Huttenlocher, D.P.: Landmark classification in large-scale image collections. In: 2009 IEEE 12th International Conference on Computer Vision, 29 September 2009–2 October 2009
Google Scholar
Zheng, Y.-T., Zhao, M., Song, Y., Adam, H.: Tour the world: building a web-scale landmark recognition engine. In: 2009 IEEE Conference on Computer Vision and Pattern Recognition, 20–25 June 2009
Google Scholar
Elizalde, B., Chao, G.-L., Zeng, M., Lane, I.: City-identification of flickr videos using semantic acoustic features. arXiv: 1607.03257v1 [cs.MM], 12 July 2016
Gavai, N.R., Jakhade, Y.A., Tribhuvan, S.A., Bhattad, R.: MobileNets for flower classification using tensorflow. In: 2017 International Conference on Big Data, IoT and Data Science (BID), 20–22 December 2017. Vishwakarma Institute of Technology, Pune (2017)
Google Scholar
Kim, W., Choi, H.-K., Jang, B.-T., Lim, J.: Driver distraction detection using single convolutional neural network. In: 2017 International Conference on Information and Communication Technology Convergence (ICTC), 18–20 October 2017
Google Scholar
https://en.wikipedia.org/wiki/Flowchart
https://en.wikipedia.org/wiki/Precision_and_recall
He, K., Zhang, X., Ren, S., Sun, J: Deep residual learning for image recognition. arXiv: 1512.03385v1 [cs.CV], 10 December 2015
https://github.com/keras-team/keras/issues/3755
https://datascience.stackexchange.com/questions/15989/micro-average-vs-macro-average-performance-in-a-multiclass-classification
Li, X., Wu, C., Zach, C., Lazebnik, S., Frahm, J.-M.: Using iconic scene graphs for modeling and recognition of landmark images collections, 16 April 2011
Google Scholar

Download references

Author information

Authors and Affiliations

Daffodil International University, Dhaka, 1207, Bangladesh
Masum Shah Junayed, Afsana Ahsan Jeny, Nafis Neehal, Syeda Tanjila Atik & Syed Akhter Hossain

Authors

Masum Shah Junayed
View author publications
You can also search for this author in PubMed Google Scholar
Afsana Ahsan Jeny
View author publications
You can also search for this author in PubMed Google Scholar
Nafis Neehal
View author publications
You can also search for this author in PubMed Google Scholar
Syeda Tanjila Atik
View author publications
You can also search for this author in PubMed Google Scholar
Syed Akhter Hossain
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Afsana Ahsan Jeny .

Editor information

Editors and Affiliations

Department of Computer Science, University of South Dakota, Vermillion, SD, USA
K. C. Santosh
Solapur University, Solapur, India
Ravindra S. Hegadi

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Junayed, M.S., Jeny, A.A., Neehal, N., Atik, S.T., Hossain, S.A. (2019). A Comparative Study of Different CNN Models in City Detection Using Landmark Images. In: Santosh, K., Hegadi, R. (eds) Recent Trends in Image Processing and Pattern Recognition. RTIP2R 2018. Communications in Computer and Information Science, vol 1035. Springer, Singapore. https://doi.org/10.1007/978-981-13-9181-1_48

Download citation

DOI: https://doi.org/10.1007/978-981-13-9181-1_48
Published: 20 July 2019
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-9180-4
Online ISBN: 978-981-13-9181-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics