Open Access Open Access  Restricted Access Subscription or Fee Access

Comparison of Self Developed Deep CNN Architecture with State of Art Existing Techniques

N. K. Darwante, Dr. U. B. Shinde

Abstract


The growth of consistent and tough image identification systems is a main challenge in artificial intelligence and computer vision. Understanding to identify objects requires classifier that are able to process relevant information under image variation like scaling, rotations, light and background changes, etc. Deep Convolutional Neural Networks can be trained using standard image datasets and own datasets but an optimized Deep CNN systems designed for more accuracy with minimum training images.

In our system, major finding includes various training methods for deep CNN architectures on existing and our own developed dataset which is the collection of real images. In our enhanced method proved that our deep CNN architecture get better accuracy and quality (attribute) properties with minimum training.


Keywords


Convolutional, NavNet, Darwante, Deep, CNN

Full Text:

PDF

References


J. Sung, S. H. Jin, I. Lenz, and A. Saxena, “Robobarista: Learning to Manipulate Novel Objects via Deep Multimodal Embedding,” ArXiv e-prints, Jan. 2016.

L. Pinto and A. Gupta, “Supersizing self-supervision: Learning to grasp from 50k tries and 700 robot hours,” in 2016 IEEE International Conference on Robotics and Automation (ICRA), May2016, pp.3406– 3413.

C. Ciliberto, S. R. Fanello, L. Natale, and G. Metta, “A heteroscedastic approach to independent motion detection for actuated visual sensors,” in 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems. IEEE, 2012, pp. 3907–3913.

L. van der Maaten, “Accelerating t-sne using tree-based algorithms,” Journal of Machine Learning Research, vol. 15, pp. 3221–3245, 2014. [Online]. Available: http://jmlr.org/papers/v15/vandermaaten14a.html

A. Tacchetti, P. K. Mallapragada, M. Santoro, and L. Rosasco, “GURLS: A Least Squares Library for Supervised Learning,” Journal of Machine Learning Research, vol. 14, pp. 3201–3205, 2013. [Online]. Available:http://jmlr.org/papers/v14/tacchetti13a.html.

K. He, X. Zhang, S. Ren, and J. Sun, “Deep residual learning for image recognition,” in The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June2016.

N. Snderhauf, F. Dayoub, S. McMahon, B. Talbot, R. Schulz, P. Corke, G. Wyeth, B. Upcroft, and M. Milford, “Place categorization and semantic mapping on a mobile robot,” in 2016 IEEE International Conference on Robotics and Automation (ICRA), May2016, pp.5729– 5736.

Snderhauf, S. Shirazi, F. Dayoub, B. Upcroft, and M. Milford, “On the performance of convnet features for place recognition,” in Intelligent Robots and Systems (IROS), 2015 IEEE/RSJ International Conference on, Sept2015, pp.4297–4304.

J P. Sermanet, D. Eigen, X. Zhang, M. Mathieu, R. Fergus, and Y. Le- Cun, “OverFeat: Integrated Recognition, Localization and Detection using Convolutional Networks,” ArXiv e-prints, Dec. 2013.

G. Metta, L. Natale, F. Nori, G. Sandini, D. Vernon, L. Fadiga, C. von Hofsten, K. Rosander, M. Lopes, J. Santos-Victor, A. Bernardino, and L. Montesano, “The icub humanoid robot: an open-systems platform for research in cognitive development.” Neural networks : the official journal of the International Neural Network Society, vol. 23, no. 8-9, pp. 1125–34, 1 2010.

L. Montesano, “The icub humanoid robot: an open-systems platform for research in cognitive development.” Neural networks : the official journal of the International Neural Network Society, vol. 23, no. 8-9, pp. 1125–34, 1 2010.

D. Held, S. Thrun, and S. Savarese, “Robust single-view instance recognition,” in 2016 IEEE International Conference on Robotics and Automation (ICRA), May 2016, pp. 2152–2159.

Y. LeCun, B. Boser, J. S. Denker, D. Henderson, R. E. Howard, W. Hubbard, and L. D. Jackel, “Backpropagation applied to hand- written zip code recognition,” Neural computation, vol. 1, no. 4, pp. 541–551, 1989.

Y. Jia, E. Shelhamer, J. Donahue, S. Karayev, J. Long, R. Girshick, S. Guadarrama, and T. Darrell, “Caffe: Convolutional architecture for fast feature embedding,” in Proceedings of the ACM International Conference on Multimedia - MM ’14. ACM Press, 11 2014, pp. 675–678.

K. Lai, L. Bo, X. Ren, and D. Fox, “A large-scale hierarchical multi-view RGB-D object dataset,” in 2011 IEEE International Conference on Robotics and Automation. IEEE, 5 2011, pp. 1817– 1824. [Online]. Available: http://ieeexplore.ieee.org/articleDetails.jsp?arnumber=5980382

S. R. Fanello, C. Ciliberto, L. Natale, and G. Metta, “Weakly supervised strategies for natural object recognition in robotics,” IEEE International Conference on Robotics and Automation, pp. 4223–4229, 5 2013.

C. Ciliberto, S. R. Fanello, M. Santoro, L. Natale, G. Metta, and L. Rosasco, “On the impact of learning hierarchical representations for visual recognition in robotics,” in 2013 IEEE/RSJ International Conference on Intelligent Robots and Systems, Nov 2013, pp. 3759– 3764.

G. Pasquale, C. Ciliberto, F.Odone, L. Rosasco, and L. Natale, “Teaching iCub to recognize objects using deep Convolutional Neural Networks,” vol. 43, 2015, pp. 21–25. [Online]. Available: http://www.jmlr.org/proceedings/papers/v43/pasquale15

C. Ciliberto, U. Pattacini, L. Natale, F. Nori, and G. Metta, “Re- examining lucas-kanade method for real-time independent motion detection: Application to the icub humanoid robot,” in 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems. IEEE, 2011, pp. 4154–4160.

O. Russakovsky, J. Deng, H. Su, J. Krause, S. Satheesh, S. Ma, Z. Huang, A. Karpathy, A. Khosla, M. Bernstein, A. C. Berg, and L. Fei-Fei, “Imagenet large scale visual recognition challenge,” International Journal of Computer Vision, vol. 115, no. 3, pp. 211– 252, 2015. [Online]. Available: http://dx.doi.org/10.1007/s11263-015- 0816-y

C. M. Bishop, Pattern Recognition and Machine Learning (Information Science and Statistics). Secaucus, NJ, USA: Springer-Verlag NewYork, Inc.,2006.

M. Schwarz, H. Schulz, and S. Behnke, “Rgb-d object recognition and pose estimation based on pre-trained convolutional neural network features,” in 2015 IEEE International Conference on Robotics and Automation(ICRA),May2015,pp.1329–1335.

M. D. Zeiler and R. Fergus, Visualizing and Understanding Convolutional Networks. Cham: Springer International Publishing, 2014,pp.818–833.[Online].Available:http://dx.doi.org/10.1007/978-3-319-10590-1 53

M. Oquab, L. Bottou, I. Laptev, and J. Sivic, “Learning and transferring mid-level image representations using convolutional neural networks,” in The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2014.

A. Krizhevsky, I. Sutskever, and G. E. Hinton, “Imagenet classification with deep convolutional neural networks,” in Advances in Neural Information Processing Systems 25, F. Pereira, C. J. C. Burges,L. Bottou, and K. Q. Weinberger, Eds. Curran Associates, Inc., 2012, pp. 1097–1105. [Online]. Available: http://papers.nips.cc/paper/4824- imagenet-classification-with-deep-convolutional-neural-networks.pdf

C. Szegedy, W. Liu, Y. Jia, P. Sermanet, S. Reed, D. Anguelov, D. Erhan, V. Vanhoucke, and A. Rabinovich, “Going deeper with convolutions,” in The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2015.

K. Chatfield, K. Simonyan, A. Vedaldi, and A. Zisserman, “Return of the devil in the details: Delving deep into convolutional nets,” in British Machine Vision Conference, 2014.

K. Simonyan and A. Zisserman, “Very deep convolutional networks for large-scale image recognition,” in International Conference on Learning Representations, 2015.

K. He, X. Zhang, S. Ren, and J. Sun, “Delving deep into rectifiers: Surpassing human-level performance on imagenet classification,” in The IEEE International Conference on Computer Vision (ICCV), December 2015.

A. Eitel, J. T. Springenberg, L. Spinello, M. Riedmiller, and W. Bur- gard, “Multimodal deep learning for robust rgb-d object recognition,” in Intelligent Robots and Systems (IROS), 2015 IEEE/RSJ International Conference on, Sept 2015, pp. 681–687.

D.C. Cire¸san, U. Meier, J. Masci, L.M. Gambardella, and J. Schmidhuber. High-performance neural networks for visual object classification. Arxiv preprint arXiv: 1102.0183, 2011.

J. Deng, W. Dong, R. Socher, L.-J. Li, K. Li, and L. Fei-Fei. ImageNet: A Large-Scale Hierarchical Image Database. In CVPR09, 2009.


Refbacks

  • There are currently no refbacks.