A Scene Recognition Algorithm Based On Multi-Instance Learning

Tao Wang; Wenqing Chen; Bailing Wang

doi:10.21307/ijssis-2017-716

International Journal on Smart Sensing and Intelligent Systems

Volume 7 (2014): Issue 4 (January 2014)

Open Access

Dec 2014

Abstract

In the Bag of Words image representation model, visual words are generated by unsupervised clustering, which discards the spatial relations between words and leads to shortcomings such as limited semantic description and weak discrimination. To solve this problem, we propose in this article to replace visual words with visual phrases. Visual phrases, built according to the spatial relations between words, are semantically discriminative and can improve the accuracy of the Bag of Words model. Because the traditional classification method based on the Bag of Words model is vulnerable to background clutter, occlusion, and scale variation in an image, we also propose a multiple visual words learning method for image classification, which combines the concept of visual phrases with Multiple Instance Learning. The final classification model is able to capture the spatial features of image classes. Experiments on two standard image test sets, Caltech 101 and Scene 15, show that the algorithm performs satisfactorily.

DOI: https://doi.org/10.21307/ijssis-2017-716 | Journal eISSN: 1178-5608

Language: English

Page range: 1470 - 1492

Submitted on: May 17, 2014

Accepted on: Oct 12, 2014

Published on: Dec 1, 2014

Published by: Macquarie University, Australia

In partnership with: Paradigm Publishing Services

Publication frequency: 1 issue per year

Keywords: Image Classification, Multiple Kernel Learning, Bag of Visual Words, Spatial Pyramid Matching

Related subjects: Engineering; Introductions and overviews; Engineering, other

© 2014 Tao Wang, Wenqing Chen, Bailing Wang, published by Macquarie University, Australia
This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License.
