Hybrid of Active Learning and Dynamic Self-Training for Data Stream Classification

  • Sogol Haghani Department of Computer Engineering and Data mining Lab Alzahra University Tehran, Iran
  • Mohammad Reza Keyvanpour Department of Computer Engineering Alzahra University Tehran, Iran
  • Mahnoosh Kholghi Computer Engineering Department, Member of young research club Islamic Azad University, Qazvin Branch Qazvin, Iran
Keywords: Computer Science, Data Mining, Semi-supervised learning, Classification; Data Stream

Abstract

Most of the data stream classification methods need plenty of labeled samples to achieve a reasonable result. However, in a real data stream environment, it is crucial and expensive to obtain labeled samples, unlike the unlabeled ones. Although Active learning is one way to tackle this challenge, it ignores the effect of unlabeled instances utilization that can help with strength supervised learning. This paper proposes a hybrid framework named “DSeSAL”, which combines active learning and dynamic self-training to achieve both strengths. Also, this framework introduces variance based self-training that uses minimal variance as a confidence measure. Since an early mistake by the base classifier in self-training can reinforce itself by generating incorrectly labeled data, especially in multi-class condition. A dynamic approach to avoid classifier accuracy deterioration, is considered. The other capability of the proposed framework is controlling the accuracy reduction by specifying a tolerance measure. To overcome data stream challenges, i.e., infinite length and evolving nature, we use the chunking method along with a classifier ensemble. A classifier is trained on each chunk and with previous classifiers form an ensemble of M such classifiers. Experimental results on synthetic and real-world data indicate the performance of the proposed framework in comparison with other approaches.

Downloads

Download data is not yet available.

Author Biographies

Sogol Haghani, Department of Computer Engineering and Data mining Lab Alzahra University Tehran, Iran

received her B.S. in computer science from Kharazmi University, Tehran, Iran. She is currently working toward her master degree in the Department of computer engineering and data mining laboratory at Alzahra University, Tehran, Iran. Her research interests include data mining such as social networks and artificial neural networks.

Mohammad Reza Keyvanpour, Department of Computer Engineering Alzahra University Tehran, Iran

is an Associate Professor at Alzahra University, Tehran, Iran. He received his B.S. in software engineering from Iran University of Science & Technology, Tehran, Iran. He received his M. S. and Ph.D. in software engineering from Tarbiat Modares University, Tehran, Iran. His research interests include information retrieval and data mining.

Mahnoosh Kholghi, Computer Engineering Department, Member of young research club Islamic Azad University, Qazvin Branch Qazvin, Iran

received her B.S. in Software Engineering from Islamic Azad University, Karaj Branch, Karaj, Iran. She also received her M.S. in Software Engineering at Islamic Azad University, Qazvin Branch, Qazvin, Iran. Her research interests include Data Stream Mining and Machine Learning.

References

[1] K. Romer and F. Mattern, "The design space of wireless sensor networks", IEEE Wireless Communications, vol. 11, no. 6, pp. 54–61, 2004. [2] http://www.marketsandmarkets.com/Market-Reports/ wirelesssensor-networks-market-445.html, 2015. [3] P. Harrop, R. Das, "Wireless Sensor Networks (WSN) 20142024: Forecasts, Technologies, Players. [4] S. Madria, V. Kumar, R. Dalvi, "Sensor-Cloud: A Cloud of Virtual Sensors", IEEE Software, vol. 31, no. 2, 2014.
Volume 9- Number 4 – Autumn 2017 26

[5] Y. Liu, K. Ong, and A. Goscinski, "Sensor-Cloud Computing: Novel Applications and Research Problems", Networked Digital Technologies Communications in Computer and Information Science, vol. 294, pp. 475-486, 2012.
[6] P. You and Z. Huang, "Towards an Extensible and Secure Cloud Architecture Model for Sensor Information System", International Journal of Distributed Sensor Networks, 2013. [7] R. Buyya, C. S. Yeo and et al, "Market Oriented Cloud Computing: Vision, Hype and Reality for Delivering IT Services as Computing Utilities", In Proc. of 10th IEEE Conference on HPCC’08, September 2008.
[8] S. Sakr, A. Liu, D. M. Batista, and M. ALomari, "A Survey of Large Scale Data Management Approaches in Cloud Environments", IEEE Communications surveys & tutorials, vol. 13, NO. 3, pp. 311-336, 2011.
[9] M. Yuriyama, T. Kushida, "Sensor-Cloud Infrastructure: Physical Sensor Management with Virtualized Sensors on Cloud Computing", 13th International Conference on Network-based Information Systems, 2010.
[10] S. Misra, S. Chatterjee, M. S. Obaidat, "On Theoretical Modeling of Sensor Cloud: A Paradigm Shift from Wireless Sensor Network", IEEE SYSTEM JOURNAL, vol. 11(2), pp. 1084-1093, 2014.
[11] S. Chatterjee, S. Misra, "Optimal Composition of a Virtual Sensor for Efficient Virtualization Within Sensor-cloud", IEEE ICC 2015 SAC, 2015.
[12] Ch. Zhu, C. M. Leung, E. Ngai, L. Yang, L. Shu, X. Li, "Pricing Models for Sensor-Cloud", cloud computing technology and science, 2015. [13] S. Chatterjee, R. Ladia, S. Misra, "Dynamic Optimal Pricing for Heterogeneous Service-Oriented Architecture of Sensor-cloud Infrastructure", IEEE Transaction on Service Computing, vol. 10(2), pp. 203-216, 2015. [14] S. K. Dash, J. P. Sahoo, S. Mohapatra, and S. P. Pati, "Sensorcloud assimilation of wireless sensor network and the cloud, in Advances in Computer Science and Information Technology", Lecture Notes in Networks and Communications, vol. 84, (2012), pp. 455–464, Springer, 2012.
[15] Y. Lyu, F. Yan, Y. Chen, D. Wang, Y. Shi, N. Agoulmine, "High-performance scheduling model for multisensory gateway of cloud sensor system-based smart-living", Information Fusion (2015), pp. 42-56, 2015.
[16] M. Kurz, G. Holzl, and A. Ferscha, “Goal-Oriented Opportunistic Sensor Clouds”, On the Move to Meaningful Internet Systems: OTM, Lecture Notes in Computer Science, Vol. 7566, (2012), pp.602-619, 2012 [17] Ch. Doukas, I. Maglogiannis, "Managing Wearable Sensor Data through Cloud Computing", IEEE third international conference on cloud computing Technology and Sience (CloudCom), 2011. [18] P. Mell and T. Grance, Draft nist working definition of cloud computing - v15, 2011. [19] P. Rawat, K. Deep Singh, H. Chaouchi, J. Marie Bonnin, "Wireless sensor networks: a survey on recent developments and potential synergies", supercompt, 68:1-48, 2014. [20] N. Mohamed, J. Al-Jaroodi, "A survey on serviceoriented middleware for wireless sensor networks, Service Oriented Computing and Applications", vol. 5, issue 2, pp. 71-85, 2011. [21] Y. Choi, Y. Hong, "Study of Coupling of Software-Defined Networking and Wireless Sensor Networks", ICUFN, 2016. [22] T. Luo, H. Tan, T. Quek, "Sensor OpenFlow: Enabling Software-Defined Wireless Sensor Networks", IEEE Communications Letter, vol. 16, no. 11, 2012.
[23] M. Eggert and et al, Sensor Cloud: Towards the Interdisciplinary Development of a Trustworthy Platform for
Globally Interconnected Sensors and Actuators, Trusted Cloud Computing (2014), pp.203-218, 2014. [24] M. Mahmoud and X. Shen, “A cloud-based scheme for protecting source-location privacy against hotspot-locating attack in wireless sensor networks,” IEEE Transactions on Parallel and Distributed Systems, vol. 23(10), pp. 1805–1818, Oct 2012. [25] Ch. Zhu, Z. Sheng, V. C. M. Leung, L. Shu, and L. T. Yang, "Towards Offering More Useful Data Reliably to Mobile Cloud from Wireless Sensor Network", IEEE TRANSACTION ON EMERGING IN COMPUTING, vol. 3(1), pp. 84-94, 2014. [26] P.Pande, A. R. Padwalkar, “Internet of Things-A Future of Intenet: A Survey”, International Journal of Advanced Research in Computer Science and management Studies, vol. 2(2), 2014. [27] F. Salim, U. Haque, Urban computing in the wild: A survey on large scale participation and citizen engagement with ubiquitous computing, cyber physical systems, and Internet of Things, International Journal of Human-Computer Studies (2015), http://dx.doi.org/10.1016/j.ijhcs.2015.03.003. [28] K. T. Lan, What’s Next? Sensor+Cloud?, in Proceeding of the 7th International Workshop on Data Management for Sensor Networks, pp. 978–971, ACM Digital Library, 2010. [29] R. S. Ponmagal and J. Raja, An extensible cloud architecture model for heterogeneous sensor services, International Journal of Computer Science and Information Security,9(1), 2011. [30] I. Khan, F. Belqasmi, R. Glitho, N. Crespi, M. Morrow, P. Polakos, "Wireless Sensor Network Virtualization: A Survey", IEEE communication Survey & Tutorials, Vol. 18(1), 2016.
[31] S. Misra, A. Singh, S. Chatterjee, A. K. Mandal, "QoS-aware sensor allocation for target tracking in sensor-cloud", Ad Hoc Networks, vol. 33, pp. 140-153, 2015. [32] S. Chatterjee, S. Misra, S. U. Khan, "Optimal Data Center Scheduling for Quality of Service Management in Sensorcloud", IEEE Transaction on Cloud Computing, pp.1, 2015. [33] Sensor-Cloud. Available online: http://www.sensorcloud.com /news/shelburne-vineyard-relieswireless-sensors-and-cloudmonitor -its-vines. [34] Z. Khalid, N. Fisal, M. Rozaini, "A Survey of Middleware for Sensor and Network Virtualization", Sensors, vol. 14, pp. 24046-24097, 2014.
[35] Y. Xu, S. Helal, M. T. Thai, M. Schmalz, "Optimizing Push/Pull Envelopes for Energy-Efficient Cloud-Sensor Systems", in Proceedings of the 14th ACM international conference (MSWiM ’11), 2011. [36] Ch. Zhu, V. C. M. Leung, T. Yang, L. Shu, "Collaborative Location-based Sleep Scheduling for Wireless Sensor Networks Integrated with Mobile Cloud Computing", IEEE Transaction on computers, 64(7), (2014), pp. 1844-1856, 2014. [37] D. Phan, J. Suzuki, Sh. Omura, K. Oba, and A. Vasilakos, "Multi-objective Communication Optimization for Cloudintegrated Body Sensor Networks", 14th IEEE/ACM International Symposium on cluster, cloud and Grid Computing, 2014. [38] R. Hummen, M. Henze, D. Catrein, K. Wehrle, "A Cloud Design for User-controlled Storage and Processing of Sensor Data", IEEE 4th International Conference on Cloud Computing Technology and Science (CloudCom), 2012. [39] M. Henze, R. Hummen., K. Wehrle, "The Cloud Needs CrossLayer Data Handling Annotations", IEEE Security and Privacy Workshops, 2013. [40] M. Henze, M. Gro_fengels, M. Koprowski, K. Wehrle, "Towards Data Handling Requirements-aware Cloud Computing", IEEE International Conference on Cloud Computing Technology and Science (CloudCom), 2013.
Volume 9- Number 4 – Autumn 2017 27

[41] T-D. Nguyen, E-N. Huh, "An efficient key management for secure multicast in sensor-cloud", First ACIS/JNU International Conference on Computers, Networks, Systems, and Industrial Engineering, 2011.
[42] H. A. Dinesha, R. Monica, V. K. Agrawal, "Formal Modeling for Multi-Level Authentication in Sensor-Cloud Integration System", International Journal of Applied Information System (IJAIS), 2(3), 2012.
[43] M. Henze, R. Hummen, R. Matzutt, D. Catrein, K. Wehrle," Maintaining User Control While Storing and Processing Sensor Data in the Cloud", International Journal of Grid and High-Performance Computing, 5(4), pp. 97-112, 2013. [44] L. Ramaswamy, V. Lawson, S. V. Gogineni, "Toward a A Quality-Centric Big Data Architecture for Federated Sensor Services", IEEE International Congress on big Data, 2013.
[45] L. D. P. Mendes, J. P. C. Rodrigues, J. Lloret, S. Sendra, "CrossLayer Dynamic Admission Control for Cloud-based Multimedia Sensor Networks", IEEE SYSTEMS JOURNAL, 8(1), pp.235246, 2013.
[46] J. Melchor, M. Fukuda, "A Design of Flexible Data Channels for Sensor-Cloud Integration", Proceedings of the International Conference on Systems Engineering (ICSEng’2011), pp. 251256, 2011.
[47] H. T.Dinh, C. Lee, D. Niyato, and P. Wang, "A Survey of Mobile Cloud Computing: Architecture, Applications, and Approaches", Wireless Communications and Mobile Computing WileyOnline Library, 2011.
Volume 9- Number 4-5-Autumn 2017
Published
2018-08-11
How to Cite
Haghani, S., Keyvanpour, M. R., & Kholghi, M. (2018, August 11). Hybrid of Active Learning and Dynamic Self-Training for Data Stream Classification. International Journal of Information & Communication Technology Research, 9(4), 37-49. Retrieved from http://journal.itrc.ac.ir/index.php/ijictr/article/view/349
Section
Information Technology