Big Data Mining Techniques In IoT, Challenges and Architectures
Abstract
Nowadays, data is globally viewed as the most valuable resource, and the Internet of Things (IoT) has been playing an essential role ever since the time it emerged. Modern data sets are so complicated that cannot be handled by traditional software and hardware. Three major characteristics of the present time generated data are volume, velocity and variety, which have resulted in developing a concept called big data. Such characteristics have turned the routines of receiving, storing, processing, analyzing, and visualizing big data into a challenging issue. In the current competitive world, analyzing big data is critically important. The significance of big data doesn’t refer to the amount of data which is accessed by a company or organization, rather it depends on how the question data is used. Processing and analyzing the collected data helps enterprises to gain the due insights and benefit from them compatible with strategic decisions. Over the past few years, some novel frameworks and tools have been presented for storing, processing and analyzing big data so that their relevant know-how and thus, working with such large-scale data can provide the specialists in this field with various research areas and job opportunities. This paper has addressed big data in the IoT in which the issues about data mining architectures have been discussed. One of the prominent architectures raised in this field is the IoT-based multi-layered data mining model which is divided into four layers: data collection, data management, event management, and data processing service. Another architecture considered in this paper is the distributed data mining model whose main goal is pre-processing the distributed data before being submitted to the central receiver (core infrastructure) in order to reduce energy consumption in the central nodes. Grid-based data mining infrastructure in the IoT pursuing the objective to focus on the strategies to increase portability and situational awareness is another model which has been dealt with in the references. Another architecture is the data mining model that predicts the integration of several technologies in the IoT. In this architecture, the integration of several technologies including cloud computing, grid computing, pervasive networks, secure networks, etc. are dealt with. Another architecture under discussion is the IoT-based data mining on the cloud computing platform whose objective is to integrate the layers of extraction, management, exploration, and interpretation. Finally, a three-dimensional five-layered architecture was proposed for mining big data in the IoT, the main idea of which is a three-dimensional device-layer architecture consisting of the device layer, raw data layer, data collection layer, data processing layer, and service layer. The other two dimensions cover the issues like complying with the standards, security and privacy, data management, data perception, and data interpretation.
Keywords:
Data mining, Big data, Internet of things, Data mining architectureReferences
- [1] Venu, N., Kumar, A., & Vaigandla, K. K. (2022). Review of internet of things (IoT) for future generation wireless communications. International journal for modern trends in science and technology, 8(3), 1–8. https://papers.ssrn.com/sol3/papers.cfm?abstract_id=4232170
- [2] Zhong, Y., Chen, L., Dan, C., & Rezaeipanah, A. (2022). A systematic survey of data mining and big data analysis in internet of things. The journal of supercomputing, 78(15), 18405–18453. http://dx.doi.org/10.1007/s11227-022-04594-1
- [3] Sunhare, P., Chowdhary, R. R., & Chattopadhyay, M. K. (2022). Internet of things and data mining: An application oriented survey. Journal of king saud university-computer and information sciences, 34(6), 3569–3590. https://doi.org/10.1016/j.jksuci.2020.07.002
- [4] Bi, Z., Jin, Y., Maropoulos, P., Zhang, W. J., & Wang, L. (2023). Internet of things (IoT) and big data analytics (BDA) for digital manufacturing (DM). International journal of production research, 61(12), 4004–4021. http://dx.doi.org/10.1080/00207543.2021.1953181
- [5] Qi, Q., Xu, Z., & Rani, P. (2023). Big data analytics challenges to implementing the intelligent Industrial Internet of Things (IIoT) systems in sustainable manufacturing operations. Technological forecasting and social change, 190, 122401. http://dx.doi.org/10.1016/j.techfore.2023.122401
- [6] Brohi, S., Marjani, M., Hashem, I., Ramiah Pillai, T., Kaur, S., & Amalathas, S. (2019). A data science methodology for internet-of-things. In Emerging technologies in computing (pp. 178–186). http://dx.doi.org/10.1007/978-3-030-23943-5_13
- [7] Shirvanian, N., Shams, M., & Rahmani, A. M. (2022). Internet of things data management: A systematic literature review, vision, and future trends. International journal of communication systems, 35(14), e5267. https://doi.org/10.1002/dac.5267
- [8] Li, X., Liu, H., Wang, W., Zheng, Y., Lv, H., & Lyu, Z. (2022). Big data analysis of the Internet of Things in the digital twins of smart city based on deep learning. Future generation computer systems, 128(10), 167–177. http://dx.doi.org/10.1016/j.future.2021.10.006
- [9] Ali, A., Hussain, T., Tantashutikun, N., Hussain, N., & Cocetta, G. (2023). Application of smart techniques, internet of things and data mining for resource use efficient and sustainable crop production. Agriculture, 13(2), 1–22. https://doi.org/10.3390/agriculture13020397
- [10] Zhang, H., & Yuan, S. (2023). How and when does big data analytics capability boost innovation performance? Sustainability, 15(5), 1–19. https://doi.org/10.3390/su15054036
- [11] Abughazala, M., & Muccini, H. (2023). Modeling data analytics architecture for data-driven IoT applications using DAT. IEEE. http://dx.doi.org/10.1109/ICSA-C57050.2023.00066
- [12] Yang, H., Zhou, L., Cai, J., Shi, C., Yang, Y., Zhao, X., … & Yin, X. (2022). Data mining techniques on astronomical spectra data. II : Classification analysis, 518(4), 5904–5928. https://doi.org/10.1093/mnras/stac3292
- [13] Alrehaili, G., Galam, N., Alawad, R., & Albraheem, L. (2023). Cloud-Based big data analytics on IoT applications. IEEE. http://dx.doi.org/10.1109/ITIKD56332.2023.10100150
- [14] Finogeev, A. G., Parygin, D. S., & Finogeev, A. A. (2017). The convergence computing model for big sensor data mining and knowledge discovery. Human-centric computing and information sciences, 7(1), 1–11. https://doi.org/10.1186/s13673-017-0092-7