Hadoop solution for large data protection
Fìz.-mat. model. ìnf. tehnol. 2021, 33:23-27
DOI:
https://doi.org/10.15407/fmmit2021.33.023Keywords:
Hadoop technology, Data processing, large data, Big Data, data security, information protectionAbstract
Investigated one of large data problems of - providing protection in the process of accumulation and processing. The case of application of Hadoop technology and its latest modification Apache Hadoop 3.3.0 is considered. A solution is proposed with strengthening the protection of processed data, connecting the Apache Knox Gateway, Apache Ranger and Apache Atlas tools. The possibil-ity of using data obtained as a result of the work of local databases, electronic archives, database management systems and individual users is provided. The solution also features the use of a pri-vate cloud and cryptographic algorithms. An example of the implementation of a secure solution to the problem of Intelligent Data Analysis is given on the example of a parallel version of the problem of finding association rules when working with unstructured data of large volumes.
References- The 2020 Data Attack Surface Report, Arcserve, 2 Dec 2020
- Maslova, N., Fedorko, M. (2018). Features of Big Data Protection, January 2018, URL https://www.researchgate.net/publication/328821496 FEATURES_OF_BIG_DATA_PROTECTION DOI: 10.31474/1996-1588-2018-1-26-41-47
- The Apache Software Foundation. Apache Hadoop, 28 July, 2020, URL https://blogs.apache.org/hadoop/entry/announce-apache-hadoop-3-3
DOI https://doi.org/10.1002/9781119281320.ch7 - Polovynka, O., Dmitrieva, O. (2020). Research of the efficiency of the apriori group algorithms on different database sizes. - ScientificWorldJournal Issue No6 Part 1 December 2020.