Hadoop solution for large data protection

Fìz.-mat. model. ìnf. tehnol. 2021, 33:23-27

Authors

  • Nataliya Maslova Donetsk National Technical University, 2 Shibankova Square, Pokrovsk, Ukraine
  • Olha Polovynka Donetsk National Technical University, 2 Shibankova Square, Pokrovsk, Ukraine

DOI:

https://doi.org/10.15407/fmmit2021.33.023

Keywords:

Hadoop technology, Data processing, large data, Big Data, data security, information protection

Abstract

Investigated one of large data problems of - providing protection in the process of accumulation and processing. The case of application of Hadoop technology and its latest modification Apache Hadoop 3.3.0 is considered. A solution is proposed with strengthening the protection of processed data, connecting the Apache Knox Gateway, Apache Ranger and Apache Atlas tools. The possibil-ity of using data obtained as a result of the work of local databases, electronic archives, database management systems and individual users is provided. The solution also features the use of a pri-vate cloud and cryptographic algorithms. An example of the implementation of a secure solution to the problem of Intelligent Data Analysis is given on the example of a parallel version of the problem of finding association rules when working with unstructured data of large volumes.

References
  1. The 2020 Data Attack Surface Report, Arcserve, 2 Dec 2020
  2. Maslova, N., Fedorko, M. (2018). Features of Big Data Protection, January 2018, URL https://www.researchgate.net/publication/328821496 FEATURES_OF_BIG_DATA_PROTECTION DOI: 10.31474/1996-1588-2018-1-26-41-47
  3. The Apache Software Foundation. Apache Hadoop, 28 July, 2020, URL https://blogs.apache.org/hadoop/entry/announce-apache-hadoop-3-3
    DOI https://doi.org/10.1002/9781119281320.ch7
  4. Polovynka, O., Dmitrieva, O. (2020). Research of the efficiency of the apriori group algorithms on different database sizes. - ScientificWorldJournal Issue No6 Part 1 December 2020.

Published

2021-09-02

How to Cite

Maslova, N., & Polovynka, O. (2021). Hadoop solution for large data protection: Fìz.-mat. model. ìnf. tehnol. 2021, 33:23-27. PHYSICO-MATHEMATICAL MODELLING AND INFORMATIONAL TECHNOLOGIES, (33), 23–27. https://doi.org/10.15407/fmmit2021.33.023