PROCESSING AND STORAGE
E.A. Mikrin, S.R. Somov Overview of models and methods to ensure information integrity in a distributed data processing systems
MATHEMATICAL MODELLING AND DATA ANALYSIS
NONLINEAR CONTROL SYSTEMS
PATTERN RECOGNITION
INTELLIGENT SYSTEMS
COMPUTING SYSTEMS AND NETWORKS
АPPLICATION
E.A. Mikrin, S.R. Somov Overview of models and methods to ensure information integrity in a distributed data processing systems

Abstract.

This article provides a brief overview of the formal models and methods used to ensure the safety of information in data processing systems of different scale and purpose. The main attention is paid to methods of ensuring the safety of information in distributed systems operating on the basis of computer networks. The main reasons for occurrence of incidents with data are listed and their influence on business of the companies is described. Considered the data replication method that provides not only the security of data in distributed systems, but also good performance of these systems. An example of replicas placement optimization in GRID-networks is given. Considered the use of formal models and methods of information recovery enhancing data integrity in the system. Also, the concept of using the structural and technological reserve for improving the performance of distributed systems is considered.

Keywords:

data integrity, distributed systems, replication, information recovery, structural and technological reserve.

PP. 5-28.

References

1. Kazarin O.V. Bezopasnost' programmnogo obespechenija komp'juternyh sistem [Security of computer systems software]. –  Moscow. : MGUL, 2003. – 212 p.
2. Vosstanovlenie dannyh [Data recovery]. Available at: http://www.datarecovery.ru/» (accessed November 12, 2016).
3. Frolov A., Frolov G. Sohrannost' i vosstanovlenie komp'juternyh dannyh: teorija i praktika [Computer data safety and   recovery: theory and practice] // Byte Russia - 2001, - vol.1
4. Azzam Sleit и др. A Dynamic Object Fragmentation and Replication Algorithm In Distributed Database Systems//American  Journal of Applied Sciences 4 (8): 613-618, 2007
5. Managing the costs of downtime. Continuous real-time data replication & clustering software: the next level of disaster  recovery. Constant Data, Inc. August 2004. p.-8. Available at: http://costkiller.net/tribune/Tribu-PDF/Managing-the-Cost-of-Downtime.pdf (Accessed February 07, 2017).
6. "2001 Cost of Downtime Online Survey," by Contingency Planning Research, a division of Eagle Rock Alliance, West
Orange, N.J. Aug. 2001. Available at: http://www.eaglerockltd.com/ (Accessed April 10, 2017).
7. Acronis Global Disaster recovery Index. Available at: http://www.acronis.com/ru-ru/pr/ (Accessed November 03, 2016).
8. Data loss, poor recovery looms large. IT-Online on Mar 2, 2015. Available at: http://it-online.co.za/2015/03/02/data-loss-poorrecovery-looms-large/ (Accessed November 08, 2016).
9. Kulba V.V., Tsvirkun A.D. Nekotorye zadachi optimal'nogo rezervirovanija informacionnyh massivov [Certain problems of  optimal reservation of information files]. Automation and Remote Control. 1971, vol.6, p. 92-98.
10. Turksen, I.B., Kulba V.V. File Redundancy in Information Systems. Working Paper #76-015, Department of Industrial  Engineering, University of Toronto. 1976.
11. Turksen, I.B., Kulba V.V. Strategies of File Redundancy in Information Systems. Working Paper #78-013, Department of  Industrial Engineering, University of Toronto. 1978.
12. Kulba V.V. Analiz strategij rezervirovanija informacionnyh massivov v ASU [Analysis of strategies for reserving information
arrays in the automated control system]. - Sb. trudov. Vyp. 14. Metody i modeli planirovanija i upravlenija v diskretnyh  proizvodstvennyh sistemah. [Proceedings Iss. 14. Methods and models of planning and control in discrete production systems].  Moscow, Institute of Control Sciences, 1977, p. 20-32.
13. Eswaran K. P. Placement of records in a file and file allocation in a computer. // IFIP Congress. 1974. P. 304–307.
14. Morgan H. L., Levin K. D. Optimal program and data locations in computer networks // Commun. ACM. 1977. Vol. 20, no.  5. P. 315–322.
15. Kulba V.V., Somov S.K., Shelkov A.B. Rezervirovanie dannyh v setjah JeVM. Kazan' . [Data reservation in computer   networks]. Kazan, Kazan University Publishing House, 1987, - 175 p.
16. Ma Moses, Athans Michael. Optimal File Allocation Problems for Distributed Data Bases in Unreliable Computer Networks//
Massachusetts Inst. of Tech Cambridge Lab for Information and Decision Systems. 1982,- 7 p.
17. Suri R. A decentralized approach to optimal file allocation in computer networks // Decision and Control including the  Symposium on Adaptive Processes, 1979 18th IEEE Conference on. Vol.
18. 1979. dec. P. 141–146.18. Mahmoud S.A., Riordon J.S. Optimal Allocation of Resources in Distributed Information  networks. //ACM Transactions on Database Systems, 1976, Vol.l, N.4, p. 66-78.
19. Fisher M.L., Hochbaum D.S. Database location in computer networks. - Journ. ACM, 1980, Vol. 27, N. 4, p. 718- 735.
20. Coffman E.G., Gelenbe E., Platean B. Optimization of the number of copies in a distributed data base. – IEEE Transactions of Software Eng., 1981, Vol. 7, N. 1, p. 78-84.
21. Mikrin E.A., Somov S.K. Optimal'noe operativnoe rezervirovanie informacii v sistemah obrabotki dannyh na baze  vychislitel'nyh setej [The optimal online information backup in the data processing systems based on computer networks  redundancy] // Control Sciences – 2016. - vol.5. pp. 47-56.
22. Mikrin E.A., Somov S.K. Optimizacija rezervirovanija informacii v raspredelennyh sistemah obrabotki dannyh real'nogo  vremeni [Information reservation optimization in real-time distributed data processing systems] //Control Sciences – 2016. –   vol.6. pp. 47-52.
23. Abdalla H. I. A synchronized design technique for efficient data distribution // Computers in Human Behavior. - 2014, vol. 30, pp. 427–435.
24. Mansouri N. Adaptive data replication strategy in cloud computing for performance improvement // Frontiers of Computer Science (print). - 2016, 10(5), p. 925–935
25. Sahoo J., Salahuddin M.A., Glitho R. A Survey on Replica Server Placement Algorithms for Content Delivery Networks. –
2016. IEEE Communications Surveys & Tutorials, p. 30. Available at:
https://arxiv.org/ftp/arxiv/papers/1611/1611.01729.pdf. (Accessed July 28, 2017).
26. Singh A., Kahlon K. S. Non-replicated dynamic data allocation in distributed database system // International Journal of  Computer Science and Network Security, vol. 9, no. 9, 2009.
27. Kulba V.V., Shelkov A.B. Pelihov V.P. Strategii rezervirovanija informacionnyh massivov [Strategies for reserving  information arrays] // Proceedings. Postroenie avtomatizirovannyh sistem obrabotki dannyh [Construction of automated data  processing systems] Issue. 16. / Institute of Control Sciences. - Moscow., - 1978. - pp. 26-42.
28. Feller V. Vvedenie v teoriju verojatnostej i ee prilozhenija. [An introduction to probability theory and its applications]. Vol.1 –Moscow, Publishing house Mir, 1984.-528 p.
29. Connolly T., Begg C. Bazy dannyh. Proektirovanie, realizacija i soprovozhdenie. Teorija i praktika. 3-e izdanie [Data Base Systems. A Practical Approach to Design, Implementation, and Management. 3 Edition]. - Moscow: Publishing house "William".  -2003. — 1440 p.:
30. Raspedelennye Sistemy. Principy i paradigmy [Distributed Systems. Principles and paradigms]. /A. Tanenbaum, M. Steen. – Moscow, 2003. – 877 p.
31. Aleshin I. Udalennaja replikacija dannyh dlja zashhity informacionnyh resurso [Remote replication of data to protect  information resources] // IKS-online. – 2006. – vol. 5. Available at: http://www.iksmedia.ru/articles/27831-Udalennayareplikaciya-dannyx-dlya.html#ixzz4Qq71Qd00 (Accessed June 24, 2017)
32. Apanasevich D.A. 2008. Matematicheskoe i programmnoe obespechenie asinhronnoj replikacii dannyh reljacionnyh SUBD metodom vydelenija ob#ektov [Mathematical and software of asynchronous data replication of relational DBMS by means of  object selection]. PhD Thesis. – Voronezh: VGTU. 20 p.
33. Alireza Souri, Amir masoud Rahmani. A Survey for Replica Placement Techniques in Data Grid Environment// I.J. Modern Education and Computer Science - 2014, vol.5, 46-51
34. Belousov V.E. 2005. Algoritmy replikacii dannyh v raspredelennyh sistemah obrabotki informacii [Algorithms of data  replication in distributed information processing systems]. PhD Thesis. –Penza: PGU, 2005. 28 p.
35. Lebedev A. Udalennaja replikacija: kriterii vybora [Remote Replication: Selection Criteria] //Storage News - 2006, - vol.2,  pp. 6-9.
36. Chernyshev G.A. Obzor podhodov k organizacii fizicheskogo urovnja v SUBD [Overview of approaches to the organization of the physical layer in the DBMS] // Proceedings of SPIIRAS. - St. Petersburg, 2013. Iss. 1(24). - pp. 222 – 275.
37. Ozsu M. T., Valduriez P. Principles of distributed database systems (2nd ed.). Upper Saddle River, NJ, USA : Prentice-Hall, Inc., 1999.- C. 845.
38. Singh. A., Kahlon S.K., Virk R.S. Nonreplicated Static Data Allocation in Distributed Databases Using Biogeography-Based Optimization// Chinese Journal of Engineering, 2014. p. 1-9.
39. Sahoo J., Salahuddin M.A., Glitho R. A Survey on Replica Server Placement Algorithms for Content Delivery Networks. IEEE Communications Surveys & Tutorials. 2016. p. 30.
40. Alireza Souri, Amir masoud Rahmani. A Survey for Replica Placement Techniques in Data Grid Environment. I.J. Modern Education and Computer Science. V.5. 2014. p.46-51.
41. Foster I., Kesselman C., Tuecke S. The Anatomy of the Grid: Enabling Scalable Virtual Organizations. International J.  Supercomputer Applications, 15(3), 2001. Available at: http://www.globus.org/alliance/publications/papers/anatomy.pdf  (Accessed April 25, 2017).
42. I. Foster, C. Kesselman,. The Grid 2: Blueprint for a New Computing Infrastructure, 2 Edition. 2004, Morgan Kaufmann Publishers, p.748.
43. Rahman R. M., Barker K., Alhajj R. Replica Placement Strategies in Data Grid// J Grid Computing, 2008, pp.103-123.
44. Fisher M.L. The Lagrangian relaxation method for solving integer programming roblems//Management Science, 1981, v27,  pp 1-18.
45. Somov S.K. Optimizacija vychislitel'nyh i informacionnyh zadach v GRID setjah [Optimization of computing and information problems in the GRID networks] / Sbornik dokladov Mezhdunarodnoj nauchnoj konferencii «Problema Regional'nogo i municipal'nogo upravlenija» [Collection of reports of the International Scientific Conference "The Problem of Regional and Municipal Management”]. – Moscow.: RGGU, 2009.pp. 192–196.
46. Somov S.K. 1983. Rezervirovanie programmnyh modulej i informacionnyh massivov v setjah JeVM [Reservation program modules and data arrays in computer networks]: PhD Diss. Moscow, ICS RAS. 217 p.
47. Machmoud S., Riordon J.S. Optimal Allocation of Resources in Distributed Information networks. - ACM Transactions on Database Systems, 1976, Vol.l, N.4, p. 66-78.
48. Chu W.W, File Allocation in a Multiple Computer System. IEEE Transactions on Computers, 1969, Vol. C-18, N. 10, p. 885-889.
49. Casey R.G. Allocations of copies of a file in an Information Network. //AFIPS Conference Proceedings, - 1972, Vol. 40. - P. 617-625.
50. Gurin L.S., Dymarskij Ja.S., Merkulov A.D. Zadachi i metody optimal'nogo raspredelenija resursov [Tasks and methods for optimal allocation of resources]. - Moscow: Sov.radio, 1968, - 463 p.
51. Berzin E.A., Optimal'noe raspredelenie resursov i jelementy sinteza system [Optimal resource allocation and elements of system synthesis.]. - Moscow: Sov.radio,, 1974. - 304 p.
52. TADVISER. Rezervnoe kopirovanie i hranenie dannyh [Backup and storage of data]. Available at: http://www.tadviser.ru/index.php (Accessed Jun 29, 2017).
53. A. Goncharov. Metody zashhity dannyh: obzor reshenij [Data protection methods: review of solutions]. - Storage News, 2006, vol. 2 (27), p. 28-31.
54. S. Verchjonov. KROK: rezervnoe kopirovanie na praktike [CROC: backup in practice]. -Storage News. 2013, vol. 2 (54), p. 12-15.
55. Shelkov A.B., Somov S.K., Korobko V.B. Vosstanovitel'noe rezervirovanie programmnyh modulej i informacionnyh massivov v setjah JeVM [Redundant redundancy of software modules and information arrays in computer networks] // Analiz i sintez optimal'nyh modul'nyh SOD [Analysis and synthesis of optimal modular SOD]: Proceedings of the Institute of Control
Sciences. - Moscow: ICS, 1984.
56. Prosvirnin V.N., Koshelev V.A., Somov S.K. Avtomatizacija soprovozhdenija arhiva magnitnyh nositelej [Automation support archive of magnetic media.]. – In the book, "Computer-aided design and engineering." Second All-Union Conference.
Abstract. – Moscow, Institute of Control Sciences, 1983, pp. 110-111.
57. Mamikonov A.G., Kulba V.V., Natkovich B.U., Shelkov A.B. Vosstanovlenie informacii v sistemah obrabotki dannyh  [Information recovery in data processing systems]. Prepr. – Moscow, ICS RAS, 1988.
58. Mikrin E.A., Shelkov A.B., Pavel'ev V.V. Metody vosstanovlenija dannyh v raspredelennyh avtomatizirovannyh sistemah [Methods of data recovery in distributed automated systems] / Scientific publication – М.: Moscow, ICS RAS, 2009. - 68 p.
59. Kulba V.V., Somov S.K. Povyshenie nadezhnosti funkcionirovanija raspredelennyh SOD metodami rezervirovanija i  vosstanovlenija informacii [Increase of reliability of functioning of distributed SOD by methods of reservation and recovery of  information]. Informatizacija i svjaz' [Information and communication] vol.3, 2016, pp.86-94.
60. Mikrin E.A., Somov S.K. Analiz jeffektivnosti strategij vosstanovlenija informacii v raspredelennyh sistemah obrabotki dannyh [Analysis of the effectiveness of information recovery strategies in distributed data processing systems] -  Informacionnye tehnologii i vychislitel'nye sistemy [Information technology and computer systems] – 2016. - vol.3. pp.5-19
61. Shelkov A.B. Vosstanovitel'noe rezervirovanie informacionnyh massivov v ASU [Restorative backup of information arrays in  the automated control system]. // Collection of works of the ICS RAS, Issue 25. – Moscow.: ICS RAS, 1981, - pp. 112-123.
62. Shhaveljov L.V. Operativnaja analiticheskaja obrabotka dannyh: koncepcii i tehnologii [Operative analytical data processing: concepts and technologies]. Available at: http://www.olap.ru/basic/olap_and_ida.asp (Accessed July 06, 2017).
63. Tumanov V. Data Warehouse: s chego nachat'? [Data Warehouse: where to start] //PC Week, (153)29`1998. Available at:
https://www.itweek.ru/infrastructure/article/detail.php?ID=48156 (Accessed July 06, 2017-07-06).
64. Standen J. Data Warehouse vs Data Mart. Available at: http://www.datamartist.com/data-warehouse-vs-data-mart.  (Accessed July 06, 2017).
65. Karsanidze T.V. Strukturno – tehnologicheskoe rezervirovanie dannyh v sistemah, funkcionirujushhih na baze LVS  [structural and technological redundancy of data in systems operating on the basis of LAN]. – In the book. The 4th All-Union  Meeting on Distributed Computing Queuing Systems. Theses of reports. – Moscow, ICS RAS, 1991.
66. Karsanidze T.V. 1992 - Rezervirovanie, vosstanovlenie i registracija informacii v avtomatizirovannyh sistemah upravlenija, funkcionirujushhih na baze lokal'nyh setej JeVM [Reservation, restoration and registration of information in automated control systems operating on the basis of local computer networks]: PhD Diss. Moscow, ICS RAS, 23 p.
67. Mainica E. Optimization algorithms on networks and graphs. - Moscow: Mir, 1981. - 323 p.
 

 

2018 / 02
2018 / 01
2017 / 04
2017 / 03

© ФИЦ ИУ РАН 2008-2018. Создание сайта "РосИнтернет технологии".