ISSN: 2524-0714

Please use this identifier to cite or link to this item: http://elar.tsatu.edu.ua/handle/123456789/6426
Title: Use of ontologies for metadata records analysis in big data
Authors: Rogushina, Julia
Gladun, Anatoly
Pryima, Serhii
Прийма, Сергій Миколайович
Прийма, Сергей Николаевич
Keywords: Big Data;metadata;domain ontology;thesaurus;natural language text;homonymy;multimedia data;standard
Issue Date: 2018
Publisher: Institute for Information Recording of the National Academy of Sciences of Ukraine
Series/Report no.: Information Technologies and Security : 18th International Scientific and Practical Conference - ITS 2018 (Ukraine; Kyiv, 27 November 2018);Vol. 2318 (Р. 46-63)
Abstract: Big Data deals with the sets of information (structured, unstructured, or semi structured) so large that traditional ways and approaches (based on business intelligence decisions and database management systems) cannot be applied to them. Big Data is characterized by phenomenal acceleration of data accumulation and its complication. In different contexts Big Data often means both data of large volume and a set of tools and methods for their processing. Big Data sets are accompanied by metadata which contains a large amount of information about the data, including significant descriptive text information whose understanding by machines lead to better results of Big Data processing. Methods of artificial intelligence and intelligent Web-technologies improve the efficiency of all stages of Big Data processing. Most often this integration concerns the use of machine learning that provides the knowledge acquisition from Big Data and ontological analysis that formalizes for domain knowledge for Big Data analysis. In the paper, the authors present a method for analyzing the Big Data metadata which allows selecting those blocks of information among the heterogeneous sources and data repositories that are pertinent for solving the customer task. Much attention is paid to the matching of the text part of the metadata (metadata annotations) with the text describing the task. We suggest to use for these purposes the methods and instruments of natural language analysis and the Big Data ontology which contains knowledge about the specifics of this domain.
URI: http://elar.tsatu.edu.ua/handle/123456789/6426
Appears in Collections:Кафедра Комп'ютерні науки

Files in This Item:
File Description SizeFormat 
2635.pdf680.21 kBAdobe PDFThumbnail
Show full item record ???jsp.display-item.check???

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.