DUPLICATE DETECTION IN XML DATA USING BAYESIAN NETWORK AND NETWORK PRUNING STRATEGY

Authors

  • Ms. Trupti A. Patil CSE Dept, ADCET, Ashta, Sangli, Maharashtra
  • Siddheshwar V. Patil IT, Dept BIGCE, Kegaon Solapur, Maharashtra
  • Ms. Swapnali G. Patil IT Dept, ADCET, Ashta Sangli, Maharashtra

Keywords:

Data Duplication,, Bayesian Network (BN),, XML Duplicate (XMLDup)

Abstract

Data Duplication causes excess use of storage, excess time and inconsistency. Duplicate detection will help to make sure that accurate data is displayed by identifying and preventing identical or parallel records. On identifying duplicates in relational data, an extensive work has been done so far. But only minor solutions are focused on duplicate detection in additional complex hierarchical structures, like XML data. Hierarchical data means a set of data items that are related to each other by hierarchical relationships such as XML. In the world of XML, no automatically consistent and clearly defined structures like tables are available.

Downloads

Published

2021-03-27

Issue

Section

Articles