Structure and Content Semantic Similarity Detection of EXtensible Markup Language Documents Using Keys

Book Detail

  • Author : Waraporn Viyanon
  • Release Date : 2010
  • Publisher :
  • Genre : XML (Document markup language)
  • Pages : 246
  • ISBN 13 :
  • File Size : 5,5 MB

Structure and Content Semantic Similarity Detection of EXtensible Markup Language Documents Using Keys by Waraporn Viyanon PDF Summary

Book Description: "XML (eXtensible Mark-up Language) has become the fundamental standard for efficient data management and exchange. Due to the widespread use of XML for describing and exchanging data on the web, XML-based comparison is central issues in database management and information retrieval. In fact, although many heterogeneous XML sources have similar content, they may be described using different tag names and structures. This work proposes a series of algorithms for detection of structural and content changes among XML data. The first is an algorithm called XDoI (XML Data Integration Based on Content and Structure Similarity Using Keys) that clusters XML documents into subtrees using leaf-node parents as clustering points. This algorithm matches subtrees using the key concept and compares unmatched subtrees for similarities in both content and structure. The experimental results show that this approach finds much more accurate matches with or without the presence of keys in the subtrees. A second algorithm proposed here is called XDI-CSSK (a system for detecting xml similarity in content and structure using relational database); it eliminates unnecessary clustering points using instance statistics and a taxonomic analyzer. As the number of subtrees to be compared is reduced, the overall execution time is reduced dramatically. Semantic similarity plays a crucial role in precise computational similarity measures. A third algorithm, called XML-SIM (structure and content semantic similarity detection using keys) is based on previous work to detect XML semantic similarity based on structure and content. This algorithm is an improvement over XDI-CSSK and XDoI in that it determines content similarity based on semantic structural similarity. In an experimental evaluation, it outperformed previous approaches in terms of both execution time and false positive rates. 
Information changes periodically; therefore, it is important to be able to detect changes among different versions of an XML document and use that information to identify semantic similarities. Finally, this work introduces an approach to detect XML similarity and thus to join XML document versions using a change detection mechanism. In this approach, subtree keys still play an important role in order to avoid unnecessary subtree comparisons within multiple versions of the same document. Real data sets from bibliographic domains demonstrate the effectiveness of all these algorithms"--Abstract, leaves iv-v.
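
The matching idea described in the abstract (split each document into subtrees at leaf-node parents, match subtrees by key, then compare the unmatched ones for similarity) can be illustrated with a minimal sketch. This is not the thesis's XDoI implementation: the function names, the `key_tag` parameter, the 0.8 threshold, the sample documents, and the use of Python's `difflib.SequenceMatcher` as the content-similarity measure are all assumptions made for illustration, and structural and semantic comparison are left out.

```python
import xml.etree.ElementTree as ET
from difflib import SequenceMatcher


def leaf_parent_subtrees(root):
    """Return every element that has at least one leaf (childless) child."""
    return [el for el in root.iter()
            if any(len(child) == 0 for child in el)]


def leaf_values(subtree):
    """Map each leaf child's tag to its stripped text."""
    return {child.tag: (child.text or "").strip()
            for child in subtree if len(child) == 0}


def content_similarity(a, b):
    """Ratio in [0, 1] over the joined leaf text (assumed measure)."""
    text_a = " ".join(leaf_values(a).values()).lower()
    text_b = " ".join(leaf_values(b).values()).lower()
    return SequenceMatcher(None, text_a, text_b).ratio()


def match_subtrees(doc_a, doc_b, key_tag, threshold=0.8):
    """Match subtrees by key first, then fall back to content similarity."""
    subs_a = leaf_parent_subtrees(ET.fromstring(doc_a))
    subs_b = leaf_parent_subtrees(ET.fromstring(doc_b))

    # Index the second document's subtrees by their key value, if present.
    by_key = {}
    for sub in subs_b:
        key = leaf_values(sub).get(key_tag)
        if key:
            by_key[key] = sub

    matches, unmatched_a, used_b = [], [], set()
    for sub in subs_a:
        key = leaf_values(sub).get(key_tag)
        partner = by_key.get(key) if key else None
        if partner is not None:
            matches.append((sub, partner, "key"))
            used_b.add(id(partner))
        else:
            unmatched_a.append(sub)

    # Only subtrees without a key match are compared by content.
    rest_b = [sub for sub in subs_b if id(sub) not in used_b]
    for sub in unmatched_a:
        best = max(rest_b, key=lambda b: content_similarity(sub, b),
                   default=None)
        if best is not None and content_similarity(sub, best) >= threshold:
            matches.append((sub, best, "similarity"))
    return matches


# Hypothetical sample documents with different tag names and structures.
doc_a = ("<bib><book><isbn>111</isbn><title>Data on the Web</title></book>"
         "<book><title>Introduction to Information Retrieval</title></book>"
         "</bib>")
doc_b = ("<library><item><isbn>111</isbn><name>Data on the Web</name></item>"
         "<item><name>Introduction to Information Retrieval</name></item>"
         "</library>")

matches = match_subtrees(doc_a, doc_b, key_tag="isbn")
print([kind for _, _, kind in matches])  # ['key', 'similarity']
```

Matching on keys first means the pairwise similarity comparison runs only on the subtrees left over, which is the source of the reduction in comparisons that the abstract attributes to subtree keys.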

Data on the Web

File Size : 77,77 MB

Data model. Queries. Types. Systems. A syntax for data. XML. Query languages. Query languages for XML. Interpretation and advanced features. Typing semistructur

Introduction to Information Retrieval

File Size : 6,6 MB

Class-tested and coherent, this textbook teaches classical and web information retrieval, including web search and the related areas of text classification and