A Short Survey of Document Structure Similarity Algorithms

preview-18
  • A Short Survey of Document Structure Similarity Algorithms Book Detail

  • Author : D. Buttler
  • Release Date : 2004
  • Publisher :
  • Genre :
  • Pages : 9
  • ISBN 13 :
  • File Size : 32,32 MB

A Short Survey of Document Structure Similarity Algorithms by D. Buttler PDF Summary

Book Description: This paper provides a brief survey of document structural similarity algorithms, including the optimal Tree Edit Distance algorithm and various approximation algorithms. The approximation algorithms include the simple weighted tag similarity algorithm, Fourier transforms of the structure, and a new application of the shingle technique to structural similarity. We show three surprising results. First, the Fourier transform technique proves to be the least accurate of any of approximation algorithms, while also being slowest. Second, optimal Tree Edit Distance algorithms may not be the best technique for clustering pages from different sites. Third, the simplest approximation to structure may be the most effective and efficient mechanism for many applications.

Disclaimer: www.yourbookbest.com does not own A Short Survey of Document Structure Similarity Algorithms books pdf, neither created or scanned. We just provide the link that is already available on the internet, public domain and in Google Drive. If any way it violates the law or has any issues, then kindly mail us via contact us page to request the removal of the link.

File Size : 67,67 MB
Total View : 7048 Views
DOWNLOAD

Similarity Search and Applications

Similarity Search and Applications

File Size : 73,73 MB
Total View : 3858 Views
DOWNLOAD

This book constitutes the refereed proceedings of the 12th International Conference on Similarity Search and Applications, SISAP 2019, held in Newark, NJ, USA,

Database and Expert Systems Applications

Database and Expert Systems Applications

File Size : 11,11 MB
Total View : 1059 Views
DOWNLOAD

This volume constitutes the refereed proceedings of the 18th International Conference on Database and Expert Systems Applications held in September 2007. Papers