Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/70168
Title: Structural similarity between XML documents and DTDs
Authors: Ng, PKL
Ng, VTY 
Issue Date: 2003
Publisher: Springer
Source: Lecture notes in computer science (including subseries Lecture notes in artificial intelligence and lecture notes in bioinformatics), 2003, v. 2659, p. 412-421 How to cite?
Journal: Lecture notes in computer science (including subseries Lecture notes in artificial intelligence and lecture notes in bioinformatics) 
Abstract: The use of XML documents in the Internet continues to grow. Need for the analysis of XML documents from heterogeneous sources is arisen, in which documents would conform to different DTDs. In this paper, we propose a measure on the structural similarity among XML documents and DTDs, which is natural to understand and fast to calculate. The measure is defined as a weighted sum of the local measures of document elements with a weighting scheme based on their subtree sizes. While the local measure of an element is defined as its edit distance against its declaration, viewed as regular expression, in the DTD. Based on our definition, an algorithm for edit distance calculation between a string and a regular expression is proposed, which is modified from the algorithm applied in the regular expression matching problem. The advantage of the measure comes with its natural definition and linear complexity.
Description: The International Conference on Computational Science, ICCS 2003, Melbourne, Australia and St. Petersburg, Russia, June 2 - 4, 2003
URI: http://hdl.handle.net/10397/70168
ISBN: 978-3-540-40196-4
978-3-540-44863-1
ISSN: 0302-9743
EISSN: 1611-3349
DOI: 10.1007/3-540-44863-2_41
Appears in Collections:Conference Paper

Access
View full-text via PolyU eLinks SFX Query
Show full item record

Google ScholarTM

Check

Altmetric



Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.