By a News Reporter-Staff News Editor at Information Technology Newsweekly -- A new study on Information Technology is now available. According to news reporting originating in Thanh Pho Ho Chi Minh, Vietnam, by VerticalNews journalists, research stated, "With the explosive growth of heterogeneous XML sources, data inconsistency has become a serious problem that leads to ineffective business operations and poor decision-making. To address such inconsistency, XML functional dependencies (XFDs) have been proposed to constrain the data integrity of a source."
The news reporters obtained a quote from the research from Vietnam National University, "Unfortunately, existing approaches to XFDs have insufficiently addressed data inconsistency arising from both semantic and structural inconsistencies inherent in heterogeneous XML data sources. This paper proposes a novel approach, called SCAD, to discover anomalies from a given source, which is essential to address prevalent inconsistencies in XML data. Our contribution is twofold. First, we introduce a new type of path and value-based data constraint, called XML Conditional Structural Dependency (XCSD), whereby (i) the paths in XCSD approximately represent groups of similar paths in sources to express constraints on objects with diverse structures; while (ii) the values bound to particular elements express constraints with conditional semantics. XCSD can capture data inconsistency disregarded by XFDs. Second, our proposed SCAD is used to discover XCSDs from a given source. Our approach exploits the semantics of data structures to detect similar paths from the sources, from which a data summary is constructed as an input for the discovery process. This aims to avoid returning redundant data rules due to structural inconsistencies. During the discovery process, SCAD employs semantics hidden in the data values to discover XCSDs. To evaluate our proposed approach, experiments and case studies were conducted on synthetic datasets which contain structural diversity causing XML data inconsistency."
According to the news reporters, the research concluded: "The experimental results show that SCAD can discover more dependencies and the dependencies found convey more meaningful semantics than those of the existing XFDs."
For more information on this research see: Structured content-aware discovery for improving XML data consistency. Information Sciences, 2013;248():168-190. Information Sciences can be contacted at: Elsevier Science Inc, 360 Park Ave South, New York, NY 10010-1710, USA. (Elsevier - www.elsevier.com; Information Sciences - www.elsevier.com/wps/product/cws_home/505730)
Our news correspondents report that additional information may be obtained by contacting L.T.H. Vo, Vietnam Natl Univ, Int Univ, Sch Comp Sci & Engn, Thanh Pho Ho Chi Minh, Vietnam. Additional authors for this research include J.L. Cao, W. Rahayu and H.Q. Nguyen.
Keywords for this news article include: Asia, Vietnam, Thanh Pho Ho Chi Minh, Information Technology
Our reports deliver fact-based news of research and discoveries from around the world. Copyright 2013, NewsRx LLC