Retrieving information chunks from a repository of documents SIT Collected from heterogeneous sources

Authors

  • Raad Alwan Associat Prof., Dept. of Computer Science, Philadelphia University
  • Baydaa Al-Hamadani Asistant prof., . of Computer Science, Zarqa University

DOI:

https://doi.org/10.24297/ijct.v14i3.1999

Keywords:

Heterogeneous resources, data integrity, query relaxation, XML retrieval

Abstract

XML documents are generated from heterogeneous resources. They may share the same data but in different Schema, which make it difficult to retrieve information from them. In this paper we propose a new technique that first; minimizes the size of the XML documents by reducing the redundancy of the structure part and generate the repository for these documents, and second; relaxes and decomposes the XPath query in two stages to determine the relevant documents and the relevant part within these documents. The results show significant precision and recall comparing with the exact XPath queries.

Downloads

Download data is not yet available.

Downloads

Published

2015-01-03

Issue

Section

Research Articles

How to Cite

Retrieving information chunks from a repository of documents SIT Collected from heterogeneous sources. (2015). INTERNATIONAL JOURNAL OF COMPUTERS & TECHNOLOGY, 14(3), 5558-5568. https://doi.org/10.24297/ijct.v14i3.1999

Similar Articles

1-10 of 140

You may also start an advanced similarity search for this article.