Xml: Study of Controlling Overlap

-

Authors

  • Amit Dhamija Manav Bharti University Author

Keywords:

XML, ranking techniques, retrieval, redundant content, element retrieval, topic-focused components, test collection, INEX

Abstract

The direct application of standard ranking techniques to retrieveindividual elements from a collection of XML documents often produces a resultset in which the top ranks are dominated by a large number of elements takenfrom a small number of highly relevant documents. This paper presents andevaluates an algorithm that re-ranks this result set, with the aim ofminimizing redundant content while preserving the benefits of elementretrieval, including the benefit of identifying topic-focused componentscontained within relevant documents. The test collection developed by theInitiative for the Evaluation of XML Retrieval (INEX) forms the basis for theevaluation.

Downloads

Download data is not yet available.

Downloads

Published

2012-02-01