Experiments with document archive size detection

S. Wu, F. Gibb, F. Crestani, F. Sebastiani (Editor)

Research output: Chapter in Book/Report/Conference proceedingChapter

8 Citations (Scopus)
23 Downloads (Pure)


The size of a document archive is a very important parameter for resource selection in distributed information retrieval systems. In this paper, we present a method for automatically detecting the size (ie the number of documents) of a document archive, in case the archive itself does not provide such information. In addition, a method for detecting incremental change of the archive size is also presented, which can be useful for deciding if a resource description has become obsolete and needs to be regenerated. An experimental evaluation of these methods shows that they provide quite acurate information.
Original languageEnglish
Title of host publicationAdvances in information retrieval
Subtitle of host publication25th European Conference on IR Research, ECIR 2003, Pisa, Italy, April 14-16, 2003 : proceedings
Place of PublicationBerlin, Germany
Number of pages10
ISBN (Print)3540012745
Publication statusPublished - 14 Apr 2003

Publication series

NameLecture notes on computer science


  • information retrieval systems
  • document archive
  • archive size


Dive into the research topics of 'Experiments with document archive size detection'. Together they form a unique fingerprint.

Cite this