Experiments with document archive size detection

S. Wu, F. Gibb, F. Crestani, F. Sebastiani (Editor)

Research output: Chapter in Book/Report/Conference proceeding › Chapter

8 Citations (Scopus)

Abstract

The size of a document archive is a very important parameter for resource selection in distributed information retrieval systems. In this paper, we present a method for automatically detecting the size (i.e., the number of documents) of a document archive when the archive itself does not provide this information. We also present a method for detecting incremental changes in the archive size, which can be useful for deciding whether a resource description has become obsolete and needs to be regenerated. An experimental evaluation shows that both methods provide quite accurate information.
Language: English
Title of host publication: Advances in information retrieval : 25th European Conference on IR Research, ECIR 2003, Pisa, Italy, April 14-16, 2003 : proceedings
Place of Publication: Berlin, Germany
Publisher: Springer
Pages: 294-304
Number of pages: 10
Volume: 2633
ISBN (Print): 3540012745
Publication status: Published - Apr 2003

Publication series

Name: Lecture Notes in Computer Science
Publisher: Springer

Keywords

  • information retrieval systems
  • document archive

Cite this

Wu, S., Gibb, F., & Crestani, F. (2003). Experiments with document archive size detection. In F. Sebastiani (Ed.), Advances in information retrieval : 25th European Conference on IR Research, ECIR 2003, Pisa, Italy, April 14-16, 2003 : proceedings (Vol. 2633, pp. 294-304). (Lecture Notes in Computer Science). Berlin, Germany: Springer.
@inbook{bd90921daadb4a9b8e28db995054b49a,
title = "Experiments with document archive size detection",
abstract = "The size of a document archive is a very important parameter for resource selection in distributed information retrieval systems. In this paper, we present a method for automatically detecting the size (i.e., the number of documents) of a document archive when the archive itself does not provide this information. We also present a method for detecting incremental changes in the archive size, which can be useful for deciding whether a resource description has become obsolete and needs to be regenerated. An experimental evaluation shows that both methods provide quite accurate information.",
keywords = "information retrieval systems, document archive",
author = "S. Wu and F. Gibb and F. Crestani",
editor = "F. Sebastiani",
year = "2003",
month = "4",
language = "English",
isbn = "3540012745",
volume = "2633",
series = "Lecture Notes in Computer Science",
publisher = "Springer",
pages = "294--304",
booktitle = "Advances in information retrieval : 25th European Conference on IR Research, ECIR 2003, Pisa, Italy, April 14-16, 2003 : proceedings",

}

TY  - CHAP
T1  - Experiments with document archive size detection
AU  - Wu, S.
AU  - Gibb, F.
AU  - Crestani, F.
A2  - Sebastiani, F.
PY  - 2003/4
Y1  - 2003/4
AB  - The size of a document archive is a very important parameter for resource selection in distributed information retrieval systems. In this paper, we present a method for automatically detecting the size (i.e., the number of documents) of a document archive when the archive itself does not provide this information. We also present a method for detecting incremental changes in the archive size, which can be useful for deciding whether a resource description has become obsolete and needs to be regenerated. An experimental evaluation shows that both methods provide quite accurate information.
KW  - information retrieval systems
KW  - document archive
UR  - http://www.cis.strath.ac.uk/research/publications/papers/strath_cis_publication_37.pdf
M3  - Chapter
SN  - 3540012745
VL  - 2633
T3  - Lecture Notes in Computer Science
SP  - 294
EP  - 304
BT  - Advances in information retrieval : 25th European Conference on IR Research, ECIR 2003, Pisa, Italy, April 14-16, 2003 : proceedings
PB  - Springer
CY  - Berlin, Germany
ER  - 
