Adaptive query-based sampling of distributed collections

M. Baillie, L. Azzopardi, F. Crestani

Research output: Contribution to conferencePaper

13 Citations (Scopus)
19 Downloads (Pure)

Abstract

As part of a Distributed Information Retrieval system a de-scription of each remote information resource, archive or repository is usually stored centrally in order to facilitate resource selection. The ac-quisition ofprecise resourcedescriptionsistherefore animportantphase in Distributed Information Retrieval, as the quality of such represen-tations will impact on selection accuracy, and ultimately retrieval per-formance. While Query-Based Sampling is currently used for content discovery of uncooperative resources, the application of this technique is dependent upon heuristic guidelines to determine when a sufficiently accurate representation of each remote resource has been obtained. In this paper we address this shortcoming by using the Predictive Likelihood to provide both an indication of thequality of an acquired resource description estimate, and when a sufficiently good representation of a resource hasbeen obtained during Query-Based Sampling.
Original languageEnglish
Pages316-328
Number of pages12
Publication statusPublished - 2006
Event13th Symposium on String Processing and Information Retrieval (SPIRE 2006) - Glasgow, UK
Duration: 11 Oct 200613 Oct 2006

Conference

Conference13th Symposium on String Processing and Information Retrieval (SPIRE 2006)
CityGlasgow, UK
Period11/10/0613/10/06

Fingerprint

Sampling
Information retrieval systems
Information retrieval

Keywords

  • distributed information retrieval
  • query-based sampling
  • searching

Cite this

Baillie, M., Azzopardi, L., & Crestani, F. (2006). Adaptive query-based sampling of distributed collections. 316-328. Paper presented at 13th Symposium on String Processing and Information Retrieval (SPIRE 2006), Glasgow, UK, .
Baillie, M. ; Azzopardi, L. ; Crestani, F. / Adaptive query-based sampling of distributed collections. Paper presented at 13th Symposium on String Processing and Information Retrieval (SPIRE 2006), Glasgow, UK, .12 p.
@conference{65a55dc980b1447d952a2c86a61f034d,
title = "Adaptive query-based sampling of distributed collections",
abstract = "As part of a Distributed Information Retrieval system a de-scription of each remote information resource, archive or repository is usually stored centrally in order to facilitate resource selection. The ac-quisition ofprecise resourcedescriptionsistherefore animportantphase in Distributed Information Retrieval, as the quality of such represen-tations will impact on selection accuracy, and ultimately retrieval per-formance. While Query-Based Sampling is currently used for content discovery of uncooperative resources, the application of this technique is dependent upon heuristic guidelines to determine when a sufficiently accurate representation of each remote resource has been obtained. In this paper we address this shortcoming by using the Predictive Likelihood to provide both an indication of thequality of an acquired resource description estimate, and when a sufficiently good representation of a resource hasbeen obtained during Query-Based Sampling.",
keywords = "distributed information retrieval, query-based sampling, searching",
author = "M. Baillie and L. Azzopardi and F. Crestani",
year = "2006",
language = "English",
pages = "316--328",
note = "13th Symposium on String Processing and Information Retrieval (SPIRE 2006) ; Conference date: 11-10-2006 Through 13-10-2006",

}

Baillie, M, Azzopardi, L & Crestani, F 2006, 'Adaptive query-based sampling of distributed collections' Paper presented at 13th Symposium on String Processing and Information Retrieval (SPIRE 2006), Glasgow, UK, 11/10/06 - 13/10/06, pp. 316-328.

Adaptive query-based sampling of distributed collections. / Baillie, M.; Azzopardi, L.; Crestani, F.

2006. 316-328 Paper presented at 13th Symposium on String Processing and Information Retrieval (SPIRE 2006), Glasgow, UK, .

Research output: Contribution to conferencePaper

TY - CONF

T1 - Adaptive query-based sampling of distributed collections

AU - Baillie, M.

AU - Azzopardi, L.

AU - Crestani, F.

PY - 2006

Y1 - 2006

N2 - As part of a Distributed Information Retrieval system a de-scription of each remote information resource, archive or repository is usually stored centrally in order to facilitate resource selection. The ac-quisition ofprecise resourcedescriptionsistherefore animportantphase in Distributed Information Retrieval, as the quality of such represen-tations will impact on selection accuracy, and ultimately retrieval per-formance. While Query-Based Sampling is currently used for content discovery of uncooperative resources, the application of this technique is dependent upon heuristic guidelines to determine when a sufficiently accurate representation of each remote resource has been obtained. In this paper we address this shortcoming by using the Predictive Likelihood to provide both an indication of thequality of an acquired resource description estimate, and when a sufficiently good representation of a resource hasbeen obtained during Query-Based Sampling.

AB - As part of a Distributed Information Retrieval system a de-scription of each remote information resource, archive or repository is usually stored centrally in order to facilitate resource selection. The ac-quisition ofprecise resourcedescriptionsistherefore animportantphase in Distributed Information Retrieval, as the quality of such represen-tations will impact on selection accuracy, and ultimately retrieval per-formance. While Query-Based Sampling is currently used for content discovery of uncooperative resources, the application of this technique is dependent upon heuristic guidelines to determine when a sufficiently accurate representation of each remote resource has been obtained. In this paper we address this shortcoming by using the Predictive Likelihood to provide both an indication of thequality of an acquired resource description estimate, and when a sufficiently good representation of a resource hasbeen obtained during Query-Based Sampling.

KW - distributed information retrieval

KW - query-based sampling

KW - searching

M3 - Paper

SP - 316

EP - 328

ER -

Baillie M, Azzopardi L, Crestani F. Adaptive query-based sampling of distributed collections. 2006. Paper presented at 13th Symposium on String Processing and Information Retrieval (SPIRE 2006), Glasgow, UK, .