Top-k retrieval using facility location analysis

Guido Zuccon, Leif Azzopardi, Dell Zhang, Jun Wang

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution book

28 Citations (Scopus)

Abstract

The top-k retrieval problem aims to find the optimal set of k documents from a number of relevant documents given the user’s query. The key issue is to balance the relevance and diversity of the top-k search results. In this paper, we address this problem using Facility Location Analysis taken from Operations Research, where the locations of facilities are optimally chosen according to some criteria. We show how this analysis technique is a generalization of state-of-the-art retrieval models for diversification (such as the Modern Portfolio Theory for Information Retrieval), which treat the top-k search results like “obnoxious facilities” that should be dispersed as far as possible from each other. However, Facility Location Analysis suggests that the top-k search results could be treated like “desirable facilities” to be placed as close as possible to their customers. This leads to a new top-k retrieval model where the best representatives of the relevant documents are selected. In a series of experiments conducted on two TREC diversity collections, we show that significant improvements can be made over the current state-of-the-art through this alternative treatment of the top-k retrieval problem.
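The contrast the abstract draws between "obnoxious" and "desirable" facilities can be illustrated with a small sketch. This is not the authors' model: the record does not give their formulation, so the code below assumes two textbook greedy heuristics (a p-median style selection for desirable facilities, and farthest-point dispersion for obnoxious ones), uses toy one-dimensional points in place of documents, and ignores relevance scores entirely. The function names pairwise_distances, greedy_desirable and greedy_obnoxious are hypothetical.

```python
from typing import List, Sequence


def pairwise_distances(points: Sequence[float]) -> List[List[float]]:
    """Absolute-difference distances between 1-D points.

    Assumption: a toy stand-in for document dissimilarity.
    """
    return [[abs(a - b) for b in points] for a in points]


def greedy_desirable(dist: List[List[float]], k: int) -> List[int]:
    """Pick k items so that every item is close to some picked item.

    Greedy p-median heuristic (an assumption, not the paper's method):
    at each step add the candidate that minimises the total distance
    from each item to its nearest selected item.
    """
    n = len(dist)
    selected: List[int] = []
    nearest = [float("inf")] * n  # distance to the closest selected item
    for _ in range(min(k, n)):
        best, best_cost = None, float("inf")
        for c in range(n):
            if c in selected:
                continue
            cost = sum(min(nearest[i], dist[i][c]) for i in range(n))
            if cost < best_cost:
                best, best_cost = c, cost
        selected.append(best)
        nearest = [min(nearest[i], dist[i][best]) for i in range(n)]
    return selected


def greedy_obnoxious(dist: List[List[float]], k: int, seed: int = 0) -> List[int]:
    """Pick k items that are spread as far apart as possible.

    Farthest-point heuristic (an assumption): start from `seed`, taken
    here to be the top-ranked item, then repeatedly add the candidate
    whose minimum distance to the selected items is largest.
    """
    n = len(dist)
    if k <= 0 or n == 0:
        return []
    selected = [seed]
    while len(selected) < min(k, n):
        best, best_gap = None, -1.0
        for c in range(n):
            if c in selected:
                continue
            gap = min(dist[c][s] for s in selected)
            if gap > best_gap:
                best, best_gap = c, gap
        selected.append(best)
    return selected


# Toy data: two clusters of three points and one outlier.
docs = [0, 1, 2, 10, 11, 12, 30]
dist = pairwise_distances(docs)
print(greedy_desirable(dist, 3))  # [3, 1, 6]
print(greedy_obnoxious(dist, 3))  # [0, 6, 5]
```

On this toy data the desirable-facility selection picks indices 3, 1 and 6 (values 10, 1 and 30): one point from each cluster plus the outlier, with the middle point chosen in the second cluster. The dispersion selection picks indices 0, 6 and 5 (values 0, 30 and 12), which maximises spread without regard to how well the rest of the set is represented.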
Language: English
Title of host publication: Proceedings of the 34th European Conference on Advances in Information Retrieval
Place of Publication: Berlin, Heidelberg
Publisher: Springer-Verlag
Pages: 305-316
Number of pages: 12
ISBN (Print): 978-3-642-28996-5
DOI: https://doi.org/10.1007/978-3-642-28997-2_26
Publication status: Published - 2012
Externally published: Yes

Publication series

Name: Lecture Notes in Computer Science
Publisher: Springer-Verlag
Volume: 7224

Fingerprint

portfolio-theory
operations research
information retrieval
diversification
customer
experiment

Keywords

  • information storage
  • database management
  • information retrieval

Cite this

Zuccon, G., Azzopardi, L., Zhang, D., & Wang, J. (2012). Top-k retrieval using facility location analysis. In Proceedings of the 34th European Conference on Advances in Information Retrieval (pp. 305-316). (Lecture Notes in Computer Science; Vol. 7224). Berlin, Heidelberg: Springer-Verlag. https://doi.org/10.1007/978-3-642-28997-2_26
Zuccon, Guido ; Azzopardi, Leif ; Zhang, Dell ; Wang, Jun. / Top-k retrieval using facility location analysis. Proceedings of the 34th European Conference on Advances in Information Retrieval. Berlin, Heidelberg : Springer-Verlag, 2012. pp. 305-316 (Lecture Notes in Computer Science).
@inproceedings{e35e7515294740f49806a111f11eff3a,
title = "Top-k retrieval using facility location analysis",
abstract = "The top-k retrieval problem aims to find the optimal set of k documents from a number of relevant documents given the user’s query. The key issue is to balance the relevance and diversity of the top-k search results. In this paper, we address this problem using Facility Location Analysis taken from Operations Research, where the locations of facilities are optimally chosen according to some criteria. We show how this analysis technique is a generalization of state-of-the-art retrieval models for diversification (such as the Modern Portfolio Theory for Information Retrieval), which treat the top-k search results like “obnoxious facilities” that should be dispersed as far as possible from each other. However, Facility Location Analysis suggests that the top-k search results could be treated like “desirable facilities” to be placed as close as possible to their customers. This leads to a new top-k retrieval model where the best representatives of the relevant documents are selected. In a series of experiments conducted on two TREC diversity collections, we show that significant improvements can be made over the current state-of-the-art through this alternative treatment of the top-k retrieval problem.",
keywords = "information storage, database management, information retrieval",
author = "Guido Zuccon and Leif Azzopardi and Dell Zhang and Jun Wang",
year = "2012",
doi = "10.1007/978-3-642-28997-2_26",
language = "English",
isbn = "978-3-642-28996-5",
series = "Lecture Notes in Computer Science",
publisher = "Springer-Verlag",
pages = "305--316",
booktitle = "Proceedings of the 34th European Conference on Advances in Information Retrieval",

}

Zuccon, G, Azzopardi, L, Zhang, D & Wang, J 2012, Top-k retrieval using facility location analysis. in Proceedings of the 34th European Conference on Advances in Information Retrieval. Lecture Notes in Computer Science, vol. 7224, Springer-Verlag, Berlin, Heidelberg, pp. 305-316. https://doi.org/10.1007/978-3-642-28997-2_26

Top-k retrieval using facility location analysis. / Zuccon, Guido; Azzopardi, Leif; Zhang, Dell; Wang, Jun.

Proceedings of the 34th European Conference on Advances in Information Retrieval. Berlin, Heidelberg : Springer-Verlag, 2012. p. 305-316 (Lecture Notes in Computer Science; Vol. 7224).


TY - GEN

T1 - Top-k retrieval using facility location analysis

AU - Zuccon, Guido

AU - Azzopardi, Leif

AU - Zhang, Dell

AU - Wang, Jun

PY - 2012

Y1 - 2012


AB - The top-k retrieval problem aims to find the optimal set of k documents from a number of relevant documents given the user’s query. The key issue is to balance the relevance and diversity of the top-k search results. In this paper, we address this problem using Facility Location Analysis taken from Operations Research, where the locations of facilities are optimally chosen according to some criteria. We show how this analysis technique is a generalization of state-of-the-art retrieval models for diversification (such as the Modern Portfolio Theory for Information Retrieval), which treat the top-k search results like “obnoxious facilities” that should be dispersed as far as possible from each other. However, Facility Location Analysis suggests that the top-k search results could be treated like “desirable facilities” to be placed as close as possible to their customers. This leads to a new top-k retrieval model where the best representatives of the relevant documents are selected. In a series of experiments conducted on two TREC diversity collections, we show that significant improvements can be made over the current state-of-the-art through this alternative treatment of the top-k retrieval problem.

KW - information storage

KW - database management

KW - information retrieval

U2 - 10.1007/978-3-642-28997-2_26

DO - 10.1007/978-3-642-28997-2_26

M3 - Conference contribution book

SN - 978-3-642-28996-5

T3 - Lecture Notes in Computer Science

SP - 305

EP - 316

BT - Proceedings of the 34th European Conference on Advances in Information Retrieval

PB - Springer-Verlag

CY - Berlin, Heidelberg

ER -

Zuccon G, Azzopardi L, Zhang D, Wang J. Top-k retrieval using facility location analysis. In Proceedings of the 34th European Conference on Advances in Information Retrieval. Berlin, Heidelberg: Springer-Verlag. 2012. p. 305-316. (Lecture Notes in Computer Science). https://doi.org/10.1007/978-3-642-28997-2_26