Broad expertise retrieval in sparse data environments

Krisztian Balog, Toine Bogers, Leif Azzopardi, Maarten de Rijke, Antal van den Bosch

Research output: Chapter in Book/Report/Conference proceedingConference contribution book

95 Citations (Scopus)

Abstract

Expertise retrieval has been largely unexplored on data other than the W3C collection. At the same time, many intranets of universities and other knowledge-intensive organisations offer examples of relatively small but clean multilingual expertise data, covering broad ranges of expertise areas. We first present two main expertise retrieval tasks, along with a set of baseline approaches based on generative language modeling, aimed at finding expertise relations between topics and people. For our experimental evaluation, we introduce (and release) a new test set based on a crawl of a university site. Using this test set, we conduct two series of experiments. The first is aimed at determining the effectiveness of baseline expertise retrieval methods applied to the new test set. The second is aimed at assessing refined models that exploit characteristic features of the new test set, such as the organizational structure of the university, and the hierarchical structure of the topics in the test set. Expertise retrieval models are shown to be robust with respect to environments smaller than the W3C collection, and current techniques appear to be generalizable to other settings.
LanguageEnglish
Title of host publicationSIGIR '07 Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval
Place of PublicationNew York, NY, USA
Pages551-558
Number of pages8
DOIs
Publication statusPublished - 23 Jul 2007

Fingerprint

expertise
university
Intranet
organizational structure
experiment
language
evaluation

Keywords

  • expert finding
  • language models
  • intranet search
  • expertise search

Cite this

Balog, K., Bogers, T., Azzopardi, L., de Rijke, M., & van den Bosch, A. (2007). Broad expertise retrieval in sparse data environments. In SIGIR '07 Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (pp. 551-558). New York, NY, USA. https://doi.org/10.1145/1277741.1277836
Balog, Krisztian ; Bogers, Toine ; Azzopardi, Leif ; de Rijke, Maarten ; van den Bosch, Antal. / Broad expertise retrieval in sparse data environments. SIGIR '07 Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. New York, NY, USA, 2007. pp. 551-558
@inproceedings{ff57f64cf4b746be9c513ef55a8960a6,
title = "Broad expertise retrieval in sparse data environments",
abstract = "Expertise retrieval has been largely unexplored on data other than the W3C collection. At the same time, many intranets of universities and other knowledge-intensive organisations offer examples of relatively small but clean multilingual expertise data, covering broad ranges of expertise areas. We first present two main expertise retrieval tasks, along with a set of baseline approaches based on generative language modeling, aimed at finding expertise relations between topics and people. For our experimental evaluation, we introduce (and release) a new test set based on a crawl of a university site. Using this test set, we conduct two series of experiments. The first is aimed at determining the effectiveness of baseline expertise retrieval methods applied to the new test set. The second is aimed at assessing refined models that exploit characteristic features of the new test set, such as the organizational structure of the university, and the hierarchical structure of the topics in the test set. Expertise retrieval models are shown to be robust with respect to environments smaller than the W3C collection, and current techniques appear to be generalizable to other settings.",
keywords = "expert finding, language models, intranet search, expertise search",
author = "Krisztian Balog and Toine Bogers and Leif Azzopardi and {de Rijke}, Maarten and {van den Bosch}, Antal",
year = "2007",
month = "7",
day = "23",
doi = "10.1145/1277741.1277836",
language = "English",
isbn = "978-1-59593-597-7",
pages = "551--558",
booktitle = "SIGIR '07 Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval",

}

Balog, K, Bogers, T, Azzopardi, L, de Rijke, M & van den Bosch, A 2007, Broad expertise retrieval in sparse data environments. in SIGIR '07 Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. New York, NY, USA, pp. 551-558. https://doi.org/10.1145/1277741.1277836

Broad expertise retrieval in sparse data environments. / Balog, Krisztian; Bogers, Toine; Azzopardi, Leif; de Rijke, Maarten; van den Bosch, Antal.

SIGIR '07 Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. New York, NY, USA, 2007. p. 551-558.

Research output: Chapter in Book/Report/Conference proceedingConference contribution book

TY - GEN

T1 - Broad expertise retrieval in sparse data environments

AU - Balog, Krisztian

AU - Bogers, Toine

AU - Azzopardi, Leif

AU - de Rijke, Maarten

AU - van den Bosch, Antal

PY - 2007/7/23

Y1 - 2007/7/23

N2 - Expertise retrieval has been largely unexplored on data other than the W3C collection. At the same time, many intranets of universities and other knowledge-intensive organisations offer examples of relatively small but clean multilingual expertise data, covering broad ranges of expertise areas. We first present two main expertise retrieval tasks, along with a set of baseline approaches based on generative language modeling, aimed at finding expertise relations between topics and people. For our experimental evaluation, we introduce (and release) a new test set based on a crawl of a university site. Using this test set, we conduct two series of experiments. The first is aimed at determining the effectiveness of baseline expertise retrieval methods applied to the new test set. The second is aimed at assessing refined models that exploit characteristic features of the new test set, such as the organizational structure of the university, and the hierarchical structure of the topics in the test set. Expertise retrieval models are shown to be robust with respect to environments smaller than the W3C collection, and current techniques appear to be generalizable to other settings.

AB - Expertise retrieval has been largely unexplored on data other than the W3C collection. At the same time, many intranets of universities and other knowledge-intensive organisations offer examples of relatively small but clean multilingual expertise data, covering broad ranges of expertise areas. We first present two main expertise retrieval tasks, along with a set of baseline approaches based on generative language modeling, aimed at finding expertise relations between topics and people. For our experimental evaluation, we introduce (and release) a new test set based on a crawl of a university site. Using this test set, we conduct two series of experiments. The first is aimed at determining the effectiveness of baseline expertise retrieval methods applied to the new test set. The second is aimed at assessing refined models that exploit characteristic features of the new test set, such as the organizational structure of the university, and the hierarchical structure of the topics in the test set. Expertise retrieval models are shown to be robust with respect to environments smaller than the W3C collection, and current techniques appear to be generalizable to other settings.

KW - expert finding

KW - language models

KW - intranet search

KW - expertise search

U2 - 10.1145/1277741.1277836

DO - 10.1145/1277741.1277836

M3 - Conference contribution book

SN - 978-1-59593-597-7

SP - 551

EP - 558

BT - SIGIR '07 Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval

CY - New York, NY, USA

ER -

Balog K, Bogers T, Azzopardi L, de Rijke M, van den Bosch A. Broad expertise retrieval in sparse data environments. In SIGIR '07 Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. New York, NY, USA. 2007. p. 551-558 https://doi.org/10.1145/1277741.1277836