Topic detection and tracking on heterogeneous information

Long Chen, Huaizhi Zhang, Joemon M. Jose, Haitao Yu, Yashar Moshfeghi, Peter Triantafillou

Research output: Contribution to journalArticle

3 Citations (Scopus)
31 Downloads (Pure)

Abstract

Given the proliferation of social media and the abundance of news feeds, a substantial amount of real-time content is distributed through disparate sources, which makes it increasingly difficult to glean and distill useful information. Although combining heterogeneous sources for topic detection has gained attention from several research communities, most of them fail to consider the interaction among different sources and their intertwined temporal dynamics. To address this concern, we studied the dynamics of topics from heterogeneous sources by exploiting both their individual properties (including temporal features) and their inter-relationships. We first implemented a heterogeneous topic model that enables topic--topic correspondence between the sources by iteratively updating its topic--word distribution. To capture temporal dynamics, the topics are then correlated with a time-dependent function that can characterise its social response and popularity over time. We extensively evaluate the proposed approach and compare to the state-of-the-art techniques on heterogeneous collection. Experimental results demonstrate that our approach can significantly outperform the existing ones.
Original languageEnglish
Number of pages23
JournalJournal of Intelligent Information Systems
Early online date19 Sep 2017
DOIs
Publication statusE-pub ahead of print - 19 Sep 2017

Keywords

  • topic detection
  • heterogeneous sources
  • temporal dynamics
  • social response
  • topic importance

Cite this