Literature explorer

effective retrieval of scientific documents through nonparametric thematic topic detection

Shaopeng Wu, Youbing Zhao, Farzad Parvinzamir, Nikolaos Th. Ersotelos, Hui Wei, Feng Dong

Research output: Contribution to journalArticle

1 Downloads (Pure)

Abstract

Scientific researchers are facing a rapidly growing volume of literatures nowadays. While these publications offer rich and valuable information, the scale of the datasets makes it difficult for the researchers to manage and search for desired information efficiently. Literature Explorer is a new interactive visual analytics suite that facilitates the access to desired scientific literatures through mining and interactive visualisation. We propose a novel topic mining method that is able to uncover “thematic topics” from a scientific corpus. These thematic topics have an explicit semantic association to the research themes that are commonly used by human researchers in scientific fields, and hence are human interpretable. They also contribute to effective document retrieval. The visual analytics suite consists of a set of visual components that are closely coupled with the underlying thematic topic detection to support interactive document retrieval. The visual components are adequately integrated under the design rationale and goals. Evaluation results are given in both objective measurements and subjective terms through expert assessments. Comparisons are also made against the outcomes from the traditional topic modelling methods.
Original languageEnglish
Number of pages18
JournalThe Visual Computer
Early online date2 Aug 2019
DOIs
Publication statusE-pub ahead of print - 2 Aug 2019

Fingerprint

Visualization
Semantics

Keywords

  • topic explorer
  • data visualisation
  • topic modelling
  • text mining
  • web application
  • scientific documents

Cite this

Wu, Shaopeng ; Zhao, Youbing ; Parvinzamir, Farzad ; Th. Ersotelos, Nikolaos ; Wei, Hui ; Dong, Feng. / Literature explorer : effective retrieval of scientific documents through nonparametric thematic topic detection. In: The Visual Computer . 2019.
@article{75266384a6224e26b02c5470977a4946,
title = "Literature explorer: effective retrieval of scientific documents through nonparametric thematic topic detection",
abstract = "Scientific researchers are facing a rapidly growing volume of literatures nowadays. While these publications offer rich and valuable information, the scale of the datasets makes it difficult for the researchers to manage and search for desired information efficiently. Literature Explorer is a new interactive visual analytics suite that facilitates the access to desired scientific literatures through mining and interactive visualisation. We propose a novel topic mining method that is able to uncover “thematic topics” from a scientific corpus. These thematic topics have an explicit semantic association to the research themes that are commonly used by human researchers in scientific fields, and hence are human interpretable. They also contribute to effective document retrieval. The visual analytics suite consists of a set of visual components that are closely coupled with the underlying thematic topic detection to support interactive document retrieval. The visual components are adequately integrated under the design rationale and goals. Evaluation results are given in both objective measurements and subjective terms through expert assessments. Comparisons are also made against the outcomes from the traditional topic modelling methods.",
keywords = "topic explorer, data visualisation, topic modelling, text mining, web application, scientific documents",
author = "Shaopeng Wu and Youbing Zhao and Farzad Parvinzamir and {Th. Ersotelos}, Nikolaos and Hui Wei and Feng Dong",
year = "2019",
month = "8",
day = "2",
doi = "10.1007/s00371-019-01721-7",
language = "English",

}

Literature explorer : effective retrieval of scientific documents through nonparametric thematic topic detection. / Wu, Shaopeng; Zhao, Youbing; Parvinzamir, Farzad; Th. Ersotelos, Nikolaos; Wei, Hui; Dong, Feng.

In: The Visual Computer , 02.08.2019.

Research output: Contribution to journalArticle

TY - JOUR

T1 - Literature explorer

T2 - effective retrieval of scientific documents through nonparametric thematic topic detection

AU - Wu, Shaopeng

AU - Zhao, Youbing

AU - Parvinzamir, Farzad

AU - Th. Ersotelos, Nikolaos

AU - Wei, Hui

AU - Dong, Feng

PY - 2019/8/2

Y1 - 2019/8/2

N2 - Scientific researchers are facing a rapidly growing volume of literatures nowadays. While these publications offer rich and valuable information, the scale of the datasets makes it difficult for the researchers to manage and search for desired information efficiently. Literature Explorer is a new interactive visual analytics suite that facilitates the access to desired scientific literatures through mining and interactive visualisation. We propose a novel topic mining method that is able to uncover “thematic topics” from a scientific corpus. These thematic topics have an explicit semantic association to the research themes that are commonly used by human researchers in scientific fields, and hence are human interpretable. They also contribute to effective document retrieval. The visual analytics suite consists of a set of visual components that are closely coupled with the underlying thematic topic detection to support interactive document retrieval. The visual components are adequately integrated under the design rationale and goals. Evaluation results are given in both objective measurements and subjective terms through expert assessments. Comparisons are also made against the outcomes from the traditional topic modelling methods.

AB - Scientific researchers are facing a rapidly growing volume of literatures nowadays. While these publications offer rich and valuable information, the scale of the datasets makes it difficult for the researchers to manage and search for desired information efficiently. Literature Explorer is a new interactive visual analytics suite that facilitates the access to desired scientific literatures through mining and interactive visualisation. We propose a novel topic mining method that is able to uncover “thematic topics” from a scientific corpus. These thematic topics have an explicit semantic association to the research themes that are commonly used by human researchers in scientific fields, and hence are human interpretable. They also contribute to effective document retrieval. The visual analytics suite consists of a set of visual components that are closely coupled with the underlying thematic topic detection to support interactive document retrieval. The visual components are adequately integrated under the design rationale and goals. Evaluation results are given in both objective measurements and subjective terms through expert assessments. Comparisons are also made against the outcomes from the traditional topic modelling methods.

KW - topic explorer

KW - data visualisation

KW - topic modelling

KW - text mining

KW - web application

KW - scientific documents

U2 - 10.1007/s00371-019-01721-7

DO - 10.1007/s00371-019-01721-7

M3 - Article

ER -