Extracting threshold conceptual structures from web documents

Gabriel Ciobanu, Ross Horne, Cristian Vǎideanu

Research output: Chapter in Book/Report/Conference proceedingConference contribution book

3 Citations (Scopus)

Abstract

In this paper we describe an iterative approach based on formal concept analysis to refine the information retrieval process. Based on weights for ranking documents we define a weighted formal context. We use a Galois connection to introduce a new type of formal concept that allows us to work with specific thresholds for searching words in Web documents. By increasing the threshold, we obtain smaller lattices with more relevant concepts, thus improving the retrieval of more specific items. We use techniques for processing large data sets in parallel, to generate sequences of Galois lattices, overcoming the time complexity of building a lattice for an entire large context.

Original languageEnglish
Title of host publicationGraph-Based Representation and Reasoning
Subtitle of host publication21st International Conference on Conceptual Structures, ICCS 2014, Iaşi, Romania, July 27-30, 2014, Proceedings
EditorsNathalie Hernandez, Robert Jäschke, Madalina Croitoru
Place of PublicationCham, Switzerland
PublisherSpringer
Pages130-144
Number of pages15
ISBN (Electronic)9783319083896
ISBN (Print)9783319083889
DOIs
Publication statusPublished - 17 Jul 2014
Event21st International Conference on Conceptual Structures, ICCS 2014 - Iasi, Romania
Duration: 27 Jul 201430 Jul 2014

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume8577 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference21st International Conference on Conceptual Structures, ICCS 2014
Country/TerritoryRomania
CityIasi
Period27/07/1430/07/14

Keywords

  • information retreival
  • derivation operator
  • concept lattice
  • formal context
  • formal concept analysis

Fingerprint

Dive into the research topics of 'Extracting threshold conceptual structures from web documents'. Together they form a unique fingerprint.

Cite this