Automatic extraction of citations from the text of English-language patents - an example of template mining

Matthew Lawson, Nick Kemp, Michael F. Lynch*, Gobinda G. Chowdhury

*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

13 Citations (Scopus)

Abstract

Methods for automatically isolating and extracting bibliographic references from the full texts of patents are described and evaluated; these include citations both to patents and to other bibliographic sources. Patents are unusual as citing documents in that citations occur principally in the text of the abstracts or description parts of the documents, rather than as footnotes or in separate sections. A template mining approach has been developed for this purpose, to relieve patent examiners of the chore of doing this manually. The sub-languages of citations in patents are examined, and the development of templates for the extraction of citations to patents, journal articles, books and other sources in English-language patents described, as well as the evaluation of the degree of success of the approach.
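
As a rough illustration of the template idea described in the abstract, the sketch below matches one common surface form of a patent citation with a regular expression. The pattern, the name US_PATENT_TEMPLATE, the function extract_us_patent_citations, and the sample sentence are all assumptions made for illustration; they are not the templates developed in the paper, which also cover journal articles, books and other sources.

```python
import re

# Hypothetical template for one citation form, e.g. "U.S. Pat. No. 4,567,890".
# This pattern is an illustrative assumption, not a template from the paper.
US_PATENT_TEMPLATE = re.compile(
    r"U\.?\s?S\.?\s+Pat(?:ent|\.)?\s+No\.?\s*(?P<number>\d{1,2},\d{3},\d{3})"
)


def extract_us_patent_citations(text: str) -> list[str]:
    """Return the patent numbers matched by the template, in order of appearance."""
    return [m.group("number") for m in US_PATENT_TEMPLATE.finditer(text)]


# Invented sample text in the style of a patent description.
sample = (
    "A similar device is disclosed in U.S. Pat. No. 4,567,890, "
    "and an improvement is described in US Patent No. 5,012,345."
)
citations = extract_us_patent_citations(sample)
print(citations)
```

A practical system would presumably need one such template per citation sub-language and per source type, which is why the abstract treats template development and evaluation as separate steps.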

Original language: English
Pages (from-to): 423-436
Number of pages: 14
Journal: Journal of Information Science
Volume: 22
Issue number: 6
Publication status: Published - 1 Dec 1996

Keywords

  • template mining
  • citations
  • patents
