Web-based extraction of semantic relation instances for terminology work

  1. (PDF, 400 KB)
  2. Get@NRC: Web-based extraction of semantic relation instances for terminology work (Opens in a new window)
DOIResolve DOI: http://doi.org/10.1075/term.14.1.03hal
AuthorSearch for: ; Search for:
Journal titleTerminology
Pages2044; # of pages: 25
Subjectautomatic extraction of linguistic patterns; UMLS ontology expansion; web-based semantic relation extraction
AbstractThis article describes the implementation and evaluation of WWW2REL, a domain-independent and pattern-based knowledge discovery system which extracts semantic relation instances from text fragments on the WWW so as to assist terminologists updating or expanding existing ontologies. Unlike most comparable systems, WWW2REL is special in that it can be applied to any semantic relation type and operates directly on unannotated and uncategorized WWW text snippets rather than static repositories of academic papers from the target domain. The WWW is used for knowledge pattern (KP) discovery, KP filtering and relation instance discovery. The system is tested with the help of the biomedical UMLS Metathesaurus for four different relation types and is manually evaluated by four domain experts. This system evaluation shows how ranking relation instances by a measure of "knowledge pattern range" and applying two heuristics yields an average performance of 70% to 65% of the maximum possible F-score by top 10 and top 50 instances, respectively. Importantly, results show that much valuable information not present in the UMLS can be found through the proposed method. Finally, the article examines the domain-dependence of different aspects of the pattern-based knowledge discovery approach proposed.
Publication date
AffiliationNational Research Council Canada (NRC-CNRC); NRC Institute for Information Technology
Peer reviewedYes
NRC number50794
NPARC number15084655
Export citationExport as RIS
Report a correctionReport a correction
Record identifierd194bea3-1ac1-4102-a0df-43a9541130b0
Record created2010-06-10
Record modified2016-05-09
Bookmark and share
  • Share this page with Facebook (Opens in a new window)
  • Share this page with Twitter (Opens in a new window)
  • Share this page with Google+ (Opens in a new window)
  • Share this page with Delicious (Opens in a new window)