Haystacks and Hypotheses

DOI: http://doi.org/10.1002/meet.1450400107
Proceedings title: Proceedings of the American Society for Information Science and Technology (ASIST) Annual Conference
Conference: American Society for Information Science and Technology (ASIST) Annual Conference, October 19-22, 2003, Long Beach, California, USA
Pages: 59-64; # of pages: 6
Abstract: This paper describes EurekaSeek, a bibliometric technique for automated linked-literature analysis. The MEDLINE database of biomedical literature is searched iteratively to identify research opportunities in the form of conceptual linkages between terms. As a tool for identifying undiscovered public knowledge, EurekaSeek is a variation on the techniques of Swanson and Smalheiser. It uses medical subject headings instead of text analysis in a fully automated search process, thereby eliminating the reliance on expert input when linking literatures. EurekaSeek is tested here by retroactively examining the co-occurrence of terms in the published literature. The hypothesis is that this tool, had it existed in the past, could have identified conceptual linkages that appeared only later in the literature. In addition, EurekaSeek is compared against a process that considers all potential term-to-term relationships; the list of terms that EurekaSeek produces is a subset of all potential linked-literature terms. The experiment shows that EurekaSeek produces a higher percentage of likely hypotheses than the process that considers all terms. Although the proportion of identified linkages is still too small for the process to be a practical aid to research, the results are statistically significant. Metaphorically speaking, EurekaSeek identifies a higher proportion of needles per haystack.
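The retroactive test described in the abstract can be illustrated with a small sketch. This is not the EurekaSeek implementation: the abstract does not state how EurekaSeek selects its subset of terms, so the `min_bridges` filter below is a hypothetical stand-in, and the records are toy data with placeholder headings ("A", "B1", "C1") rather than real MEDLINE subject headings. The sketch assumes a Swanson-style A-B-C linkage: terms that share an intermediate heading with the start term before a cutoff year, but never co-occur with it directly, are treated as candidate hypotheses and then checked against records published after the cutoff.

```python
from collections import defaultdict

# Toy records: (publication year, set of subject headings). Invented for illustration.
RECORDS = [
    (1990, {"A", "B1"}),
    (1991, {"A", "B2"}),
    (1991, {"B1", "C1"}),
    (1992, {"B2", "C1"}),
    (1992, {"B1", "C2"}),
    (1993, {"D", "C3"}),
    (1996, {"A", "C1"}),  # linkage that appears only after the cutoff
]


def cooccurring(records, term, start=None, end=None):
    """Headings that share a record with `term` in years [start, end)."""
    found = set()
    for year, headings in records:
        if start is not None and year < start:
            continue
        if end is not None and year >= end:
            continue
        if term in headings:
            found |= headings - {term}
    return found


def candidate_links(records, a_term, cutoff, min_bridges=1):
    """Map each indirectly linked term to the intermediate terms that bridge it.

    `min_bridges` is a hypothetical filter, not the criterion used in the paper.
    """
    known = cooccurring(records, a_term, end=cutoff)
    bridges = defaultdict(set)
    for b_term in known:
        for c_term in cooccurring(records, b_term, end=cutoff):
            if c_term != a_term and c_term not in known:
                bridges[c_term].add(b_term)
    return {c: bs for c, bs in bridges.items() if len(bs) >= min_bridges}


def hit_rate(candidates, records, a_term, cutoff):
    """Fraction of candidates that co-occur with `a_term` at or after the cutoff."""
    candidates = set(candidates)
    if not candidates:
        return 0.0
    later = cooccurring(records, a_term, start=cutoff)
    return len(candidates & later) / len(candidates)


all_linked = candidate_links(RECORDS, "A", cutoff=1995)               # every linked term
filtered = candidate_links(RECORDS, "A", cutoff=1995, min_bridges=2)  # stricter subset

rate_all = hit_rate(all_linked, RECORDS, "A", cutoff=1995)
rate_filtered = hit_rate(filtered, RECORDS, "A", cutoff=1995)
print(sorted(all_linked), rate_all)
print(sorted(filtered), rate_filtered)
```

On this toy data the filtered subset contains a higher proportion of later-confirmed linkages than the full list of linked terms, which mirrors the kind of comparison the abstract reports; the numbers themselves carry no empirical meaning.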
Affiliation: NRC Institute for Information Technology; National Research Council Canada; NRC Central Business Services
Peer reviewed: No
NRC number: 47117
NPARC number: 9167860
Record identifier: e4aa23ff-5af3-46cd-9b70-277f5f4ebb0a
Record created: 2009-06-29
Record modified: 2016-05-09