Harnessing Biomedical Natural Language Processing Tools to Identify Medicinal Plant Knowledge from Historical Texts

Vivekanand Sharma, Wayne Law, Michael J. Balick, Indra Neil Sarkar

Research output: Contribution to journalConference articlepeer-review

Abstract

The growing amount of data describing historical medicinal uses of plants from digitization efforts provides the opportunity to develop systematic approaches for identifying potential plant-based therapies. However, the task of cataloguing plant use information from natural language text is a challenging task for ethnobotanists. To date, there have been only limited adoption of informatics approaches used for supporting the identification of ethnobotanical information associated with medicinal uses. This study explored the feasibility of using biomedical terminologies and natural language processing approaches for extracting relevant plant-associated therapeutic use information from historical biodiversity literature collection available from the Biodiversity Heritage Library. The results from this preliminary study suggest that there is potential utility of informatics methods to identify medicinal plant knowledge from digitized resources as well as highlight opportunities for improvement.

Original languageAmerican English
Pages (from-to)1537-1546
Number of pages10
JournalAMIA Annual Symposium proceedings
Volume2017
StatePublished - 2017

ASJC Scopus Subject Areas

  • General Medicine

Keywords

  • Biodiversity
  • Biological Ontologies
  • Feasibility Studies
  • History, 19th Century
  • History, 20th Century
  • Information Storage and Retrieval
  • Natural Language Processing
  • Phytotherapy/history
  • Plants, Medicinal
  • Textbooks as Topic/history

Cite this