Extraction of chemical-protein interactions from the literature using neural networks and narrow instance representation
- PMID: 31622463
- PMCID: PMC6796919
- DOI: 10.1093/database/baz095
Abstract
The scientific literature contains large amounts of information on genes, proteins, chemicals and their interactions. Extraction and integration of this information in curated knowledge bases help researchers support their experimental results, leading to new hypotheses and discoveries. This is especially relevant for precision medicine, which aims to understand the individual variability across patient groups in order to select the most appropriate treatments. Methods for improved retrieval and automatic relation extraction from biomedical literature are therefore required for collecting structured information from the growing number of published works. In this paper, we follow a deep learning approach for extracting mentions of chemical-protein interactions from biomedical articles, based on various enhancements over our participation in the BioCreative VI CHEMPROT task. A significant aspect of our best method is the use of a simple deep learning model together with a very narrow representation of the relation instances, using only up to 10 words from the shortest dependency path and the respective dependency edges. Bidirectional long short-term memory recurrent networks or convolutional neural networks are used to build the deep learning models. We report the results of several experiments and show that our best model is competitive with more complex sentence representations or network structures, achieving an F1-score of 0.6306 on the test set. The source code of our work, along with detailed statistics, is publicly available.
© The Author(s) 2019. Published by Oxford University Press.
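The narrow instance representation described in the abstract can be illustrated with a minimal sketch. It assumes that a dependency parse is already available as a list of (head, dependent, label) edges, that each token is unique within the sentence, and that paths longer than the limit are simply cut off after the first 10 words. The abstract does not specify the truncation rule, so that choice, the function names, and the toy sentence are illustrative assumptions, not the authors' implementation.

```python
from collections import deque


def shortest_dependency_path(edges, source, target):
    """Return (words, labels) on the shortest undirected path in a dependency graph.

    edges: iterable of (head, dependent, label) triples (hypothetical input format).
    labels[i] is the dependency edge connecting words[i] and words[i + 1].
    """
    graph = {}
    for head, dependent, label in edges:
        graph.setdefault(head, []).append((dependent, label))
        graph.setdefault(dependent, []).append((head, label))

    # Breadth-first search; remember how each node was reached.
    previous = {source: None}
    queue = deque([source])
    while queue:
        node = queue.popleft()
        if node == target:
            break
        for neighbour, label in graph.get(node, []):
            if neighbour not in previous:
                previous[neighbour] = (node, label)
                queue.append(neighbour)

    if target not in previous:
        return [], []  # the two entities are not connected

    # Walk back from the target to the source, then reverse.
    words, labels = [target], []
    node = target
    while previous[node] is not None:
        parent, label = previous[node]
        words.append(parent)
        labels.append(label)
        node = parent
    words.reverse()
    labels.reverse()
    return words, labels


def narrow_instance(edges, chemical, protein, max_words=10):
    """Keep at most max_words path words and the edges between them.

    Cutting the path after the first max_words words is an assumption.
    """
    words, labels = shortest_dependency_path(edges, chemical, protein)
    words = words[:max_words]
    labels = labels[:max(len(words) - 1, 0)]
    return words, labels


# Toy parse of "Aspirin inhibits COX-1 activity in platelets" (hand-written).
EDGES = [
    ("inhibits", "Aspirin", "nsubj"),
    ("inhibits", "activity", "dobj"),
    ("activity", "COX-1", "compound"),
    ("activity", "platelets", "nmod"),
    ("platelets", "in", "case"),
]

words, labels = narrow_instance(EDGES, "Aspirin", "COX-1")
print(words)
print(labels)
```

In a setup like this, the word and edge sequences would then be mapped to embeddings and passed to a BiLSTM or CNN classifier; that stage is omitted here because the abstract gives no architectural details beyond the model families.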