Enhancing generalizability and performance in drug-target interaction identification by integrating pharmacophore and pre-trained models

doi:10.1093/bioinformatics/btae240

. 2024 Jun 28;40(Supplement_1):i539-i547.

doi: 10.1093/bioinformatics/btae240.

Enhancing generalizability and performance in drug-target interaction identification by integrating pharmacophore and pre-trained models

Zuolong Zhang¹, Xin He^{1

2}, Dazhi Long³, Gang Luo⁴, Shengbo Chen⁵

Affiliations

¹ School of Software, Henan University, Kaifeng, Henan Province 475000, China.
² Henan International Joint Laboratory of Intelligent Network Theory and Key Technology, Henan University, Kaifeng, Henan Province 475000, China.
³ Department of Urology, Ji'an Third People's Hospital, Ji'an, Jiangxi Province 343000, China.
⁴ School of Mathematics and Computer Science, Nanchang University, Nanchang, Jiangxi Province 330031, China.
⁵ Henan Engineering Research Center of Intelligent Technology and Application, Henan University, Kaifeng, Henan Province 475000, China.

PMID: 38940179
PMCID: PMC11211825
DOI: 10.1093/bioinformatics/btae240

Free PMC article

Enhancing generalizability and performance in drug-target interaction identification by integrating pharmacophore and pre-trained models

Zuolong Zhang et al. Bioinformatics. 2024.

Free PMC article

. 2024 Jun 28;40(Supplement_1):i539-i547.

doi: 10.1093/bioinformatics/btae240.

Authors

Zuolong Zhang¹, Xin He^{1

2}, Dazhi Long³, Gang Luo⁴, Shengbo Chen⁵

Affiliations

¹ School of Software, Henan University, Kaifeng, Henan Province 475000, China.
² Henan International Joint Laboratory of Intelligent Network Theory and Key Technology, Henan University, Kaifeng, Henan Province 475000, China.
³ Department of Urology, Ji'an Third People's Hospital, Ji'an, Jiangxi Province 343000, China.
⁴ School of Mathematics and Computer Science, Nanchang University, Nanchang, Jiangxi Province 330031, China.
⁵ Henan Engineering Research Center of Intelligent Technology and Application, Henan University, Kaifeng, Henan Province 475000, China.

PMID: 38940179
PMCID: PMC11211825
DOI: 10.1093/bioinformatics/btae240

Abstract

Motivation: In drug discovery, it is crucial to assess the drug-target binding affinity (DTA). Although molecular docking is widely used, computational efficiency limits its application in large-scale virtual screening. Deep learning-based methods learn virtual scoring functions from labeled datasets and can quickly predict affinity. However, there are three limitations. First, existing methods only consider the atom-bond graph or one-dimensional sequence representations of compounds, ignoring the information about functional groups (pharmacophores) with specific biological activities. Second, relying on limited labeled datasets fails to learn comprehensive embedding representations of compounds and proteins, resulting in poor generalization performance in complex scenarios. Third, existing feature fusion methods cannot adequately capture contextual interaction information.

Results: Therefore, we propose a novel DTA prediction method named HeteroDTA. Specifically, a multi-view compound feature extraction module is constructed to model the atom-bond graph and pharmacophore graph. The residue concat graph and protein sequence are also utilized to model protein structure and function. Moreover, to enhance the generalization capability and reduce the dependence on task-specific labeled data, pre-trained models are utilized to initialize the atomic features of the compounds and the embedding representations of the protein sequence. A context-aware nonlinear feature fusion method is also proposed to learn interaction patterns between compounds and proteins. Experimental results on public benchmark datasets show that HeteroDTA significantly outperforms existing methods. In addition, HeteroDTA shows excellent generalization performance in cold-start experiments and superiority in the representation learning ability of drug-target pairs. Finally, the effectiveness of HeteroDTA is demonstrated in a real-world drug discovery study.

Availability and implementation: The source code and data are available at https://github.com/daydayupzzl/HeteroDTA.

PubMed Disclaimer

References

1. Heliyon. 2023 Dec 05;10(1):e23172 - PubMed
1. Bioinformatics. 2015 Jun 15;31(12):i221-9 - PubMed
1. Precis Clin Med. 2021 Jan 18;4(1):1-16 - PubMed
1. J Cheminform. 2017 Apr 18;9(1):24 - PubMed
1. Nature. 2023 Apr;616(7958):673-685 - PubMed

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions

Substances

Actions

Grants and funding

LinkOut - more resources

Full Text Sources
- Silverchair Information Systems

[1] Heliyon. 2023 Dec 05;10(1):e23172 - PubMed

[2] Heliyon. 2023 Dec 05;10(1):e23172 - PubMed

[3] Bioinformatics. 2015 Jun 15;31(12):i221-9 - PubMed

[4] Bioinformatics. 2015 Jun 15;31(12):i221-9 - PubMed

[5] Precis Clin Med. 2021 Jan 18;4(1):1-16 - PubMed

[6] Precis Clin Med. 2021 Jan 18;4(1):1-16 - PubMed

[7] J Cheminform. 2017 Apr 18;9(1):24 - PubMed

[8] J Cheminform. 2017 Apr 18;9(1):24 - PubMed

[9] Nature. 2023 Apr;616(7958):673-685 - PubMed

[10] Nature. 2023 Apr;616(7958):673-685 - PubMed

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Enhancing generalizability and performance in drug-target interaction identification by integrating pharmacophore and pre-trained models

Affiliations

Enhancing generalizability and performance in drug-target interaction identification by integrating pharmacophore and pre-trained models

Authors

Affiliations

Abstract

Similar articles

References

MeSH terms

Substances

Grants and funding

LinkOut - more resources

Full Text Sources