ChemProt: a disease chemical biology database

Olivier Taboureau; Sonny Kim Nielsen; Karine Audouze; Nils Weinhold; Daniel Edsgärd; Francisco S. Roque; Irene Kouskoumvekaki; Alina Bora; Ramona Curpan; Thomas Skøt Jensen; Søren Brunak; Tudor I. Oprea

doi:10.1093/nar/gkq906

Nucleic Acids Res. 2011 Jan; 39(Database issue): D367–D372.

Published online 2010 Oct 8. doi: 10.1093/nar/gkq906

PMCID: PMC3013776

PMID: 20935044

ChemProt: a disease chemical biology database

Olivier Taboureau,^1,^* Sonny Kim Nielsen,¹ Karine Audouze,¹ Nils Weinhold,¹ Daniel Edsgärd,¹ Francisco S. Roque,¹ Irene Kouskoumvekaki,¹ Alina Bora,² Ramona Curpan,² Thomas Skøt Jensen,¹ Søren Brunak,¹ and Tudor I. Oprea^1,³

Olivier Taboureau

¹Department of Systems Biology, Center for Biological Sequence Analysis, Technical University of Denmark, Lyngby, DK-2800 Denmark, ²Department of Computational Chemistry, Institute of Chemistry, Romanian Academy, Timisoara 300223, Romania and ³Department of Biochemistry and Molecular Biology, Division of Biocomputing, University of New Mexico School of Medicine, Albuquerque, New Mexico 87131, USA

Find articles by Olivier Taboureau

Sonny Kim Nielsen

¹Department of Systems Biology, Center for Biological Sequence Analysis, Technical University of Denmark, Lyngby, DK-2800 Denmark, ²Department of Computational Chemistry, Institute of Chemistry, Romanian Academy, Timisoara 300223, Romania and ³Department of Biochemistry and Molecular Biology, Division of Biocomputing, University of New Mexico School of Medicine, Albuquerque, New Mexico 87131, USA

Find articles by Sonny Kim Nielsen

Karine Audouze

¹Department of Systems Biology, Center for Biological Sequence Analysis, Technical University of Denmark, Lyngby, DK-2800 Denmark, ²Department of Computational Chemistry, Institute of Chemistry, Romanian Academy, Timisoara 300223, Romania and ³Department of Biochemistry and Molecular Biology, Division of Biocomputing, University of New Mexico School of Medicine, Albuquerque, New Mexico 87131, USA

Find articles by Karine Audouze

Nils Weinhold

¹Department of Systems Biology, Center for Biological Sequence Analysis, Technical University of Denmark, Lyngby, DK-2800 Denmark, ²Department of Computational Chemistry, Institute of Chemistry, Romanian Academy, Timisoara 300223, Romania and ³Department of Biochemistry and Molecular Biology, Division of Biocomputing, University of New Mexico School of Medicine, Albuquerque, New Mexico 87131, USA

Find articles by Nils Weinhold

Daniel Edsgärd

¹Department of Systems Biology, Center for Biological Sequence Analysis, Technical University of Denmark, Lyngby, DK-2800 Denmark, ²Department of Computational Chemistry, Institute of Chemistry, Romanian Academy, Timisoara 300223, Romania and ³Department of Biochemistry and Molecular Biology, Division of Biocomputing, University of New Mexico School of Medicine, Albuquerque, New Mexico 87131, USA

Find articles by Daniel Edsgärd

Francisco S. Roque

¹Department of Systems Biology, Center for Biological Sequence Analysis, Technical University of Denmark, Lyngby, DK-2800 Denmark, ²Department of Computational Chemistry, Institute of Chemistry, Romanian Academy, Timisoara 300223, Romania and ³Department of Biochemistry and Molecular Biology, Division of Biocomputing, University of New Mexico School of Medicine, Albuquerque, New Mexico 87131, USA

Find articles by Francisco S. Roque

Irene Kouskoumvekaki

¹Department of Systems Biology, Center for Biological Sequence Analysis, Technical University of Denmark, Lyngby, DK-2800 Denmark, ²Department of Computational Chemistry, Institute of Chemistry, Romanian Academy, Timisoara 300223, Romania and ³Department of Biochemistry and Molecular Biology, Division of Biocomputing, University of New Mexico School of Medicine, Albuquerque, New Mexico 87131, USA

Find articles by Irene Kouskoumvekaki

Alina Bora

¹Department of Systems Biology, Center for Biological Sequence Analysis, Technical University of Denmark, Lyngby, DK-2800 Denmark, ²Department of Computational Chemistry, Institute of Chemistry, Romanian Academy, Timisoara 300223, Romania and ³Department of Biochemistry and Molecular Biology, Division of Biocomputing, University of New Mexico School of Medicine, Albuquerque, New Mexico 87131, USA

Find articles by Alina Bora

Ramona Curpan

¹Department of Systems Biology, Center for Biological Sequence Analysis, Technical University of Denmark, Lyngby, DK-2800 Denmark, ²Department of Computational Chemistry, Institute of Chemistry, Romanian Academy, Timisoara 300223, Romania and ³Department of Biochemistry and Molecular Biology, Division of Biocomputing, University of New Mexico School of Medicine, Albuquerque, New Mexico 87131, USA

Find articles by Ramona Curpan

Thomas Skøt Jensen

¹Department of Systems Biology, Center for Biological Sequence Analysis, Technical University of Denmark, Lyngby, DK-2800 Denmark, ²Department of Computational Chemistry, Institute of Chemistry, Romanian Academy, Timisoara 300223, Romania and ³Department of Biochemistry and Molecular Biology, Division of Biocomputing, University of New Mexico School of Medicine, Albuquerque, New Mexico 87131, USA

Find articles by Thomas Skøt Jensen

Søren Brunak

¹Department of Systems Biology, Center for Biological Sequence Analysis, Technical University of Denmark, Lyngby, DK-2800 Denmark, ²Department of Computational Chemistry, Institute of Chemistry, Romanian Academy, Timisoara 300223, Romania and ³Department of Biochemistry and Molecular Biology, Division of Biocomputing, University of New Mexico School of Medicine, Albuquerque, New Mexico 87131, USA

Find articles by Søren Brunak

Tudor I. Oprea

¹Department of Systems Biology, Center for Biological Sequence Analysis, Technical University of Denmark, Lyngby, DK-2800 Denmark, ²Department of Computational Chemistry, Institute of Chemistry, Romanian Academy, Timisoara 300223, Romania and ³Department of Biochemistry and Molecular Biology, Division of Biocomputing, University of New Mexico School of Medicine, Albuquerque, New Mexico 87131, USA

Find articles by Tudor I. Oprea

Author information Article notes Copyright and License information PMC Disclaimer

Abstract

Systems pharmacology is an emergent area that studies drug action across multiple scales of complexity, from molecular and cellular to tissue and organism levels. There is a critical need to develop network-based approaches to integrate the growing body of chemical biology knowledge with network biology. Here, we report ChemProt, a disease chemical biology database, which is based on a compilation of multiple chemical–protein annotation resources, as well as disease-associated protein–protein interactions (PPIs). We assembled more than 700 000 unique chemicals with biological annotation for 30 578 proteins. We gathered over 2-million chemical–protein interactions, which were integrated in a quality scored human PPI network of 428 429 interactions. The PPI network layer allows for studying disease and tissue specificity through each protein complex. ChemProt can assist in the in silico evaluation of environmental chemicals, natural products and approved drugs, as well as the selection of new compounds based on their activity profile against most known biological targets, including those related to adverse drug events. Results from the disease chemical biology database associate citalopram, an antidepressant, with osteogenesis imperfect and leukemia and bisphenol A, an endocrine disruptor, with certain types of cancer, respectively. The server can be accessed at http://www.cbs.dtu.dk/services/ChemProt/.

INTRODUCTION

The old drug design paradigm, i.e. drugs interact selectively with one or two targets (proteins), resulting in treatment and prevention of disease, is now challenged by several studies that show most drugs interacting with multiple targets (‘polypharmacology’) (1,2). For example, celecoxib, often considered a selective cyclooxygenase-2 non-steroidal anti-inflammatory drug (NSAID), has been documented to be active on at least two additional targets, namely carbonic anhydrase II and 5-lipoxygenase (3). Rosiglitazone, which has been used for the treatment of type II diabetes mellitus, not only stimulates the peroxisome proliferator activated receptor γ, but also blocks interferon gamma-induced chemokine expression in Graves disease or ophthalmopathy (4). Polypharmacology is not always beneficial, as it often causes side effects: Cisapride, which acts as a serotonergic 5-HT4 receptor agonist, as well as astemizole, which blocks histamine H1 receptors (H1Rs), have both been withdrawn from all markets due to the risk of fatal cardiac arrhythmia associated with their blockade of the hERG potassium ion channel, an unanticipated and undesirable ‘anti-target’ associated to QT prolongation and ‘torsades de pointes’ (5). However, ‘target’ and ‘anti-targets’ are dynamic attributes, as exemplified by the case of H1R antagonists and their (in)ability to achieve clinically significant levels in the brain, influenced by the ATP-binding cassette transporter ABCB1 (also known as P-glycoprotein), which effluxes some of these drugs from the brain (6). Acquiring knowledge of the complete pharmacology profile has inspired new strategies to predict and to characterize drug-target associations in order to improve the success rates of current drug discovery paradigms, i.e. increase the efficacy and reduce toxicity and adverse effects (2).

As large-scale chemical bioactivity databases are being assembled, the polypharmacology (i.e. high affinity bioactivity across related targets) and promiscuity (i.e. low affinity across multiple families) of chemicals are expanding the chemical space for druggable targets (7). These studies are often focused on specific protein families, such as G-protein coupled receptors (8), nuclear receptors (9) and kinases (10), but global pharmacology profiles of chemicals are considered as well (1,2). Recent chemoinformatics advances support the development of polypharmacology data mining, e.g. via iPHACE, an integrative web-based tool that enables pharmacological space navigation for small molecule drugs (11) or based on a Similarity Ensemble Approach (SEA) to relate protein pharmacology by ligand chemistry (12). Biological information can also be retrieved for a large set of chemical compounds through PubChem (13), CheBI and ChEMBL (14).

Two conceptual developments support polypharmacology: systems pharmacology, aimed at drug actions in the context of regulatory networks (15); and systems chemical biology (16), which introduces chemical awareness in systems biology. Since proteins rarely operate in isolation inside and outside cells, but rather function in highly interconnected cellular pathways, interactome networks have been developed by data integration. Yildirim et al. (17) combined FDA-approved drugs with a human protein–protein interaction (PPI) network (human interactome) in order to analyze the interrelationships between drug targets and disease–gene products i.e. disease–proteins. Similar work has been based on PubChem bioassays as source of polypharmacology (18). The use of side-effect similarity has been proposed on the assumption that drugs with similar side-effects are likely to interact with similar target proteins (19). Recent advances include a protein–protein association network based on the chemical toxicology of environmental chemicals (20) and a human disease network linking disorders and disease genes to various known phenotypes (21).

Our goal in the present work was to develop a disease chemical biology server, called ChemProt, based on the integration of chemical–protein annotation resources that are now accessible from large repositories, and curated disease-linked PPI data (22). ChemProt is designed to assist the elucidation of drug actions in the context of cellular and disease networks. Further to that, it allows the identification of additional genes that may play major roles in modulating chemical response i.e. to drugs, environmental chemicals and natural products, thus leading to new options in drug discovery and environmental chemical evaluation. Lastly, the ChemProt server could contribute to drug repurposing as well as to the investigation of chemicals related to anti-targets and adverse drug events.

IMPLEMENTATION

Data sources

We first gathered chemical–protein interaction data from different open source databases i.e. ChEMBL (version chembl_05) (14), BindingDB (23), PDSP Ki Database (24), DrugBank (version2.5) (25), PharmGKB (26) and two commercial databases, WOMBAT (version 2009) and WOMBAT-PK (version 2008) (7). Active compounds from the PubChem bioassay (2010) have been collected as well (13). We considered only active compounds from ‘confirmatory’ assays in order to capture high-confidence chemical–protein annotations from PubChem. These databases provide experimental evidence of chemical–protein interactions. Drug-target information was collected from DrugBank and PharmGKB. In addition, we integrated chemical–protein associations from CTD (version 2009) (27) and STITCH (version STITCH 2.0) (28). These last two databases consider the effect or modulation (positive or negative) of a chemical on proteins, other than that defined as binding activity. Examples include gene expression or pathway data, where the deregulation of a gene by a chemical may be not due to a physical interaction between the two entities but a response at a cellular level. Duplicate chemicals from the multiple databases were found by using InChI keys and were merged into a single ChemProt ID. However, the biological information associated to each chemical was conserved for users looking on selective databases. Overall, the final database contains 700 000 distinct molecules annotated for 30 578 proteins.

Descriptors and similarity measurement

The chemical structure of the molecules was encoded using two rather different types of fingerprints. The 166 MACCS keys, encode the presence or absence of predefined substructural or functional groups (29). On the other hand, a more complex 3-point pharmacophore fingerprint (GpiDAPH3) is based on an expansion of the PATTY pharmacophore feature recognition scheme of a 2D structure (30). This scheme assigns one or more pharmacophore feature types to all atoms in a molecule using a predefined list of SMART queries. The list of pharmacophore feature types comprises: hydrogen-bond donor (D), hydrogen-bond acceptor (A), polar (P) and hydrophobic (H). In addition, an extra label (p or pi) is added to each feature if the originating atom or group is sp²-hybridized or planar for other reasons. The GpiDAPH3 pharmacophore feature scheme is expressed in 2D as triplet feature combinations with a graph based inter-atom distance binning scheme. Both fingerprints are implemented in the Molecular Operating Environment (MOE, version 2008.10) (31). The similarity between two molecules is measured using the Tanimoto coefficient (Tc), a method of choice for the computation of fingerprint-based similarity (32). The Tc is defined as the number of bits in common divided by the total number of used bits in both molecules. For any pair of chemicals, Tc assumes values between 0 and 1. A high Tc represents high similarity.

PPI network

The human interactome used is an in-house protein–protein interaction network inferred from experiments in both humans and model organisms (22). Using an elaborate scoring scheme, all interactions have been validated against a gold standard (33). The current interactome contains 428 429 unique protein–proteins interactions derived from source databases such as BIND (34), GRID (35), MINT (36), dip_full (37), HPRD (38), intact (39), mppi (40), MPact (41), Reactome (42) and KEGG (43). Data are transferred between organisms by using the Inparanoid orthology database (44). In total the human interactome comprises 22 997 genes.

Human disease genes and complexes

Based on a previous study (45), disease-associated protein complexes were associated to the chemical–protein annotation by mining OMIM (46) and GeneCards (47), two data resources for genes association to diseases, we collected a list of 2227 unique disease-related proteins and mapped the complexes of genes to disease. Similarly, complexes of genes were mapped to Gene Ontology (GO) terms (48) and tissues by using the expression data from 73 non-disease tissues from the Novartis Research Foundation Gene Expression Database (GNF) (49) and Human Protein Atlas (50). Users of ChemProt can thus retrieve gene complexes that are related to a query chemical and visualize the annotations of each complex.

APPLICATIONS

Chemical–protein interactions

Chemicals can be searched using a common name, SMILES and by drawing the 2D structure, or retrieved through their annotation to a protein. Users can then choose the descriptor space and the Tc threshold to be used for similarity search. Following a successful query, hits grouped by species will be returned, together with computed physico-chemical properties such as Molecular Weight, LogP, the number of hydrogen bond donors and acceptors, the number of rigid bonds and the number of rings, based on the Marvin applet from Chemaxon (51). Hits are provided separately for known annotations, and for prediction of small molecule bioactivity, respectively. The biochemical and pharmacological effects of a chemical, e.g. substrate, inhibitor, agonist or antagonist, are provided if such information is available, together with hyperlinks to UniProt and Ensembl, which lead to more information on protein sequence and function, respectively.

From chemical–protein interactions to complex protein–disease associations

The unique feature of ChemProt is that it offers the user the possibility to get information at a cellular level, by linking chemically-induced biological perturbations to specific tissues and phenotypes.

Proteins that are both affected by a chemical and participate in one or more protein complexes are highlighted in the results table of the ChemProt server. By clicking on the protein, the user is redirected to the ‘Disease complexes’ server and has to choose which complex to visualize. On the ‘Disease complexes’ server, size and illustrations of the protein network are provided. Additionally, enrichment analysis results of the proteins in the complex are shown, with respect to disease association (OMIM, BioAlma), GO terms (biological process, cellular component) and tissue specificity (Human Protein Atlas, GNF). To ensure that the complexes were biologically relevant entities, the enrichment of the biological terms (OMIM, GO,…) was compared to randomly generated complexes (1.0e6). The significances were calculated using a hyper-geometric test and the P-value for the most significant enriched term for each of the data types was calculated as previously described (45). The table presenting the OMIM enrichment results is interactively linked with an illustration of the protein complex where proteins associated with the selected disease are colored yellow.

Output of the chemical–proteins interactions and disease complexes can be downloaded from the ChemProt website. In addition, the ‘Reflect’ service provides further information on chemicals and genes (52). ‘Reflect’ tags gene, protein and small molecule names in text and offers the opportunity to quickly view additional information on the ChemProt results, including synonyms, protein sequences, domains, 3D structures and subcellular location.

EXAMPLES

With the integration of several databases, ChemProt not only provides pharmacological information, but also includes biological data associated to environmental chemicals and natural products. As seen in the examples below, ChemProt can be queried for drugs as well as environmental chemicals. A search for citalopram, an antidepressant, illustrates the complementarity of the integrated databases within ChemProt (Figure 1). Marketed as a selective serotonin reuptake inhibitor (SSRI) (DrugBank), this drug displays bioactivity on seven human proteins (ChEMBL). Via ChemProt, four other proteins (DRD3, 5HT1B, 5HT3, ADRA2A) are retrieved from the Ki database. Additional information on drug-target associations is provided by STITCH and CTD. From the first annotation to the D4 dopamine receptor (DRD4), the disease term (under Disease Complexes) is highlighted, indicating that protein–protein interaction information for this protein is available. Using the link to the Disease Complexes server, one finds that DRD4 interacts with three proteins (SRC, GRB2 and NCK1). According to OMIM, this protein network is associated to osteogenesis imperfecta and leukemia and, according to BioAlma, to several psychotic disorders. GO enrichment indicates significant association of the protein complex to signal complex formation and vesicle membrane. Furthermore, tissue annotation suggests that this complex is mainly expressed in follicle and non-follicle cells (HPA) and dentritic cells (GNF). Although it might be surprising to see a connection between antidepressant and leukemia, it has been shown recently that antidepressants such as chlomipramine and fluoxetine reduce the growth of B-cell malignancies in leukemia (53).

An external file that holds a picture, illustration, etc.
Object name is gkq906f1.jpg

Open in a separate window

Figure 1.

Chemical–protein annotation and disease associations retrieved from ChemProt for the compound citalopram. (1) The compound can be queried using different formats (name, SMILES and structure). (2) A query results in a table showing protein annotations and bioactivity predictions for the compound. (3) Finally, a protein–protein interaction network (protein–complex) for a target protein can be depicted and disease associations (OMIM and BioAlma) and other biological components (GO terms, HPA and mRNA expression) are displayed.

The second query, ‘bisphenol A’ (BPA), is an environmental pollutant used as plasticizer (54). BPA has biological activity on the estrogen receptor α (ESR1), the androgen receptor (AR) and the estrogen related receptor gamma (ERR3). However, several other proteins are retrieved from CTD and STITCH based on association data with this chemical. Looking at ESR1 in the Disease Complexes server, a complex of 17 proteins is depicted (complex 265) with significant associations to Li-FRAUMENI syndrome, breast cancer and neoplasms. Enrichment analysis indicates that the complex is found in the nucleus (GO cellular component), involved in the regulation of metabolic processes and transcriptionally regulated by the RNA polymerase II promoter (GO biological process). Furthermore, data from immunohistochemistry studies suggest that the complex is mainly located in the endometrium and the cerebral cortex (HPA). The disease chemical biology network for BPA indicates that, under certain conditions, this chemical may be associated with certain types of cancers.

We have illustrated that ChemProt integrates molecular, cellular and phenotypic data associated to small molecules, which can lead to novel links and suggest new avenues for research. We envisage that the ChemProt server will find applications within a variety of chemogenomics, polypharmacology and systems chemical biology studies. ChemProt will be updated once a year with new compounds, new interactions and more sophisticated descriptors.

FUNDING

EU (DEER); Innovative Medicines Initiative Joint Undertaking (eTOX); Danish Research Council for Technology and Production Sciences; Lundbeck foundation and the Villum Rasmussen Foundation. Funding for open access charge: DEER.

Conflict of interest statement. None declared.

ACKNOWLEDGEMENTS

Sunset Molecular Discovery LLC (www.sunsetmolecular.com) contributed with the WOMBAT databases.

REFERENCES

1. Paolini GV, Shapland RH, van Hoorn WP, Mason JS, Hopkins AL. Global mapping of pharmacological space. Nat. Biothechnol. 2006;24:805–815. [PubMed] [Google Scholar]

2. Keiser MJ, Setola V, Irwin JJ, Laggner C, Abbas AI, Hufeisen SJ, Jensen NH, Kuijer MB, Matos RC, Tran TB, et al. Predicting new molecular targets for known drugs. Nature. 2009;462:175–181. [PMC free article] [PubMed] [Google Scholar]

3. Mestres J, Gregori-Puigjané E, Valverde S, Solé RV. The topology of drug-target interaction networks: implicit dependence on drug properties and target families. Mol. Biosyst. 2009;5:1051–1057. [PubMed] [Google Scholar]

4. Antonelli A, Ferrari SM, Fallahi P, Piaggi S, Paolicchi A, Franceschini SS, Salvi M, Ferrannini E. Metabolism. 2010. Cytokines (interferon-gamma and tumor necrosis factor-alpha)-induced nuclear factor-kappaB activation and chemokine (C-X-C motif) ligand 10 release in Graves disease and ophthalmopathy are modulated by pioglitazone. doi:10.1016/j.metabol.2010.02.002. [PubMed] [Google Scholar]

5. Vaz RJ, Klabunde T. Antitargets: Prediction and prevention of drug side effects. In: Mannhold R, Kubinyi H, Folkers G, editors. Methods and Principles in Medicinal Chemistry. Weinheim: Wiley-VCH; 2008. [Google Scholar]

6. Broccatelli F, Carosati E, Cruciani G, Oprea TI. Transporter-mediated efflux influences CNS side effects: ABCB1, from antitarget to target. Mol. Inf. 2010;29:16–26. [PMC free article] [PubMed] [Google Scholar]

7. Olah M, Rad R, Ostopovici L, Bora A, Hadaruga N, Hadaruga D, Moldovan R, Fulias A, Mracec M, Oprea TI. WOMBAT and WOMBAT-PK: bioactive databases for lead and drug discovery. In: Schreiber SL, Kapoor TM, Wess G, editors. Chemical Biology: From Small Molecules to Systems Biology and Drug Design. New York: Wiley-VCH; 2007. pp. 760–786. [Google Scholar]

8. Weill N, Rognan D. Development and validation of a novel protein-ligand fingerprint to mine chemogenomic space: application to G-protein coupled receptors and their ligands. J. Chem. Inf. Model. 2009;49:1049–1062. [PubMed] [Google Scholar]

9. Mestres J, Martin-Couce L, Grgori-Puigjané E, Cases M, Boyer S. Ligand-based approach to in silico pharmacology: nuclear receptor profiling. J. Chem. Inf. Model. 2006;46:2725–2736. [PubMed] [Google Scholar]

10. Knight ZA, Lin H, Shokat KM. Targeting the cancer kinome through polypharmacology. Nat. Rev. Cancer. 2010;10:130–137. [PMC free article] [PubMed] [Google Scholar]

11. Garcia-Serna R, Ursu O, Oprea TI, Mestres J. iPHACE: integrative navigation in pharmacological space. Bioinformatics. 2010;26:985–986. [PMC free article] [PubMed] [Google Scholar]

12. Keiser MJ, Roth BL, Armbruster BN, Ernsberger P, Irwin JJ, Shoichet BK. Relating protein pharmacology by ligand chemistry. Nat. Biotechnol. 2007;25:197–206. [PubMed] [Google Scholar]

13. Wheeler DL, Barrett T, Benson DA, Bryant SH, Canese K, Chetvernin V, Church DM, DiCuccion M, Edgar R, Federhen S, et al. Databases resources of the National Center for Biotechnology Information. Nucleic Acids Res. 2007;35:D5–D12. [PMC free article] [PubMed] [Google Scholar]

14. de Matos P, Alcántara R, Dekker A, Ennis M, Hastings J, Haug K, Spiteri I, Turner S, Steinbeck C. Chemical entities of biological interest: an update. Nucleic Acids Res. 2010;38:D249–D254. [PMC free article] [PubMed] [Google Scholar]

15. Berger SI, Iyengar R. Network analyses in systems pharmacology. Bioinformatics. 2009;25:2466–2472. [PMC free article] [PubMed] [Google Scholar]

16. Oprea TI, Tropsha A, Faulon JL, Rintoul MD. Systems chemical biology. Nat. Chem. Biol. 2007;3:447–450. [PMC free article] [PubMed] [Google Scholar]

17. Yildirim MA, Goh KI, Cusick ME, Barabási AL, Vidal M. Drug-target network. Nat. Biotechnol. 2007;25:1119–1126. [PubMed] [Google Scholar]

18. Chen B, Wild D, Guha R. PubChem as a source of polypharmacology. J. Chem. Inf. Model. 2009;49:2044–2055. [PubMed] [Google Scholar]

19. Kuhn M, Campillos M, Letunic I, Jensen LJ, Bork P. A side effect resource to capture phenotypic effects of drugs. Mol. Syst. Biol. 2010;6:343. [PMC free article] [PubMed] [Google Scholar]

20. Audouze K, Juncker AS, Roque FJ, Krysiak-Baltyn K, Weinhold N, Taboureau O, Jensen TS, Brunak S. Deciphering diseases and biological targets for environmental chemicals using toxicogenomics networks. PLoS Comput. Biol. 2010;6:e10000788. [PMC free article] [PubMed] [Google Scholar]

21. Goh KI, Cusick ME, Valle D, Childs B, Vidal M, Barabási AL. The human disease network. Proc. Natl Acad. Sci. USA. 2007;104:8685–8690. [PMC free article] [PubMed] [Google Scholar]

22. Lage K, Karlberg EO, Størling ZM, Olason OI, Pedersen AG, Rigina O, Hinsby AM, Tümer Z, Pociot F, Tommerup N, et al. A human phenome-interactome network of protein complexes implicated in genetic disorders. Nat. Biotechnol. 2007;25:309–316. [PubMed] [Google Scholar]

23. Liu T, Lin Y, Wen X, Jorissen RN, Gilson MK. Binding DB: a web-accessible database of experimentally determined protein-ligand binding affinities. Nucleic Acids Res. 2007;35:D198–D201. [PMC free article] [PubMed] [Google Scholar]

24. Roth B, Lopez E, Beischel S, Weskaemper RB, Evans JM. Screening the receptorome to discover the molecular targets for plant-derived psychoactive compounds: a novel approach for CNS drug discovery. Pharmacol. Ther. 2004;102:99–110. [PubMed] [Google Scholar]

25. Wishart DS, Knox C, Guo AC, Shrivastava S, Hassanali M, Stothard P, Chang Z, Woolsey J. DrugBank: a comprehensive resource for in silico drug discovery and exploration. Nucleic Acids Res. 2006;34:D668–D672. [PMC free article] [PubMed] [Google Scholar]

26. Hewett M, Oliver DE, Rubin DL, Easton KL, Stuart JM, Altman RB, Klein TE. PharmGKB: the pharmacogenetics knowledge base. Nucleic Acids Res. 2002;30:163–165. [PMC free article] [PubMed] [Google Scholar]

27. Davis AP, Murphy CG, Saraceni-Richards CA, Rosentrein MC, Wiegers TC, Mattingly CJ. Comparative toxicogenomics database: a knowledgebase and discovery tool for chemical-gene-disease networks. Nucleic Acids Res. 2009;37:D786–D792. [PMC free article] [PubMed] [Google Scholar]

28. Kuhn M, Szklarczyk D, Franceschini A, Campillos M, von Mering C, Jensen LJ, Beyer A, Bork P. STITCH 2: an interaction network database for small molecules and proteins. Nucleic Acids Res. 2010;38:D552–D556. [PMC free article] [PubMed] [Google Scholar]

29. Durant JL, Leland BA, Henry DR, Nourse JG. Reoptimization of MDL keys for use in drug discovery. J. Chem. Inf. Comput. Sci. 2002;42:1273–1280. [PubMed] [Google Scholar]

30. Bush BL, Sheridan RP. Patty: a programmable atom typer and language for automatic classification of atoms in molecular databases. J. Chem. Inf. Comput. Sci. 1993;33:756–762. [Google Scholar]

31. MOE (version 2007.09), Chemical Computing Group, Montreal, Canada. [(29 September 2010, date last accessed)]. www.chemcomp.com. [Google Scholar]

32. Willet P. Similarity-based virtual screening using 2D fingerprints. Drug Discov. Today. 2006;11:1046–1053. [PubMed] [Google Scholar]

33. Rual JF, Venkatesan K, Hao T, Dricot A, Hirozane-Kishikawa T, Li N, Berriz GF, Gibbons FD, Dreze M, Ayivi-Guedehoussou N, et al. Towards a proteome-scale map of the human protein-protein interaction network. Nature. 2005;437:1173–1178. [PubMed] [Google Scholar]

34. Bader GD, Betel D, Hogue CW. BIND: the biomolecular interaction network database. Nucleic Acids Res. 2003;31:248–250. [PMC free article] [PubMed] [Google Scholar]

35. Stark C, Breitkreutz BJ, Reguly T, Boucher L, Breitkreutz A, Tyers M. BioGRID: a general repository for interaction datasets. Nucleic Acids Res. 2006;34:D535–D539. [PMC free article] [PubMed] [Google Scholar]

36. Zanzoni A, Montecchi-Palazzi L, Quondam M, Ausiello G, Helmer-Citterich M, Cesareni G. MINT: a molecular interaction database. FEBS Lett. 2002;513:135–140. [PubMed] [Google Scholar]

37. Salwinski L, Miller C, Smith A, Pettit F, Bowie J, Eisenberg D. The database of interacting proteins: 2004 update. Nucleic Acids Res. 2004;32:D449–D451. [PMC free article] [PubMed] [Google Scholar]

38. Mishra G, Suresh M, Kumaran K, Kannabiran N, Suresh S, Bala P, Shivakumar K, Anuradha N, Reddy R, Raghavan TM, et al. Human protein reference database – 2006 update. Nucleic Acids Res. 2006;34:D411–D414. [PMC free article] [PubMed] [Google Scholar]

39. Hermjakob H, Montecchi-Palazzi L, Lewington C, Mudali S, Kerrien S, Orchard S, Vingron M, Roechert B, Roepstorff P, Valencia A, et al. IntAct: an open source molecular interaction database. Nucleic Acids Res. 2004;32:D452–D455. [PMC free article] [PubMed] [Google Scholar]

40. Pagel P, Kovac S, Oesterheld M, Braumer B, Dunger-Kaltenbach I, Frishman G, Montrone C, Mark P, Stumpflen V, Mewes HW, et al. The MIPS mammalian protein-protein interaction database. Bioinformatics. 2005;21:832–834. [PubMed] [Google Scholar]

41. Guldener U, Munsterkotter M, Oesterheld M, Pagel P, Ruepp A, Mewes HW, Stumpflen V. MPact: the MIPS protein interaction resource on yeast. Nucleic Acids Res. 2006;34:D436–D441. [PMC free article] [PubMed] [Google Scholar]

42. Joshi-Tope G, Gillespie M, Vastrik I, D’Eustachio P, Schmidt E, de Bono B, Jassal B, Gopinath GR, Wu GR, Matthews L, et al. Reactome: a knowledgebase of biological pathways. Nucleic Acids Res. 2005;33:D428–D432. [PMC free article] [PubMed] [Google Scholar]

43. Kanehisa M, Goto S, Hattori M, Aoki-Kinoshita KF, Itoh M, Kawashima S, Katayama T, Araki M, Hirakawa M. From genomics to chemical genomics: new developments in KEGG. Nucleic Acids Res. 2006;34:D354–D357. [PMC free article] [PubMed] [Google Scholar]

44. O’Brien KP, Remm M, Sonnhammer EL. Inparanoid: a comprehensive database of eukaryotic orthologs. Nucleic Acids Res. 2005;33:D476–D480. [PMC free article] [PubMed] [Google Scholar]

45. Lage K, Hansen NT, Karlberg EO, Eklund AC, Roque FS, Donahoe PK, Szallasi Z, Jensen TS, Brunak S. A large-scale analysis of tissue-specific pathology and gene expression of human disease genes and complexes. Proc. Natl Acad. Sci. USA. 2008;105:20870–20875. [PMC free article] [PubMed] [Google Scholar]

46. Hamosh A, Scott AF, Amberger JS, Bocchini CA, McKusick VA. Online Mendelian Inheritance in Man (OMIM), a knowledgebase of human genes and genetic disorders. Nucleic Acids Res. 2005;33:D514–D517. [PMC free article] [PubMed] [Google Scholar]

47. Safran M, Solomon I, Shmueli O, Lapidot M, Shen-Orr S, Adat A, Ben-Dor U, Esterman N, Rosen N, Peter I, et al. GeneCards 2002: towards a complete, object-oriented, human gene compendium. Bioinformatics. 2002;18:1542–1543. [PubMed] [Google Scholar]

48. Camon E, Magrane M, Barrell D, Lee V, Dimmer E, Maslen J, Binns D, Harte N, Lopez R, Apweiler R. The gene ontology annotation (GOA) database – sharing knowledge in Uniprot with gene ontology. Nucleic Acids Res. 2004;32:D262–D266. [PMC free article] [PubMed] [Google Scholar]

49. Su AI, Wiltshire T, Batalov S, Lapp H, Ching KA, Block D, Zhang J, Soden R, Hayakawa M, Kreiman G, et al. A gene atlas of the mouse and human protein-encoding transcriptomes. Proc. Natl Acad. Sci. USA. 2004;101:6062–6067. [PMC free article] [PubMed] [Google Scholar]

50. Ponten F, Jirström K, Uhlen M. The human protein atlas – a tool for pathology. J. Pathol. 2008;216:387–393. [PubMed] [Google Scholar]

51. Marvin, version5.3. [(29 September 2010, date last accessed)]. http://www.chemaxon.com/ [Google Scholar]

52. Pafilis E, O’Donoghue SI, Jensen LJ, Horn H, Kuhn M, Brown NP, Schneider R. Reflect: augmented browsing for the life scientist. Nat. Biotechnol. 2009;27:508–510. [PubMed] [Google Scholar]

53. Chamba A, Holder MJ, Jarrett RF, Shield L, Toellner KM, Drayson MT, Barnes NM, Gordon J. SLC6A4 expression and anti-proliferative responses to serotonin transporter ligands fluoxetine in primary B-cell malignancies. Leuk. Res. 2010;34:1103–1106. [PubMed] [Google Scholar]

54. Halden RU. Plastics and health risks. Annu. Rev. Public Health. 2010;31:179–194. [PubMed] [Google Scholar]

Articles from Nucleic Acids Research are provided here courtesy of Oxford University Press