Nat Commun. 2015 Dec 22:6:10086.
doi: 10.1038/ncomms10086.

Patterns and functional implications of rare germline variants across 12 cancer types

Charles Lu et al. Nat Commun. 2015.

Abstract

Large-scale cancer sequencing data enable discovery of rare germline cancer susceptibility variants. Here we systematically analyse 4,034 cases from The Cancer Genome Atlas representing 12 cancer types. We find that the frequency of rare germline truncations in 114 cancer-susceptibility-associated genes varies widely, from 4% (acute myeloid leukaemia (AML)) to 19% (ovarian cancer), with a notably high frequency of 11% in stomach cancer. Burden testing identifies 13 cancer genes with significant enrichment of rare truncations, some associated with specific cancers (for example, RAD51C, PALB2 and MSH6 in AML, stomach and endometrial cancers, respectively). Significant, tumour-specific loss of heterozygosity occurs in nine genes (ATM, BAP1, BRCA1/2, BRIP1, FANCM, PALB2 and RAD51C/D). Moreover, our homology-directed repair assay of 68 BRCA1 rare missense variants supports the utility of allelic enrichment analysis for characterizing variants of unknown significance. The scale of this analysis and the somatic-germline integration enable the detection of rare variants that may affect individual susceptibility to tumour development, a critical step toward precision medicine.
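To make the idea of burden testing concrete, the following is a minimal sketch of a per-gene case-versus-control enrichment test followed by false discovery rate adjustment. It is not the authors' procedure: the study uses a total frequency test (TFT), whereas this sketch swaps in a one-sided Fisher's exact test and the Benjamini-Hochberg adjustment as familiar stand-ins. The cohort sizes match those reported for the burden analysis (3,125 cases, 1,039 controls), but the gene names and carrier counts are hypothetical.

```python
from math import comb


def fisher_exact_greater(a, b, c, d):
    """One-sided Fisher's exact P-value for enrichment of carriers in cases.

    a = carriers among cases,    b = non-carriers among cases,
    c = carriers among controls, d = non-carriers among controls.
    """
    n_cases, n_controls = a + b, c + d
    carriers = a + c
    denom = comb(n_cases + n_controls, carriers)
    upper = min(carriers, n_cases)
    # Sum hypergeometric probabilities of tables at least as extreme as observed.
    numer = sum(
        comb(n_cases, k) * comb(n_controls, carriers - k)
        for k in range(a, upper + 1)
    )
    return numer / denom


def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted P-values in the input order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest P-value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


if __name__ == "__main__":
    n_cases, n_controls = 3125, 1039  # cohort sizes reported in the source
    # Hypothetical carrier counts (cases, controls); not data from the study.
    counts = {"GENE_A": (40, 2), "GENE_B": (3, 1), "GENE_C": (12, 0)}
    genes = list(counts)
    pvals = [
        fisher_exact_greater(x, n_cases - x, y, n_controls - y)
        for x, y in (counts[g] for g in genes)
    ]
    for gene, p, q in zip(genes, pvals, benjamini_hochberg(pvals)):
        print(f"{gene}: P={p:.3g}, FDR={q:.3g}, significant={q <= 0.05}")
```

A gene would be called significant here when its adjusted value is at or below 0.05, mirroring the 5% FDR threshold used in the paper.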

Figures

Figure 1. Characteristics of the data.
Data are distributed by age, cancer, cohort and carrier frequency. (a) Age of onset by cancer type. Average age varies across cancer types, from 43 years in LGG to 67.7 years in LUSC. Note that LGG, LUAD and STAD show clear bimodal characteristics. (b) Age distributions for discovery, validation and control cohorts. (c) Comparison of cancer gene truncation carrier frequencies across 12 cancer types. The distribution of rare germline truncation variants for 12 cancer types (represented as the per cent of cases in each cancer type with rare germline truncation mutation) in 2 different groups of cancer-associated genes (labelled on top of each bar plot): 114 cancer susceptibility genes from Rahman et al. and 47 genes associated with the DNA repair (Fanconi Anaemia) pathway. There are 15 genes common to both groups. The total number of unique genes from these 2 groups is 131.
Figure 2. Burden analysis reveals distinct set of cancer susceptibility genes across 12 cancer types.
A total of 34 genes of interest were identified by burden analysis, which compared the frequencies of rare truncation variants in Caucasian cancer cases (n=3,125) with their frequencies in the WHI control population (n=1,039). Two oncogenes (ABL2 and BCR) were omitted. (a) Significant genes across Pan-Cancer types. Data were analysed with the total frequency test (TFT) followed by false discovery rate (FDR) ranking. Dark horizontal line indicates the 5% FDR threshold, which is satisfied by five genes: BRCA1, BRCA2, ATM, BRIP1 and PALB2. Inset shows closer visual resolution. (b) Significant genes for specific cancer types. Each plot shows the top tested genes, by FDR, from the same TFT analysis procedure for all 12 individual cancer types. Eight genes in addition to the five shown in panel a are significant at the 5% FDR level from cancer-type-specific analysis. (c) Cohort frequencies of genes. Bubble plot shows frequency of rare truncation mutation as a percentage of cases in each cohort (all 4,034 cases included for frequency calculation). The x-axis denotes the test group of a specific cancer type, the Pan-Cancer discovery cohort (n=4,034) and the validation cohort (n=1,627). Genes found to be significant at 5% FDR using the Pan-Cancer discovery cohort are labelled in boldface. Rings indicate genes that are significant (TFT, FDR ≤5%) for a particular cohort on the x-axis. (d) Percentage of cases carrying rare truncation in the 34 genes of interest across 12 cancer types in the discovery cohort.
Figure 3. Analysis of loss of heterozygosity in rare truncation and missense variants.
(a) Bar plot shows individual truncations from nine genes (FDR shown) with lengths representing ratios of tumour-to-normal variant allele fractions (that is, the fraction of reads containing the variant allele). Statistically significant events, defined as FDR ≤5%, are shaded boldly, while non-significant events are muted, with colours corresponding to genes. Cancer source of each truncation is shown underneath; for example, most BRCA1 variants occur in ovarian and breast cancers and all BAP1 variants in KIRC. (b) Bar plot for individual missense variants from four genes having elevated frequencies of such variants that show very significant LOH, that is, at the 1% FDR level. (c) Dot plot shows individual missense variants where abscissa and ordinate are amino acid positions and the ratio of tumour-to-normal variant allele fraction, respectively. Blue and red indicate significant (FDR ≤5%) and non-significant events, respectively, with size of dots proportional to negative log of the FDR. Annotated domains from the PFAM database are aligned with position, while shaded areas indicate ‘hotspot’ regions where variants having significant LOH cluster more densely than expected by chance. Plots are shown for ATM, BRCA1, BRCA2, FANCA and FANCM.
Figure 4. Molecular interactions between rare germline variants and somatic mutations within and across cancer types.
(a) Heatmap demonstrates the significance of interactions between 34 burden test significant genes and 54 cancer-associated genes (top 30 are shown) with recurrently mutated somatic variants across cancer types. Red–white colour scale and blue–white colour scale depict the negative log of P-value for mutual exclusivity and co-occurrence, respectively. Both are based on the MuSiC permutation test (n=10,000). (b) Abacus plot displays the distribution of significant, mutually exclusive rare germline variants and somatic mutations across all 12 cancer types. Unique combinations of germline and somatic variants contribute to the development of individual cancer types. Bigger dots indicate recurrent genes across cancer types, while smaller dots indicate cancer-type-enriched genes.
Figure 5. Germline variants correlate with somatic mutations and age at diagnosis.
(a) Barplot illustrates the distribution of BRCA1, BRCA2 and ATM somatic and germline mutations across cancer types. (b,c) Panels display genes significantly correlated with somatic mutation frequency and younger age of onset in different cancer types and in Pan-Cancer. The width of the shape indicates the density, and the horizontal line indicates the median. P value is calculated by the Wilcoxon rank-sum test and is indicated by the size of the uppermost circles.
Figure 6. Functional validation of BRCA1 missense and truncation variants.
(a) 68 rare missense and 4 truncation variant sites were tested by HDR assay. All samples were depleted of endogenous BRCA1 by transfection of a siRNA targeting the 3′-untranslated region. Indicated in the legend are the plasmids transfected to test for rescue of BRCA1 activity. ‘pcDNA3’ is empty vector and ‘WT’ represents wild-type BRCA1 plasmid. The y-axis denotes the HDR activity relative to the wild-type BRCA1 protein. Error bars depict s.d. from the mean. Dots on the x-axis represent LOH status, each dot corresponding to one case. Blue, red, dark grey and light grey denote statistical significance, non-significance, unknown LOH (due to lack of sufficient coverage) and untested, respectively. Variants in different functional domains are indicated with colours as follows: orange, RING domain; green, nuclear localization signal (NLS); blue, DNA-binding region; purple, a SQ/TQ cluster domain (SCD); and red, BRCA1 C-terminal domain (BRCT). All the HDR assays were performed in triplicate. (b) Crystal structures of the BRCA1 RING domain in complex with the BARD1 RING domain (labelled in grey; left panel) and of the BRCT domain (right panel) are displayed, with HDR-defective variants labelled in red and partial HDR-defective variants tagged in orange. Variants in yellow are functional in the HDR assay.
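The loss-of-heterozygosity panels in Figure 3 plot the ratio of tumour-to-normal variant allele fractions, where the variant allele fraction is the fraction of reads containing the variant allele. The sketch below shows only that arithmetic; the function names and read counts are hypothetical, and the statistical test and FDR correction that the study uses to call an event significant are not reproduced here.

```python
def variant_allele_fraction(variant_reads, total_reads):
    """Fraction of reads at a site that carry the variant allele."""
    if total_reads <= 0:
        raise ValueError("total_reads must be positive")
    if not 0 <= variant_reads <= total_reads:
        raise ValueError("variant_reads must lie between 0 and total_reads")
    return variant_reads / total_reads


def tumour_to_normal_vaf_ratio(tumour_variant, tumour_total,
                               normal_variant, normal_total):
    """Ratio of tumour VAF to matched-normal VAF for one germline variant.

    A heterozygous germline variant is expected near 0.5 in the normal
    sample; a ratio well above 1 is consistent with loss of the wild-type
    allele in the tumour.
    """
    normal_vaf = variant_allele_fraction(normal_variant, normal_total)
    if normal_vaf == 0:
        raise ValueError("variant not observed in the normal sample")
    tumour_vaf = variant_allele_fraction(tumour_variant, tumour_total)
    return tumour_vaf / normal_vaf


if __name__ == "__main__":
    # Hypothetical read counts: 80/100 variant reads in tumour, 50/100 in normal.
    ratio = tumour_to_normal_vaf_ratio(80, 100, 50, 100)
    print(f"tumour-to-normal VAF ratio: {ratio:.2f}")
```

In practice the ratio depends on tumour purity and local copy number, so a raw ratio alone would not be enough to call an event.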

References

    1. Rahman N. Realizing the promise of cancer predisposition genes. Nature 505, 302–308 (2014).
    2. Walsh T. et al. Mutations in 12 genes for inherited ovarian, fallopian tube, and peritoneal carcinoma identified by massively parallel sequencing. Proc. Natl Acad. Sci. USA 108, 18032–18037 (2011).
    3. Kanchi K. L. et al. Integrated analysis of germline and somatic variants in ovarian cancer. Nat. Commun. 5, 3156 (2014).
    4. Schwartz A. G. Genetic epidemiology of cigarette smoke-induced lung disease. Proc. Am. Thorac. Soc. 9, 22–26 (2012).
    5. Bodmer W. & Tomlinson I. Rare genetic variants and the risk of cancer. Curr. Opin. Genet. Dev. 20, 262–267 (2010).