Yoshiro Morimoto1,2 • Naoki Yamamoto2 • Shinji Kanegae2 • Ryosuke Matsuzaka2 • Hiroki Ozawa1,2 • Akira Imamura1
1Child and Adolescent Psychiatry Community Partnership Unit, Nagasaki University Hospital, Nagasaki, Japan; 2Department of Neuropsychiatry, Unit of Translational Medicine, Nagasaki University Graduate School of Biomedical Sciences, Nagasaki, Japan
Abstract: Neuropsychiatric disorders such as autism spectrum disorder (ASD), attention-deficit/hyperactivity disorder (ADHD), schizophrenia, bipolar disorder, and major depressive disorder tend to be classified as distinct entities. However, increasing evidence suggests that there are overlaps among these disorders in terms of their genetic risk factors. For example, chromosomal microdeletions and duplications in 16p11.2 have been reported in individuals with ASD as well as those with schizophrenia and intellectual disability, and common copy number variations have been reported in ASD, schizophrenia, and ADHD. Genome-wide association studies have also revealed common risk variants among ASD, ADHD, bipolar disorder, depression, and schizophrenia. Moreover, next-generation sequencing techniques have revealed overlap in de novo mutations among ASD, schizophrenia, and intellectual disability. Together, these results indicate that there are shared genetic risks across a range of different disorders. The findings from these studies converge on postsynaptic proteins, such as L-type calcium channels and proteins involved in N-methyl-D-aspartate signaling, and indicate that synaptic mechanisms may underlie many different neuropsychiatric disorders, including ASD. Future studies are likely to reveal further shared mechanisms and contribute to the development of therapies for these disorders.
Keywords: autism spectrum disorder; genetic overlap; genomic copy number variants; genome-wide association studies; next-generation sequencing
Author for correspondence: Akira Imamura, Child and Adolescent Psychiatry Community Partnership Unit, Nagasaki University Hospital, Nagasaki, Japan. E-mail: aimamura@nagasaki-u.ac.jp
Doi: https://doi.org/10.36255/exonpublications.autismspectrumdisorders.2021.geneticoverlap
In: Autism Spectrum Disorders. Grabrucker AM (Editor). Exon Publications, Brisbane, Australia. ISBN: 978-0-6450017-8-5; Doi: https://doi.org/10.36255/exonpublications.autismspectrumdisorders.2021
Copyright: The Authors.
License: This open access article is licenced under Creative Commons Attribution-NonCommercial 4.0 International (CC BY-NC 4.0) https://creativecommons.org/licenses/by-nc/4.0/
Autism spectrum disorder (ASD) is a neurodevelopmental disorder that is conceptualized as a behavioral syndrome (1). In 1943, Leo Kanner was the first to introduce the term “autism” to define a unique syndrome found in young children in which social and emotional relationships are disrupted (2). Autism is now known as ASD, and is classified as a neurodevelopmental disorder in the Diagnostic and Statistical Manual of Mental Disorders, 5th Edition (DSM-5) by the American Psychiatric Association (3) and in the International Classification of Diseases, 10th Revision by the World Health Organization (4). These classification systems define mental disorders, including ASD, based on the quality and quantity of symptoms. However, the boundaries between each disorder and normality are not always clear. In addition, many symptoms overlap between each disease category, and a single patient may present with features of more than one disorder. In some cases, “comorbidity” is considered to address this overlap of symptoms and disease categories. For example, it is estimated that between 30 and 80% of children with ASD also have attention-deficit/hyperactivity disorder (ADHD) (5, 6).
In recent years, accumulating evidence supports a biological overlap among many psychiatric disorders, which has fueled investigations into the underlying mechanisms of these disorders. In particular, advances in genomic technology and the resulting studies have provided converging evidence to support the hypothesis that genetic risk factors are shared among psychiatric disorders.
A systematic review of epidemiological studies of ASD reported a median global prevalence estimate of 62 per 10,000 (7). ASD is a multifactorial disorder and genetic factors appear to be particularly important in the development of ASD. Familial and hereditary studies have reported that if a second child in the family is also diagnosed with ASD, the recurrence rate may be 25–30% higher compared to sporadic cases of ASD. Furthermore, a high heritability has been reported in ASD; identical twins have a concordance rate of 70–90% (8, 9). It has also been reported that up to 40% of cases of ASD in children have a genetic cause (e.g., genetic syndromes such as fragile X syndrome, Rett syndrome, tuberous sclerosis, mutations in the phosphatase and tensin homolog (PTEN) gene, or structural chromosomal deletions or duplications that can be detected using chromosomal microarrays) (9, 10). In addition, metabolic disorders caused by mitochondrial DNA abnormalities are relatively common in individuals with ASD (11, 12).
Newer chromosomal single nucleotide polymorphism (SNP) microarrays can identify genetic abnormalities (e.g., microinsertions and microdeletions) that are unable to be identified using high-resolution chromosome methods. Shen et al. (13) performed karyotyping, fragile X DNA testing, and chromosomal microarrays in 933 ASD patients. They identified karyotypic abnormalities in 2.2% of subjects, fragile X abnormalities in 0.5% of subjects, and deletions and duplications in 18.2% of subjects. Deletions or duplications in children with ASD have been reported to involve chromosome regions 1q24.2, 2q37.3, 3p26.2, 4q34.2, 6q24.3, 7q11, 7q35, 13q13.2–q22, 15q11–q13, 15q22, 16p11.2, 17p11.2, 22q11.2, 22q13, and Xp22, and additional ASD-associated cytogenetic disorders have also been identified using new ultra-high-resolution microarray technology (e.g., 15q11.2 breakpoint 1–2 [BP1–BP2] deletions) (14–17). Copy number variation or copy number variability is a concept that refers to regions of the genome with variable copy numbers; it includes genomic structural abnormalities such as deletions, duplications, and inversions (18, 19). CNVs can be identified by microarrays, and early studies using microarrays reported the presence of CNVs in up to 11% of ASD patients (20–22). These early studies also reported a high prevalence of de novo CNVs in sporadic ASD cases. These de novo CNVs were considered highly pathogenic because they were not observed in unaffected family members or controls. However, more recent studies of multiplex families have confirmed that some CNVs are present in unaffected relatives of probands, while others are absent in affected relatives, revealing a discrepancy between genotype and phenotype (21, 22). The CNVs that have been reported as associated with ASD include genes that were previously demonstrated to be associated with ASD. Representative ASD-related genes that were identified in these early reports include oxytocin/neurophysin I prepropeptide (OXT), SH3 and multiple ankyrin repeat domains 3 (SHANK3), and neuroligin 4 (NLGN4) (20–23). To date, more than 40 recurrent CNVs have been reported as consistently associated with ASD (24). In addition, some recently identified recurrent CNVs are reportedly associated with characteristic clinical symptoms in addition to ASD (25, 26).
GWAS have attracted attention as a method that can identify disease-associated genetic loci, especially for multifactorial diseases. Using GWAS, hundreds of ASD risk loci scattered across all human chromosomes have been identified. In a GWAS of 4,300 children with ASD and 6,500 controls of European descent, six SNPs located on chromosome 5 between the cadherin 10 (CDH10) and cadherin 9 (CDH9) genes, which encode neural cell adhesion molecules, were reported to be strongly associated with ASD (27). In recent years, many GWAS of ASD have been conducted, and more than 100 loci have been reported as associated with ASD (28, 29). However, the effect sizes of the common ASD-associated variants identified by these GWAS analyses are smaller than expected, and the composition of ASD-associated genes is highly heterogeneous. Currently, no ASD-associated regions have been consistently replicated across multiple GWAS, although there have been several significant genome-wide findings (29). These results suggest that the previously reported GWAS for ASD may be underpowered. The primary cause of underpowered GWAS is a small sample size; thus, larger sample sizes may allow for the discovery of additional ASD candidate loci (30). In addition, several new challenges have been reported to increase the power of GWAS analysis to detect disease-related genes and pathogenic mechanisms. Narita et al. (31) demonstrated that clustering can be used to identify subgroups that are relatively homogeneous in disease etiology. Furthermore, Xie et al. (32) performed an integrated investigation based on Sherlock analysis, spatiotemporal expression patterns, expression analysis, protein–protein interactions, coexpression, and associations with brain structure, and revealed that comprehensive integrated analysis may be an effective way to identify ASD-susceptible variants. Future GWAS analyses using these new methods may lead to the discovery of novel ASD risk genes.
NGS was put to practical use in the early 2000s and has dramatically increased the speed of genome sequencing. The use of NGS has enabled the performance of whole-genome and whole-exome sequencing, which were not possible using conventional sequencing methods, and has led to the identification of rare genetic variants (variants with minor allele frequencies of less than 1%) with strong effect sizes. Early NGS analyses of ASD reported a significantly higher proportion of de novo single nucleotide mutations in ASD patients compared with both unaffected sibling control subjects and theoretical expectations (33–36). It has also been reported that, despite the very high heterogeneity of loci at which de novo mutations are identified, a large number of risk genes and regions occur in two major biological areas: synaptic structure and function, and chromatin modification (37–40). In addition, studies with large cohorts have identified new biological regions where risk genes for ASD may functionally intersect or converge. Examples include the relative enrichment genes that are expressed in the fetal brain, transcription factors and RNA-binding proteins, and the targets of fragile X mental retardation protein (FMRP) (38, 39, 41, 42). Recent analyses using NGS have implicated three potential functional pathways to ASD. These include genes and pathways for chromatin remodeling, (e.g., chromodomain helicase DNA binding protein 7 [CHD7], methyl CpG binding protein 2 [MECP2], DNA [cytosine-5]-methyltransferase 3A [DNMT3A], and PHD finger protein 2 [PHF2]), Wnt (e.g., CHD8, paired box protein Pax-5 [PAX5], and transcriptional regulator ATRX [ATRX]), other signaling super-pathways (e.g., G protein-coupled receptors [GPCRs], extracellular signal-regulated kinases [ERKs], RET proto-oncogene [RET], and protein kinase B [AKT] pathways) (43), and mitochondrial dysfunction in ASD (44).
In recent years, the relationship and overlap between functional psychiatric disorders (e.g., schizophrenia and bipolar disorder) and neurodevelopmental disorders has been discussed. These disorders are considered to be a continuum of neurodevelopmental disturbances that are induced by genetic and environmental factors (45, 46). This model suggests that multiple psychiatric disorders, including ASD, may form a spectrum with a common biological basis and differing grades of severity. Recent genomic analyses have supported such a model, and have led to the identification of common genetic variants and biological factors across a range of neurodevelopmental disorders.
As noted previously in this chapter, early CNV studies reported striking results regarding the relationship between genotype and phenotype. As the size of analyzed cohorts has increased, duplications and deletions of ASD-related regions have been demonstrated to be independently associated with ASD risk (47, 48). As de novo CNV studies have extended across psychiatric research populations, it has become increasingly difficult to identify ASD-associated CNVs that do not also confer some risk for another psychiatric disorder or phenotype, including schizophrenia, bipolar disorder, ADHD, specific language impairment, and a leftward shift in intelligence quotient among carriers. Sanders et al. (48) screened 69 loci in 1,105 children with unexplained intellectual disability (ID) and identified seven children (0.7%) with 16p11.2 deletion, which has been linked to ASD. In addition, McCarthy et al. (49) reported an association between 16p11.2 microduplication and schizophrenia in two large cohorts. Furthermore, a meta-analysis of multiple psychiatric datasets revealed significant associations between 16p11.2 microduplication and schizophrenia (p = 4.8 × 10−7), bipolar disorder (p = 0.017), and ASD (p = 1.9 × 10−7). In a study of individuals from Iceland and Norway, Gudmundsson et al. (50) reported that eight CNVs that had previously been linked to schizophrenia and ASD were also associated with ADHD (deletions in 2p16.3 [neurexin 1; NRXN1], 15q11.2, 15q13.3 [BP4 and BP4.5–BP5], and 22q11.21, and duplications at 1q21.1 distal, 16p11.2 proximal, 16p13.11, and 22q11.21). This result highlights the pleiotropic effects of neuropsychiatric CNVs and provides evidence to support the concept that ADHD, ASD, and schizophrenia may be related neurodevelopmental disorders rather than distinct entities (50).
Zarrei et al. (51) used microarrays to genotype 2,691 subjects with diagnosed neurodevelopmental disorders (204 with schizophrenia, 1,838 with ASD, 427 with ADHD, and 222 with obsessive-compulsive disorder) and 1,769 family members (mostly parents). Clinically significant CNVs (CNVs in the broad sense) were identified in 284 (10.5%) subjects, including 22 (10.8%) with schizophrenia, 209 (11.4%) with ASD, 40 (9.4%) with ADHD, and 13 (5.6%) with obsessive-compulsive disorder. In addition, these authors searched for genes affected by different CNVs in multiple disorders. Examples of neurodevelopmental disorder-related genes linked across multiple disorders include NRXN1, SEH1 like nucleoporin (SEH1L), low density lipoprotein receptor class A domain containing 4 (LDLRAD4), G protein subunit alpha L (GNAL), G protein subunit gamma 13 (GNG13), makorin ring finger protein 1 (MKRN1), dynactin subunit 2 (DCTN2), kinase non-catalytic C-lobe domain containing 1 (KNDC1), protein-L-isoaspartate O-methyltransferase domain-containing protein 2 (PCMTD2), kinesin family member 5A (KIF5A), synemin (SYNM), and the long non-coding RNAs AK127244 and PTCHD1-AS. These findings indicate that CNVs impacting the same genes may potentially contribute to the etiology of multiple neurodevelopmental disorders (51).
The Cross-Disorder Group of the Psychiatric Genomics Consortium has attempted to identify the specific variants that underlie the genetic effects common to five psychiatric disorders: ASD, ADHD, bipolar disorder, major depression, and schizophrenia. They analyzed SNP data from 33,332 cases and 27,888 controls of European ancestry. Four SNPs exceeded the cutoff value (p < 5 × 10−8) for genome-wide significance: regions on chromosomes 3p21 and 10q24, and SNPs within two L-type calcium voltage-gated channel subunits, alpha 1 C (CACNA1C) and beta 2 (CACNB2) (52). In addition, Grove et al. (53) conducted a genome-wide association meta-analysis of 18,381 ASD cases and 27,969 controls, using a unique Danish population resource to substantially increase the sample size. They leveraged the results of GWAS for three phenotypes (schizophrenia, major depression, and education level) with significant overlap in genetic architecture to identify seven loci that were shared with other traits using a strict significance level (53). Furthermore, Wu et al. (54) conducted a meta-analysis of GWAS for schizophrenia (n = 65,967), bipolar disorder (n = 41,653), ASD (n = 46,350), ADHD (n = 55,374), and depression (n = 688,809). This meta-analysis identified significant enrichment of overlapping genes among different disorders and identified a group of cross-disease genes. Seven genes—gamma-aminobutyric acid type B receptor subunit 1 (GABBR1), glycosyltransferase 8 domain containing 1 (GLT8D1); histones H1.5 (HIST1H1B), H2B type 1-N (HIST1H2BN), and H4 (HIST1H4L); potassium voltage-gated channel subfamily B member 1 (KCNB1); and DCC netrin 1 receptor (DCC)—were commonly associated with four of the five psychiatric disorders. The sortilin related VPS10 domain containing receptor 3 (SORCS3) gene was also highlighted because it was involved in all five conditions in the study (54).
A polygenic risk score for common variants allows us to verify that genetic risk is shared among different disorders. Using polygenic risk scores, it has been reported that alleles overexpressed in patients with schizophrenia are also overexpressed in patients with bipolar disorder and ADHD, but not ASD (55–57). Moreover, González-Peñas et al. (58) reported genetic overlap among five disorders (schizophrenia, major depressive disorder, ADHD, obsessive-compulsive disorder, and anxiety disorders) and Asperger syndrome or other ASD subtypes using polygenic risk scores. They reported that the risk of schizophrenia, ADHD, and major depressive disorder is transmitted from parents to children with Asperger syndrome, but that this does not occur in the other ASD subtypes. Their results support the idea that Asperger syndrome is qualitatively different from the rest of the ASD subtypes, and indicates a genetic overlap between Asperger syndrome and ADHD, major depressive disorder, and schizophrenia (58). Furthermore, polygenic risk scores in a recent GWAS of multiple psychiatric disorders revealed a high correlation between schizophrenia and bipolar disorder, a moderate correlation between major depressive disorder and schizophrenia, major depressive disorder and bipolar disorder, major depressive disorder and ADHD, and a small but significant correlation between schizophrenia and ASD. However, other reports have identified no correlations between other pairs of disorders, such as ASD and ADHD, bipolar disorder and ADHD, and schizophrenia and ADHD. These results are unexpected given that there is considerable phenotypic overlap between these disease pairs, and that previous family, twin, and linkage studies have suggested the presence of common genetic risk factors (59–64).
Fromer et al. (65) reported that de novo mutations in genes encoding proteins with specific functions are prevalent in schizophrenic patients (e.g., glutamatergic postsynaptic proteins comprising activity-regulated cytoskeleton-associated protein [ARC] and N-methyl-D-aspartate receptor [NMDAR] complexes). Moreover, genes with de novo mutations in schizophrenia overlap with those affected by de novo mutations in ASD and ID, but not in controls (65). The most extensive exome sequencing study to date (n = 35,584 total samples, 11,986 with ASD), conducted by Satterstrom et al. (66), reported an overlap of genes affected by de novo mutations in ASD and ID, but not in controls. Furthermore, loss-of-function mutations in schizophrenia were also enriched in a small group of genes (n = 7) with recurrent loss-of-function de novo mutations in ASD (p = 0.0018) and ID (p = 0.019). These mutations clustered in sodium voltage-gated channel alpha subunit 2 (SCN2A) and pogo transposable element derived with ZNF domain (POGZ). These genes have been established as ASD-associated genes. Other genes, such as discs large MAGUK scaffold protein 2 (DLG2) and SHANK1, which have been implicated in loss-of-function gene mutations in other neurodevelopmental disorders, were also identified. Thus, there is a clear overlap among schizophrenia, ASD, and ID (66). The list of rare risk alleles for ASD and other psychiatric disorders remains relatively small; as more rare risk alleles are discovered in each diagnostic category, more overlapping genes may be identified.
Recently, GWAS, CNV, and NGS studies have suggested the existence of a set of biological processes that are common to multiple psychiatric disorders. These include L-type calcium channels, postsynaptic scaffolding proteins involved in N-methyl-D-aspartate (NMDA) signal transduction, proteins that interact with ARC (referred to as the ARC complex) (67), brain-expressed genes that are repressed by FMRP (65, 68), and NRXN1 (an associated CNV locus) (69). However, it is not yet fully understood how the biological pathways that are commonly associated with these multiple psychiatric disorders can cause effects across current diagnostic boundaries. Elucidating the biological processes commonly involved in the risk of ASD and related disorders will allow for a more detailed understanding of the pathomechanisms of each disease, and may lead to the development of novel treatments and biomarkers.
In this chapter, we reviewed recent ASD genetic studies and presented evidence of common genetic factors in multiple psychiatric disorders, including ASD, that spread beyond the diagnostic categories of conventional psychiatry. These findings suggest that some etiological mechanisms that are present in the current diagnostic categories may not be specific to each disorder. Future studies are likely to reveal robust risk alleles and mechanisms that are associated with higher disease risks. However, strict regulations on genetic counseling, confidentiality, and data protection need to be applied to the handling of genetic information of individuals with such high disease risk (70). Further genetic research is expected to reveal new genetic risk factors and etiologies common to multiple psychiatric disorders, and to contribute to the discovery of new therapeutic targets.
Acknowledgment: We thank Bronwen Gardner, PhD, from Edanz Group (https://jp.edanz.com/ac) for editing a draft of this manuscript. This work was supported by JSPS KAKENHI (grant number 20K16650) as well as by the Takeda Science Foundation (to YM).
Conflict of interest: The authors declare no potential conflicts of interest with respect to research, authorship and/or publication of this manuscript.
Copyright and permission statement: The authors confirm that the materials included in this chapter do not violate copyright laws. Where relevant, appropriate permissions have been obtained from the original copyright holder(s), and all original sources have been appropriately acknowledged or referenced.