Abstract
Sprinkled throughout the genome are a million regulatory sequences called transcriptional enhancers that activate gene promoters in the right cells, at the right time. Enhancers endow the brain with its incredible diversity of cell types and also translate neural activity into gene induction. Thanks to rapid advances in genomic technologies, it is now possible to identify thousands of enhancers rapidly, test their transcriptional function en masse, and address their neurobiological functions via genome editing. Enhancers also promise to be a great technological opportunity for neuroscience, offering the potential for cell-type-specific genetic labeling and manipulation without the need for transgenesis. The objective of this review and the accompanying 2015 SfN mini-symposium is to highlight the use of new and emerging genomic technologies to probe enhancer function in the nervous system.
SIGNIFICANCE STATEMENT Transcriptional enhancers turn on genes in the right cells, at the right time. Enhancers are also the genomic sequences that encode the incredible diversity of cell types in the brain and enable the brain to turn genes on in response to new experiences. New technology enables enhancers to be found and manipulated. The study of enhancers promises to inform our understanding of brain development and function. The application of enhancer technology holds promise in accelerating basic neuroscience research and enabling gene therapies to be targeted to specific cell types in the brain.
Introduction
Mammalian genomes contain millions of regulatory sequences called transcriptional enhancers that far outnumber the number of genes and enable the tight control of gene expression in the brain. Enhancers endow the brain with its incredible diversity of cell types, effectively encoding cellular identity in the genome via the control of lineage-specific transcription patterns. They also enable neural activity-regulated gene induction by transducing neuronal activity into genomic responses. Enhancers function by interacting directly with gene promoters to activate gene transcription in the right cells at the right time in response to transcription factor binding (Fig. 1). Whereas promoters are defined biochemically by their ability to encode mRNA transcription start sites (TSSs), enhancers are defined by their ability to activate transcription at promoters (Fig. 2).
Enhancers versus promoters. Enhancers juxtapose with their specific target promoters via chromatin looping. Enhancers and promoters both recruit transcription factors and coactivators such as CBP (CREB-binding protein), as well as RNA polymerase II. Whereas enhancers initiate transcription of unspliced, nonpolyadenylated eRNAs, promoters initiate transcription of spliced, polyadenylated mRNAs or long, noncoding RNAs. eRNA transcription typically occurs bidirectionally. In addition to transcribing mRNAs, promoters also drive upstream antisense transcription of transcripts called uaRNAs, which exhibit properties similar to eRNAs. The differences between enhancers and promoters may arise entirely or in part due to the presence of the 5′ splice signal at the end of the first exon of mRNAs. For simplicity, nucleosomes are depicted as gray circles and DNA as a black line, but in reality, DNA is looped around nucleosomes.
Detecting enhancer activity using reporter assays and massively parallel reporter assays (MPRAs). A, Promoter activity is detected by fusing a test sequence upstream of a reporter sequence (i.e., GFP) and introducing it into cells or embryos. B, Enhancer activity is detected by fusing a test sequence near a minimal promoter and reporter. Alternatively, enhancer activity may be detected with the test sequence in the 3′ untranslated region (UTR) of a reporter gene (the STARR-seq configuration; Arnold et al., 2013; Cotney et al., 2015). For high-throughput analysis of enhancer activity, identifying sequences (such as barcodes, BCs) are placed in the 3′ UTR to identify individual enhancers so that they may be combined for “massively parallel” or multiplex quantification. C, MPRAs rely on libraries of plasmids in which the enhancer activity of each test sequence is quantified through sequencing of the BCs (or for STARR-seq, the enhancer itself). For MPRAs in primary neurons or in vivo, enhancer libraries can be packaged into AAV and transduced.
The substantial number and biological importance of enhancers is not yet matched by their mindshare in the neuroscience community. A search for “enhancer” from SfN abstracts in 2014 yields 55 hits, of which 23 relate directly to transcriptional enhancers (the others relate to cognitive, performance, and flavor enhancers). However, the complexity and function of the brain may owe as much to transcriptional enhancers as it does to synapses in that the identity of each cell type in the nervous system is likely encoded by the activity of a specific subset of enhancers that control its transcriptional potential. Likewise, enhancer dysfunction, perhaps even more so than mutations in protein-coding sequences, is likely the source of most common genetic risk for mental illnesses such as schizophrenia.
Thanks to rapid technological advances in genomic technologies, it is now possible to identify rapidly and inexpensively thousands of enhancers, perform high-throughput testing of their transcriptional function in a single experiment, and address their neurobiological functions via genome editing. Beyond this scientific opportunity to understand the mechanisms of neural genome regulation, enhancers promise to be a technological opportunity in the form of an ever better toolkit for genetic labeling of cell types and manipulation of functional gene expression in the nervous system.
How can I find transcriptional enhancers?
After the initial discovery of enhancers, it was soon realized that they could confer upon a promoter cell-type-specific activation (Banerji et al., 1983; Gillies et al., 1983; Heintzman et al., 2009); indeed, this is now understood to be a primary function of enhancers in the nervous system and beyond (Heintzman et al., 2009; Nord et al., 2015). Although understanding enhancer biology represents a great scientific opportunity for also understanding how the genome encodes cellular identity, the cell-type specificity of enhancers, along with their ability to act on specific target promoters in orientation and distance-independent manner, have made finding them and characterizing their function a challenging problem. Initially, enhancers were studied using laborious reporter assays (Fig. 2) designed with few clues about which sequences might function as enhancers beyond evolutionary sequence conservation (Nobrega et al., 2003; Pennacchio et al., 2006). However, it is now possible to identify putative enhancers rapidly based on their chromatin signatures in specific cell populations of interest.
Putative enhancers active in a cell type of interest can be identified by first purifying selected populations of neurons from the heterogeneous environment of the brain via fluorescence-activated cell sorting (Jiang et al., 2008; Mellén et al., 2012) or affinity purification of genetically labeled nuclei (Mo et al., 2015). Antibody-based nuclear sorting methods offer a particular advantage for species including humans for which genetic methods for tagging nuclei are not an option (Cheung et al., 2010). As at promoters, chromatin at enhancers is depleted of histones and therefore accessible to DNase (i.e., it is DNase hypersensitive). Therefore, genome-wide quantification of DNase (or transposase) accessibility (Thurman et al., 2012; Buenrostro et al., 2013; Yue et al., 2014), which identifies all accessible chromatin, can be used in a single experiment to find the set of all putative regulatory elements. Chromatin signatures further distinguish enhancers from promoters and other classes of regulatory elements. Nucleosomes surrounding enhancers typically exhibit a high level of histone H3 lysine 4 mono-methylation (H3K4me1) with lower H3K4me3. Active promoters show the opposite pattern of H3K4 methylation (high H3K4me3 and little to no H3K4me1). H3K4me1-enriched enhancers can be in a poised state or fully active. The poised enhancers have high levels of H3K27me3, a mark associated with transcriptionally repressed or inactive loci. Conversely, functionally active enhancers are additionally bound by H3K27ac and also transcribe a class of noncoding RNAs termed enhancer RNAs (eRNAs). Collectively, these marks are strong predictors of enhancer activity (Kim et al., 2010; Wu et al., 2014). Two common methods of putative enhancer identification are to perform chromatin immunoprecipitation sequencing (ChIP-Seq) to quantify genome-wide binding of H3K27ac or to perform ATAC-seq (a method similar to DNase-seq in that it identifies nucleosome-free regions) (Buenrostro et al., 2013; Nord et al., 2013; Markenscoff-Papadimitriou et al., 2014; Mo et al., 2015). Putative enhancers predicted by these signatures are often bona fide enhancers, but the true test of function is still either to manipulate the enhancer genetically or (most commonly) to use enhancer reporter assays in which a reporter such as lacZ or GFP expression depends on enhancer activity (Fig. 2). Techniques in which enzymatically dead Cas9 is fused to transcriptional repressors or activators are highly promising for investigating enhancer function. However, interpreting the results of this kind of manipulation remains a challenge. Targeting a repressor to an enhancer that is in physical proximity with a promoter could bring the repressor to the promoter and therefore may not reveal much about the importance of the enhancer beyond identifying the promoters with which it interacts.
How do enhancers make developmental choices?
Chromatin undergoes dynamic changes during brain development and neuronal differentiation (Stadler et al., 2011; Lister et al., 2013). Two recent studies illustrate the power of genome-wide enhancer identification and the dynamics of enhancer function in the context of brain development. First, ChIP-Seq experiments assessing H3K27ac binding across mouse development identified 50–100 k putative brain developmental enhancers (Nord et al., 2013). A majority of these enhancers are predicted to be transiently activated during development based on detection of H3K27ac at a subset of developmental stages (Fig. 3). The enhancers that activated earlier in development have greater DNA sequence conservation across species, consistent with the greater diversity of phenotypes along development. Reporter data for large numbers of enhancers is available in the VISTA database (http://enhancer.lbl.gov) and enhancers from this database have been used for lineage identification and fate mapping in the brain (Pattabiraman et al., 2014). Presumably, these enhancers serve as docking sites for distinct combinations of both cell-type-specific transcription factors and more general transcription factors that are repurposed for the programming of different cell types through a complex hierarchy of protein–protein and protein–DNA interactions (Mazzoni et al., 2013; Andzelm et al., 2015). The vast number of enhancers engaged during brain development is an important new opportunity to understand how the cellular diversity of the brain is encoded in mammalian genomes. In addition, these enhancers are now a resource for developing toolkits for developmental stage-specific markers.
Spatial and temporal specificity of enhancers active in the developing forebrain. Developmental dynamics of the enhancer-associated histone mark H3K27ac at putative enhancers in mouse forebrain analyzed by ChIP-seq across seven stages of brain development. Most sites show temporally restricted H3K27ac marks. Each column of the heat map displays H3K27ac enrichment patterns from a collection of putative enhancers for a given developmental stage, which are arranged from constitutively active enhancers at the top to more developmental-stage-specific enhancers proceeding downward.
A second recent study focused on neural differentiation in the cerebellum, examining chromatin regulation in cerebellar granule neurons (CGNs) in vivo (Frank et al., 2015). Using the identification of DNase-accessible sites as a proxy for all gene regulatory elements, the study identified changes in chromatin regulation that define stages of neuronal maturation. As in the forebrain, chromatin at CGN enhancers was highly dynamic during development, even after exit from the cell cycle. A motif analysis within dynamic DNase-accessible sites identified a binding motif for the Zic family of transcription factors. These data led to experiments showing that Zics play an important role in promoting CGN differentiation, which provides an intriguing complement to their known roles in inhibiting the exit of neuronal precursors from the cell cycle. Finally, this study took advantage of the power of the CRISPR/Cas9 system to investigate the functional importance of the putative enhancer elements identified. Specifically targeting an enzymatically dead Cas9 fused to a transcriptional activator to two putative enhancers near the Grin2c gene was found to be sufficient to significantly and selectively increase Grin2c mRNA expression levels in neurons.
Enhancers are also responsible for making one of the most bizarre and unusual gene expression choices in all of mammalian biology. In each olfactory receptor neuron (ORN), just one allele of one of about a thousand different olfactory receptors is expressed, yet the choice is seemingly random for each ORN. One component of this mechanism involves LSD1-mediated removal of histone methylation (histone H3, lysine 9), which leads to expression of an olfactory receptor (OR) (Lyons et al., 2013). OR expression activates the unfolded protein response, ultimately blocking further LSD1 activity (Dalton et al., 2013). A second component of this mechanism involves a network of interacting enhancers in which numerous enhancers colocalize in an interchromosomal interaction complex with an activated OR (Markenscoff-Papadimitriou et al., 2014). These enhancers bind a transcription factor, BPTF, that is required for OR expression. These results suggest a fascinating and complex structural model in which multiple enhancers are recruited to the activated OR gene, both to activate the OR gene and to sequester the enhancers away from other OR genes (Fig. 4).
A model for essential and redundant functions of an OR enhancer in cis and in trans, respectively. Schematic representations of different states in the olfactory sensory neuron nucleus (left) and corresponding transcriptional outputs (right) are shown. An OR gene (orange box) located proximal to an enhancer (orange circle) is repressed by H3K9me3 (red flag, A). The cis-proximal enhancer may facilitate derepression of the OR chromatin landscape (green flag, B), but is not sufficient for OR transcription. Multiple trans-interacting enhancers (colored circles) aggregate around the transcribed cis-proximal OR (orange box, C). This model of enhancer–promoter configuration can explain why some enhancers are required in cis but are redundant in trans.
How do enhancers respond to neural activity?
One of the most exciting findings to arise from the profiling of chromatin signatures in neurons is the discovery that thousands of enhancers are engaged by neural activity to regulate transcription (Kim et al., 2010, 2015). Neural-activity-regulated promoter proximal enhancers—those located within 2 kb of a gene's TSS—have been studied since Fos was first found to be regulated by activity (Greenberg et al., 1986). Genome-scale identification of non-promoter-binding sites for the canonical activity-regulated transcription factors CREB and MEF2 first hinted at the potential for activity-dependent transcriptional regulation of enhancers (Impey et al., 2004; Flavell, 2006), and the later observation that regions with the chromatin signature of enhancers were preferential sites of activity-dependent recruitment of CBP, NPAS4, and FOS binding made clear the fundamental importance of this mechanism (Kim et al., 2010; Malik et al., 2014). Enhancers are important for activity-dependent transcription for two reasons. Similar to the function of developmental enhancers in cell-type-specific gene expression, activity-regulated enhancers can define the set of genes that can be activity inducible in any given type of neuron, thus allowing different brain regions to have distinct transcriptional plasticities in response to experience (Spiegel et al., 2014; Whitney et al., 2014). In addition neuronal-activity-regulated enhancers directly mediate gene induction. The enhancer-mediated induction of synaptic plasticity genes suggests a function for activity-regulated enhancer activation in memory formation (Telese et al., 2015).
One of the best understood activity-regulated enhancers is situated 7 kb upstream of the activity-regulated gene Arc and is called the synaptic-activity-regulated element (SARE). This enhancer was the first distal neural-activity-regulated enhancer to be identified and it remains the best characterized (Kawashima et al., 2009, 2014; Schaukowitch et al., 2014). Upon neural depolarization, this enhancer comes into closer physical proximity with the Arc promoter, and SARE eRNAs are synthesized. These eRNAs bind the negative regulator of transcription elongation (NELF), thereby releasing RNA polymerase II to transcribe the Arc gene. When the SARE eRNA is knocked down, chromatin looping between the enhancer and promoter is unaffected. However, NELF release from the Arc promoter is impaired, and Arc gene induction is reduced. Interestingly, not all neuronal-activity-dependent genes are similarly regulated by NELF (Saha and Dudek, 2013), and the specific functional relevance of this mechanism for neural plasticity remains to be explored.
How relevant are enhancers to human diseases of the CNS?
The genomics revolution has led to large-scale identification of genomic loci that confer risk for common neuropsychiatric disease (Schizophrenia Working Group of the Psychiatric Genomics Consortium, 2014; Network and Pathway Analysis Subgroup of Psychiatric Genomics Consortium, 2015). The causal sequence variants within these risk loci remain almost entirely unidentified, but most of them reside in noncoding regions of the genome. These variants are highly likely to affect enhancers, given that much of the functional noncoding genome is devoted to enhancers (Hemberg et al., 2012). Some of these variants might disrupt transcription factor binding at enhancers, as appears to be the case for a variant in the MEIS1 enhancer locus that is associated with “restless legs syndrome” (Spieler et al., 2014). Alternatively, noncoding mutations outside of enhancers could disrupt long-range promoter–enhancer interactions, as has been suggested for variants near the FKBP5 gene that are associated with posttraumatic stress disorder (Klengel et al., 2013). Risk for diseases of the CNS is influenced not only by variation within enhancers active in the CNS but also by variation in enhancers active in other tissues. Common variants within blood cell enhancers associate with risk for depression, affect cortisol-dependent transcription in blood cells, and predict inappropriate amygdala reactivity, which is a common psychiatric endophenotype for stress-inducible psychiatric disorders (Arloth et al., 2015). An immediate priority is to identify which genetic variants that correlate with disease also affect enhancer function and gene expression—a daunting task given that enhancers often function only in specific cell types.
In addition to variation in enhancer sequences, disease states may involve misregulation of enhancers in trans. High-throughput sequencing of patient “exomes” (exon-encoding genomes) recently identified mutations in 22 genes underlying autism spectrum disorders (De Rubeis et al., 2014). Surprisingly, a disproportionate number of these genes function to remodel chromatin. Chromatin remodelers (including CHD8, ARID1B, POGZ, and SUV420H1) may alter the histone modifications and positioning of nucleosomes at enhancer sequences (Cotney et al., 2015).
How can we screen thousands of enhancers at a time?
A major challenge for human health will be to determine which of the numerous human genomic sequence variants influence enhancer function. In a big step forward, enhancer reporter assays can now be performed in high-throughput (Fig. 2). High-throughput reporter assays have so far been performed in vitro (Melnikov et al., 2012; Arnold et al., 2013; Shlyueva et al., 2014) and in retinal explants (Kwasnieski et al., 2012). So-called massively parallel reporter assays (MPRAs) are performed much like traditional reporter assays except that a single plasmid with one enhancer is replaced by a library with many enhancers. Instead of measuring GFP fluorescence, high-throughput sequencing is used to quantify cDNA barcodes whose expression reports the activity of a specific enhancer or test sequence. In Drosophila, a related method called STARR-seq has been used to screen the entire genome for enhancer activity (Arnold et al., 2013). These tools promise to make it possible to determine which of the many sequence variants associated with human neuropsychiatric disorders actually influence enhancer activity.
What is the difference between an enhancer and a promoter?
Enhancers and gene promoters have long been considered two separate classes of regulatory sequences. They are now increasingly recognized for their functional and structural similarities and may be differentiated only by the presence of a splicing signal (Almada et al., 2013; Andersson et al., 2014; Core et al., 2014; Andersson et al., 2015). Structurally, both promoters and enhancers can be envisioned as islands of transcription-factor-binding sites in a sea of mostly nonfunctional genomic sequence. The function of sequencing-binding transcription factors is typically to recruit transcriptional machinery to local genomic regions. Therefore, it is not too surprising that from these islands initiate divergent transcription whether those binding sites are near a gene body or distal from that site (Core et al., 2014; Scruggs et al., 2015). Functionally, the very first enhancers to be studied were observed to have promoter and enhancer activity (Moreau et al., 1981; Tuan et al., 1992; Ashe et al., 1997). We now know that, in addition to mRNA transcription, transcription of eRNAs occurs at many or most enhancers (Fig. 1) (De Santa et al., 2010; Kim et al., 2010). Whether an mRNA or eRNA is produced appears to depend on the presence of a 5′ splice site within a few hundred nucleotides of the initiation site (Kaida et al., 2010; Almada et al., 2013; Ntini et al., 2013).
Whether eRNAs are universally required for enhancer function is unknown. When cells differentiate or respond to a stimulus, eRNAs are transcribed before genic mRNAs are transcribed (De Santa et al., 2010; Arner et al., 2015). Enhancer RNA induction at neural-activity-regulated enhancers is greatest near the genes with mRNAs that are also induced the most (Kim et al., 2010). These results suggest the possibility that enhancer transcription or eRNA transcripts are required for enhancer function. Multiple models of eRNA function have been proposed based on experiments outside of the nervous system, including that eRNAs facilitate enhancer–promoter looping or mediate chromatin remodeling near the promoter region (Ørom et al., 2010; Li et al., 2013; Mousavi et al., 2013; Hsieh et al., 2014). Alternatively, transcription of eRNAs rather than eRNA transcripts may be functionally important (Kaikkonen et al., 2013). As mentioned above, in neurons, eRNA-dependent release of a negative regulator (NELF) from the Arc promoter drives neuronal activity-dependent transcription of Arc mRNA (Schaukowitch et al., 2014). From this menagerie of mechanisms, each reported at a different enhancer in a different biological system, we hope that universal mechanisms will eventually emerge.
How should we develop enhancers as tools for neuroscience?
One of the most valuable uses of the mass of enhancer data gathered in recent years is to improve genetic access to CNS subtypes without the need for transgenesis. Enhancers are small enough that many of them can fit together within an adeno-associated virus (AAV) genome and still leave plenty of room for a promoter, cDNA and other necessary components. Developing enhancer-based genetic labeling tools is a goal that is worth prioritizing because it would enable specific cell types within neural circults to be manipulated (e.g., optogenetically), without the need for time-consuming and costly transgenesis. There are two obvious strategies to achieving this goal. The DESIGNER strategy begins with finding putative enhancers predicted to be active in a particular tissue or cell type (Nord et al., 2013; Vermunt et al., 2014; Mo et al., 2015). Next, it combines those predicted enhancer sequences with promoters (and with each other) and tests them for the desired expression patterns in reporter assays. In contrast, the SCREENING strategy examines the function of random libraries of DNA (or enhancers) in reporter assays and tests for any specific expression pattern in the brain (Pennacchio et al., 2006). Success in the DESIGNER strategy could be an AAV expression system for genetic access to PV+ neurons. The SCREENING strategy instead would produce AAV vectors that label many different, unknown subsets of neurons, analogous to how the GENSAT project produced BAC-transgenic mice (Gerfen et al., 2013). These two strategies can be viewed as two ends of a spectrum in which the emphasis in resource investment is placed on enhancer identification or on reporter assay screening.
The DESIGNER strategy has so far been laborious. There are few AAV-based promoters that drive expression in neural subtypes. The commonly used short CaMKII promoter fragment is an exception (Dittgen et al., 2004) that is nonetheless not 100% specific to excitatory neurons (Nathanson et al., 2009b). In one example of the DESIGNER strategy, regulatory sequences from subtypes of inhibitory neurons drive expression specifically in inhibitory neurons, but not in specific subtypes (Nathanson et al., 2009a). In another example, five tandem copies of the SARE labeled neurons in an activity-dependent manner more effectively than the one copy that suffices in the genomic context (Kawashima et al., 2013, 2014). The biggest challenge in the DESIGNER strategy is that functional testing is always required because regulatory elements, once recombined into plasmids, behave unpredictably. Unfortunately, functional testing has until recently been laborious.
If MPRAs could be extended to work in vivo in a cell-type-specific manner, then they could be used to screen for highly cell-type-specific expression. A hybrid strategy that combines elements of the DESIGNER and SCREENING strategies would be to identify putative cell-type-specific enhancers directly or based on their proximity to cell-type-specific genes, which are now readily identifiable by single-cell drop-seq (Klein et al., 2015; Macosko et al., 2015). Next, the hybrid strategy would use in vivo cell-type-specific MPRA to screen identified sequences for actual cell-type-specific enhancers that function in AAV-transduced tissue. The immediate product, as with the GENSAT project (Gerfen et al., 2013), would be a reagent. A similar screening strategy could prove powerful for screening genetic variants for their potential effect on enhancer function.
Conclusions
These are exciting times for unraveling the complex mechanisms by which gene expression in the brain is controlled. New tools are emerging that promise to revolutionize our ability to dissect gene regulation at level of single cells (Buenrostro et al., 2015), to examine transcriptional dynamics across single cells in the brain at rapidly increasing resolution (Klein et al., 2015; Macosko et al., 2015), to perform targeted genome editing (Li et al., 2014; Meyer et al., 2015), and to generate synthetic transcription factors to probe the activity of enhancers in the endogenous chromatin context (Hilton et al., 2015; Kearns et al., 2015). At the same time, there are large-scale efforts to characterize the role of enhancer function in disorders such as schizophrenia (psychENCODE) and autism (The Simons Autism Genomes Project) and to build our understanding of cell-type-specific enhancer activity during neurodevelopment and after environmental stimulation or challenge. We anticipate that the accompanying 2015 SfN mini-symposium on neural enhancers will mark a new period of increased interest within neuroscience about these critical components of the genome.
Footnotes
J.M.G. acknowledges support from the National Institutes of Health (Grant R01MH101528). T.-K.K. acknowledges support from the National Institutes of Health (NIH-NINDS R01NS085418).
The authors declare no competing financial interests.
- Correspondence should be addressed to Jesse M. Gray, Department of Genetics, Harvard Medical School, COMPLETE MAILING ADDRESS, Boston, MA 02115; gray{at}genetics.med.harvard.edu