Abstract
Complex learned behavior is influenced throughout development by both genetic and environmental factors. Birdsong, like human speech, is a complex vocal behavior acquired through sensorimotor learning and is based on coordinated auditory input and vocal output to mimic tutor song. Song is primarily learned during a specific developmental stage called the critical period. Although auditory input is crucial for acquiring complex vocal patterns, its exact role in neural circuit maturation for vocal learning and production is not well understood. Using audition-deprived songbirds, we examined whether auditory experience affects developmental gene expression in the major elements of neural circuits that mediate vocal learning and production. Compared with intact zebra finches, early-deafened zebra finches showed excessively delayed vocal development, but their songs eventually crystallized. In contrast to the different rates of song development between the intact and deafened birds, developmental gene expression in the motor circuit is conserved in an age-dependent manner from the juvenile stage until the older adult stage, even in the deafened birds, which indicates the audition-independent robustness of gene expression dynamics during development. Furthermore, even after adult deafening, which degrades crystallized song, the deteriorated songs ultimately restabilized at the same point when the early-deafened birds stabilized their songs. These results indicate a genetic program-associated inevitable termination of vocal plasticity that results in audition-independent vocal crystallization.
Introduction
Songbirds and humans learn vocalization to communicate through sounds by tutor mimicry and sensory feedback. The learning of birdsong, a complex sequenced motor pattern, occurs within a defined critical period during which juvenile songbirds listen, memorize, and gradually match their own developing vocalizations to the song of an adult tutor. The brain areas associated with song learning and production, the song nuclei, are organized into two major circuits: (1) the posterior vocal motor circuit and (2) the anterior basal ganglia–forebrain circuit. The vocal motor circuit receives direct projections from auditory areas (Bauer et al., 2008) and participates in song pattern generation in a hierarchical manner by regulating syllable sequences and acoustic features (Yu and Margoliash, 1996; Hahnloser et al., 2002). In contrast, the basal ganglia–forebrain circuit, also called the anterior forebrain pathway (AFP), plays a critical role in song learning by producing the vocal exploration necessary for audition-dependent vocal plasticity (Bottjer et al., 1984; Scharff and Nottebohm, 1991; Brainard and Doupe, 2000; Andalman and Fee, 2009).
Developmental epigenetic factors, such as social interaction (Kroodsma and Pickert, 1984) and nutrition (Nowicki et al., 2002), influence vocal learning. In addition, an auditory input is crucial for the acquisition of birdsong and human speech as an epigenetic factor for sensorimotor learning (Konishi, 1965; Ruben, 1997). Audition provides important information for vocal learning, both for hearing model songs and precise evaluation of own vocal output. However, audition-deprived songbirds can still develop a certain degree of species-specific song, albeit noisy and amorphous (Price, 1979; Marler and Sherman, 1983). This suggests that inherited genetic predisposition also contributes to song development and the generation of song patterns, but the mechanism by which auditory input regulates the genetic programs in neural substrates mediating development and stabilization of complex vocal patterns remains unclear.
Here we investigate the neural molecular mechanisms contributing to song development by comparing song output and intrinsic gene expression profiles within song nuclei between the intact and audition-deprived zebra finches. The zebra finch is known as a closed-ended learner of complex vocal patterns given that, once a stable song pattern “motif” is developed, the song structure remains unchanged throughout life (Immelmann, 1969; Eales, 1985; Zann, 1996). This stereotypy of crystallized song enables precise quantification of the similarities and differences in vocal development and song patterns between the intact and deafened birds, thus allowing examination of how genetic and epigenetic factors contribute to acquisition and maintenance of complex vocal patterns. We showed that early-deafened zebra finches can still develop crystallized songs but do so more slowly than the intact birds. Eventual development of crystallized songs in the deafened birds is correlated with audition-independent changes in gene expression that continue to emerge well beyond sexual maturity. These results provide evidence for audition-independent regulation of vocal development and crystallization associated with intrinsic developmental gene expression dynamics.
Materials and Methods
Animals.
We used male zebra finches from our breeding colony ranging from 47 to 1780 d post-hatching (dph). The photoperiod was constantly maintained at a 14/10 h light/dark cycle with food and water provided ad libitum. The sex of birds was checked by PCR as reported previously (Wada et al., 2006) within 3 d after hatching. The birds were raised in individual breeding cages with their parents and siblings until at least dph 30 and were then housed in single-sex groups (6–10 birds per cage) with visual and auditory access to both the male and female birds. We divided the animals into the following developmental stages to examine age-associated changes: (1) dph 47–65, juvenile; (2) dph 100–187, young adult; and (3) >dph 300, older adult. For social isolation, the juveniles were separated from their fathers by dph 5 and raised by their mothers with their siblings in a sound-attenuation box until they could feed themselves (dph 35). No song tutor was provided, and they could not see the other zebra finches or hear other songs after dph 35. All animal experiments were performed according to the guidelines of the Committee on Animal Experiments of Hokkaido University from whom permission for this study was obtained. The guidelines are based on national regulations for animal welfare in Japan (Law for the Humane Treatment and Management of Animals; after partial amendment number 68, 2005).
Song recording and analysis.
The birds were individually housed in sound-attenuation boxes (box size, 65 × 27 × 30 cm; cage size, 54 × 22 × 23 cm). Songs were automatically saved 24 h/d using the Sound Analysis Pro program (version 1.04) (Tchernichovski et al., 2000) at 16 bits and a 44 kHz sampling rate. Analysis of syllable acoustic structure was performed using Sound Analysis Pro and Avisoft Saslab (Avisoft Bioacoustics). Low- and high-frequency noises (<0.5 and >1.9 kHz) were filtered from the recordings using Avisoft Saslab, and the low-frequency noises (<1.5 kHz) were further filtered using Sound Analysis Pro. For all analyses, we randomly selected songs per time point and manually eliminated calls and cage noise from the dataset. The entropy and amplitude parameters of syllables were set manually, and the syllables were segmented using Sound Analysis Pro. A song bout was defined as the continuous production of syllables, followed by at least 400 ms of silence. Typical song bouts in adult zebra finches start with introductory notes, followed by one or more motifs that consist of the same stereotyped sequence of syllables.
To measure the degree of song stabilization, we calculated song similarity by the Kullback–Leibler (K–L) distance analysis (Wu et al., 2008) between song at each developmental stage and at >1 year of age. The K–L distance analysis compares the probability density function from two large sets of syllables and quantifies the difference. Syllable density scatter plots were treated as random samples from a two-dimensional probability distribution. For construction of two-dimensional scatter plots with syllable duration and mean frequency modulation, we used 600–1000 syllables (mean ± SD, 982 ± 68) at each developmental stage for each bird. The two-dimensional scatter plot was partitioned into an M × N array of bins. To obtain an accurate and consistent estimation of the distribution, M and N were kept constant throughout all days of singing. The number of data points in each bin was counted and divided by the total number of points in the plot. We represented the two-dimensional scatter plot generated at >1 year of age by Ps and the scatter plot at each developmental stage as Pk. The difference between Ps and Pk was calculated to obtain the estimate probability density functions (denoted Qs and Qk) for the two scatter plots. The K–L distance is represented by the following equation: where qs(m, n) and qk(m, n) denote the estimated probabilities for bin (m, n). Large values of the K–L distance indicate that the two patterns are more dissimilar, whereas a K–L distance of 0 indicates a perfect match. We normalized the K–L distance value to the average K–L distance at the first stage of development (dph 49–59) for each experimental group. We used 15 male birds (intact, n = 4; early-deafened, n = 6; socially isolated, n = 5) for the K–L distance analysis.
To examine the correlation between amount of singing and expression level of immediate early genes as neural activity-dependent gene markers, we measured total singing duration (time spent producing songs) before brain sampling. We counted the number of bouts of singing within a day at least 1 d after the bird (either intact or deafened) was put into a sound-attenuated box (20 juveniles, n = 10 per group; 19 young adults, n = 9–10 per group; 21 older adults, n = 10–11 per group).
To compute correlations between song traits and DNA microarray expression data, we calculated mean values and coefficients of variation (CVs) of nine song parameters for each bird from 100 syllables recorded within 1 week before brain sampling: (1) syllable duration; (2) amplitude; (3) pitch; (4) frequency modulation (FM); (5) Wiener entropy; (6) goodness of pitch; (7) intersyllable duration; (8) syllable number in a motif; and (9) motif consistency. Motif consistency was measured as motif similarity score within 1 d and was calculated by the default setting in the Sound Analysis Pro software using the “time course” and “symmetric” comparison modes. This procedure can calculate the similarity score between two motifs on the basis of comparison of syllable acoustic features (e.g., pitch, FM, amplitude modulation, Wiener entropy, and goodness of pitch) within sliding 9 ms time windows and represents a global measure of percent similarity. For motif consistency analysis, song motifs were extracted from sequences of three to five syllables at the beginning of the bout but excluding introductory notes. We compared 20 randomly selected motifs in each bird.
Measurement of testosterone levels.
We measured the serum concentration of circulating testosterone in the male juveniles (n = 16, 8 per group), young adults (n = 20, 10 per group), and older adults (n = 20, 10 per group) to examine whether early-stage deafening affects the rate of maturation. Blood was sampled before lights on (morning) under silent conditions. Blood serum was isolated by centrifuging whole blood for 15 min at room temperature and was stored at −30°C until testosterone analyses. The blood serum samples were diluted by ARCHITECT Testosterone Manual Diluent (Abbott Laboratories), and testosterone levels were measured by BML.
Deafening.
The birds were deafened by cochlear extirpation at dph 17–23 (before fledging) to establish the early-deafened group or at dph 92–157 (after song crystallization) to establish the adult deafened group. The birds were anesthetized with pentobarbital (6.48 mg/ml; 60 μl/10 g body weight) by intraperitoneal injection. After fixing the head in a custom-made stereotaxic apparatus with ear bars, a small window was made through the neck muscle and the skull near the end of the elastic extension of the hyoid bone. A small hole was then made in the cochlear dome. The cochlea was pulled out with a fine hooked wire. The removed cochleae were confirmed by visual inspection under a dissecting microscope. After bilateral cochlear removal, the birds were put back in their nest and kept with their parents and siblings until dph 32–41.
Brain lesion and its evaluation.
We used 16 adult male birds for lesion experiments (intact, n = 8; early-deafened, n = 8). Songs recorded at least 2 d before surgery are referred to as HVC (used as a proper name; the motor circuit)-intact songs. The bilateral HVC was lesioned by local injection of 0.44 μl of 1% ibotenic acid dissolved in 1 m NaCl using a Nanoject 2 injector (Drummond Scientific). Songs were then re-recorded multiple times starting after 1 week of surgery. To examine the effect of lesions on singing, syllable duration distributions were calculated using 700 syllables.
To confirm lesioned areas, the anesthetized birds were perfused with 1× PBS and then 4% paraformaldehyde/1× PBS. Sagittal sections, 30 μm thick, were cut on a freezing microtome and were free-floated in 1× PBS. The brain sections were then transferred to 0.3% H2O2/1× PBS for peroxidase inactivation, washed three times with 1× PBS, and reacted overnight at 4°C in blocking solution containing mouse anti-NeuN monoclonal antibody (1:2000; Millipore). After three washes with 1× PBS, the sections were incubated in biotinylated goat anti-mouse IgG serum (1:400; Vector Laboratories), washed three times with 1× PBS, and then submerged in avidin–biotin–peroxidase complex solution (PK-6101; Vector Laboratories). Antibody labeling was visualized with DAB solution (SK4100; Vector Laboratories). We stained one of every four sections to estimate lesion size and location.
Brain tissue collection.
We collected brain samples from the intact and early-deafened zebra finches under the following two conditions: (1) silence and darkness with no singing and just before switching the light on (intact, n = 20; early-deafened, n = 19); and (2) with singing for 30–60 min after switching the light on (intact, n = 19; early-deafened, n = 18). To ensure similar circadian rhythms between the silent and singing conditions, sample collection was performed in the morning. To collect brain samples after singing, all the animals were individually placed in a sound-attenuated box at least 2 d before singing, and they were subsequently killed by decapitation. In both groups, the brains were rapidly removed, frozen in a plastic mold with Tissue-Tek OCT compound (Sakura Fine Technical), placed on dry ice, and stored at −80°C until use for in situ hybridization or for laser capture microdissection (LCM).
Radioisotopic in situ hybridization and quantification.
Serial sagittal sections, 12 μm thick, were cut throughout the brain, and 35S-labeled riboprobe in situ hybridization was performed as described previously (Wada et al., 2004). We first amplified cDNA fragments [dual-specific phosphatase 1 (Dusp1), Early growth response 1 (Egr1), hairy/enhancer-of-split related with YRPW motif protein 1 (Hey1), MAX-interacting protein 1 (Mxi1), neuronal differentiation 6 (Neurod6), sex determining region Y-box 4 (Sox4), and zinc finger protein 238 (Znf238)] by PCR from the zebra finch whole-brain first-strand cDNA and cloned them into pGEMT-easy vectors. PCR was performed to amplify the plasmid region that included the cloned cDNA between T7 and SP6 RNA polymerase binding regions. The resulting PCR fragment was used as the DNA template for T7 or SP6-primed in vitro transcription to synthesize 35S-labeled riboprobes for in situ hybridization. Frozen sections were fixed in 3% paraformaldehyde/1× PBS, acetylated, dehydrated in an ascending ethanol series, and then hybridized with 35S-labeled riboprobe (1 × 106 cpm/slide) in hybridization solution (50% formamide, 10% dextran, 1× Denhardt's solution, 12 mm EDTA, 10 mm Tris HCl, pH 8.0, 30 mm NaCl, 0.5 μg/μl yeast tRNA, and 10 mm DTT) at 65°C for 14 h. The slides were then washed and exposed to β-Max Hyperfilm (Kodak) before being immersed in NTB-2 emulsion (Kodak) for 2–5 weeks. The slides were then developed and stained with cresyl violet. The signal intensity of mRNAs in song nuclei and other brain regions was calculated by a previously described procedure (Wada et al., 2006). The exposed x-ray films of brain images were digitally scanned under a dissecting microscope (Z16 APO; Leica) connected to a CCD camera (DFC490; Leica) using Application Suite V3 imaging software (Leica). The same light and camera settings were used for all images to facilitate comparison. We used Photoshop (Adobe Systems) to measure the mean pixel intensities in the brain areas of interest from two or more adjacent sections after conversion to a 256 grayscale.
We performed regression analysis using the statistical software R (http://www.r-project.org/; R Core Team, 2013; R Foundation for Statistical Computing) to compare Dusp1 and Egr1 expression levels induced by singing in the AFP between the four experimental groups. To compare expression levels of Hey1, Mxi1, Neurod6, Sox4, and Znf238, F tests were first performed to confirm the homoscedasticity of the expression ratio between song nuclei, followed by Student's t tests or Welch's t tests. For all tests, statistical significance was considered at p < 0.05 after Bonferroni's correction.
LCM.
The brains were serially cryosectioned at 14 μm thick onto handmade membrane slides for LCM. The tissue sections were stained for Nissl bodies to histologically confirm the presence and boundaries of each nucleus and were then dehydrated before LCM. Briefly, frozen sections were sequentially exposed to 75% EtOH for 30 s, 50% EtOH for 30 s, cresyl violet for 15 s, 50% EtOH for 30 s, 75% EtOH for 30 s, 95% EtOH for 30 s, 100% EtOH for 30 s, 100% EtOH for 1 min, 100% EtOH for 3 min, 2 × 2 min xylene, and finally air dried for 5 min (LCM Staining kit; Ambion). HVC and the robust nucleus of the arcopallium (RA) tissues were microdissected using a laser capture microscope (ArcturusXT; Arcturus Bioscience). The laser parameters for LCM were as follows: spot diameter, 100 μm; laser power, 80 mW; and laser duration, 80 ms. The microdissected tissue was directly collected into RLT Buffer from the RNeasy Micro kit (Qiagen) supplemented with β-mercaptoethanol. The samples were vortexed, centrifuged, and stored at −80°C until RNA purification.
RNA purification, amplification, and confirmation of the linearity of amplification.
We isolated and amplified RNA according to the procedures of the Agilent Gene Expression FFPE Workflow (http://www.chem.agilent.com/Library/applications/5990-3917en_hi.pdf). RNA was column purified (RNeasy Micro kit; Qiagen) and treated with DNase on the column to remove genomic DNA. To quantify RNA quality, RNA integrity number (RIN) and concentration were measured by a Bioanalyzer 2100 (Agilent Technologies). Because of the limited yield of extracted RNA, we randomly selected four RNA samples to examine RIN and RNA concentration (RIN, 5.9–7.1; RNA concentration, 1.69–2.35 ng/μl). Purified RNA was amplified using the TransPlex Whole Transcriptome Amplification kit (WTA2 kit; Sigma-Aldrich) according to the instructions of the manufacturer. The amplified cDNA was purified by removal of residual primers and nucleotides using the QIAquick PCR purification kit (Qiagen). Quality and yield of the amplified cDNA were measured by a NanoDrop ND-8000 (Thermo Fisher Scientific) and by electrophoresis. A 1.7 μg sample of total amplified cDNA was converted into Cy3-labeled cDNA using the Genomic DNA ULS labeling kit (Agilent Technologies) following the Agilent Gene Expression FFPE Workflow. Labeled cDNAs from each sample were hybridized to a DNA microarray.
DNA microarrays.
Agilent zebra finch 4 × 44k oligoarrays (AMADID number 021323) containing 44,969 cDNA 60-mer probes (including 11,549 genes as gene-symbol identified genes) were designed at the Jarvis Laboratory of Duke University on the basis of sequence information of cDNA libraries from Duke University, Michigan State University, Rockefeller University, and the Keck Center of the University of Illinois (Wada et al., 2006; Li et al., 2007; Replogle et al., 2008). HVC and RA tissue samples (each n = 3 at juvenile, young, and old stages) were laser microdissected from the intact and early-deafened birds (for both the intact and early-deafened birds at juvenile, young, and old stages: n = 4, 5, and 3 birds, respectively). Because of experimental difficulty at the RNA amplification step, some birds were used for only HVC or RA (thus, the number of sample birds at some stages exceeded three). Single-channel hybridizations were conducted for each sample that was hybridized to a single array. Because four arrays were printed separately on a slide, we hybridized four samples matched for the age and brain region as a single batch to minimize possible interslide bias or batch effects. After hybridization, the arrays were washed and scanned, and the data were extracted from the scanned images (G2505B; Agilent Technologies) using Feature Extraction version 9.1 (Agilent Technologies). The DNA microarray data reported in this study were deposited in the Gene Expression Omnibus database (accession number GSE56075).
DNA microarray statistical analysis.
All subsequent preprocessing and analyses were performed using Microsoft Excel and the statistical software R. Probe refers to a single probe on the array. For the majority of gene-symbol identified genes, multiple probes were present on the array. HVC and RA samples were separated, and the two datasets were preprocessed separately as follows. First, control probes were removed, and probes with raw fluorescence values >3 SDs from the mean of all samples for that probe were removed. To reduce noise, probes with raw fluorescence values <50 in >80% of samples in each nuclei were removed. A total of 41,525 probes for 11,391 gene-symbol identified genes in HVC and 41,340 probes for 11,374 gene-symbol identified genes in RA fit these criteria and were processed. Arrays were quantile normalized, and the fluorescence value was transformed to log2 scale.
ANOVA was used to identify probes differentially expressed between developmental stages in the intact birds (p < 0.05; 3362 and 1466 probes in HVC and RA, respectively). The expression levels of these genes were compared between the intact and early-deafened birds to assess how auditory input influenced gene expression levels. Principal component analysis (PCA) using the prcomp package and three-dimensional view using the rgl package in R were used to visualize the separation of samples.
Weighted gene coexpression network analysis (WGCNA) was performed using the WGCNA R package for additional investigation of the relation between biological traits (age, audition status, and song characteristics) and clusters of coregulated genes (Langfelder and Horvath, 2008). General information about network analysis methodology and WGCNA software is available at http://labs.genetics.ucla.edu/horvath/htdocs/CoexpressionNetwork/. Differentially expressed probes (9903 and 6536 probes in HVC and RA, respectively) selected on the basis of a minimum 1.5-fold change and p < 0.05 (t test) within a song nucleus were included. Pairwise Pearson's correlation coefficients were calculated for all genes selected. The resulting Pearson's correlation matrix was transformed into a matrix of connection strengths (an adjacency matrix) using a power function [(1 + correlation)/2)β], which was then converted to a topological overlap matrix. WGCNA identifies modules of densely interconnected probes by hierarchical clustering based on topological overlap, a biologically meaningful measure of similarity of expression patterns among all pairs of probes across all treatment conditions and by assigning each probe to a “module” based on shared expression patterns. A preliminary network was built to assess overall connectivity. From this network, the 6500 and 3800 probes in HVC and RA, respectively, with the highest connectivity were retained for subsequent WGCNA. For analysis using all expressed probes, we first omitted the probes with low connectivity (whole-network connectivity; k.total < 0.5 and gray module), and then 32,607 probes in HVC and 21,669 probes in RA were used in the network construction. Modules were defined as branches of the dendrogram obtained from clustering and were labeled by colors beneath the dendrograms. To study the relationship between expression variability within the modules and behavioral trait variability, correlations were computed between principal components of each module and traits. p values were computed for each correlation.
Results
Song development and stabilization in early-deafened songbirds
To examine how a song developed and the characteristic song patterns under an audition-deprived condition, the male zebra finches were deafened by removing the cochleae before fledging (dph 18–22; average dph, 19; n = 6), a time during which no song production is observed, and song ontogeny was compared in the intact and socially isolated birds (n = 4 and 5, respectively). Because of sensorineural hearing loss, the songbirds missed listening to song models and monitoring their own vocal outputs. These early-deafened birds showed slower song development than the intact and socially isolated birds. Clear differences in song development were observed at dph 140–180; during this period, the majority of the intact birds exhibited crystallized song patterns (Fig. 1A,B,D). In contrast, the early-deafened birds still generated variable song sequences with fewer harmonic syllables, resembling the early plastic songs recorded during early normal development. However, the deafened birds eventually exhibited stabilized song patterns characterized by stable temporal sequences of distinguishable syllables by approximately dph 300, which is approximately three times longer than the time required for song crystallization in the intact birds (Fig. 1C,D). There was a significant difference in the degree of song stabilization between the early-deafened and intact birds at dph 140–180 (Student's t test with Bonferroni's correction, p < 0.01; Fig. 1D). Although there was no significant difference in the song stabilization trajectory between the socially isolated and early-deafened birds, the socially isolated birds exhibited a tendency for generating “parts” of a motif structure and stabilizing the acoustic features for the majority of syllables with a time course similar to that in the intact birds (Fig. 1A,B, syllable scatter plots). In contrast, all the early-deafened birds continued to generate acoustically unstable syllables across bouts, and their stable motif patterns did not emerge after more than dph 300. The stabilized songs of the deafened birds were composed of a motif structure showing a variety of individual differences in the number of syllables, acoustic features, syllable durations, and intersyllabic intervals (Figs. 1C, 2A). These individual differences were observed even among siblings from the same family (Fig. 2A). During the developmental period before song crystallization/stabilization, circulating testosterone concentration and singing frequency were not significantly different between the intact and early-deafened birds (Fig. 2B,C). Furthermore, it was also observed that the early-deafened birds directed songs to females and exhibited normal mating behavior at approximately dph 100, the typical time of sexual maturity, suggesting that delayed song stabilization in the early-deafened birds did not generally reflect retardation in development. These results indicate that hearing loss at an early developmental stage delayed but did not prevent the eventual slow development of some stable syllables and structured vocal sequences.
Activity of the basal ganglia–forebrain circuit during audition-independent song crystallization/stabilization
We next set out to examine where auditory deprivation affects the neural circuits that mediate vocal learning and production and how it causes delayed song development and audition-independent song crystallization. The basal ganglia–forebrain circuit AFP is considered to contribute to the generation of vocal variability necessary for audition-dependent song plasticity (Kao et al., 2005; Ölveczky et al., 2005; Andalman and Fee, 2009). Lesions of the AFP output nucleus, lateral magnocellular nucleus of the anterior nidopallium (LMAN), and basal ganglia nuclei Area X prevent song deterioration induced by auditory deprivation at the adult stage (Brainard and Doupe, 2000; Kojima et al., 2013). In addition, the pharmacological inactivation of LMAN dramatically reduces the acoustic and sequence variability of songs at the juvenile stage (Ölveczky et al., 2005). Based on these observations, we hypothesized that abnormally increased or decreased activity of the AFP nuclei during song development delays song development and stabilization in the early-deafened birds. To test these possibilities, we compared the neural activity of AFP among the juvenile (dph 47–59; mean ± SEM, dph 55.17 ± 0.58), young adult (dph 102–147; mean ± SEM, dph 133.16 ± 3.89), and older adult (dph 327–1715; mean ± SEM, dph 881.06 ± 67.24) birds (both intact and early-deafened) using two approaches: (1) molecular mapping analysis of activity-induced genes (Wada et al., 2006; Horita et al., 2012); and (2) behavioral analysis of vocal outputs driven by AFP (Aronov et al., 2008). The neural activity-induced genes Dusp1 and c-fos were used as molecular markers of AFP activity because both were induced by singing in all three AFP song nuclei: LMAN, striatum nucleus Area X, and dorsal lateral nucleus of the medial thalamus (DLM) (Fig. 3A). There were no significant differences in singing-induced expression levels of Dusp1 in the same nuclei between the age-matched early-deafened and intact birds (Fig. 3B). The consistent result was also observed with the c-fos probe (data not shown). Furthermore, after bilateral lesions of the premotor nucleus HVC, both the intact and early-deafened birds at the young and adult stages exhibited highly similar AFP-driven vocal outputs with coincident syllable duration distributions (Fig. 3C,D). These results suggest that molecular signaling cascades in the AFP nuclei and AFP-driven premotor outputs are similarly regulated regardless of age or the presence of auditory inputs.
Audition-dependent sensitivity and audition-independent robustness of gene expression dynamics in the motor circuit during song development
Because we observed unaltered AFP activity throughout song development in both the intact and deafened birds, we considered the possibility that the autonomous changes associated with song development regulation and crystallization timing exist in the vocal motor output circuit. A decrease in vocal variability has been attributed to the strengthening of inputs from HVC and the concurrent weakening of inputs from LMAN to RA at the synaptic level (Herrmann and Arnold, 1991; Stark and Perkel, 1999). Furthermore, HVC is the only song system nucleus that receives direct projections from auditory areas (Bauer et al., 2008). Adult deafening reduces the size of dendritic spines and alters spontaneous action potential activity in HVC projection neurons to Area X (Tschida and Mooney, 2012). Therefore, HVC and RA might be affected by audition deprivation and delayed vocal development. In addition, several studies have indicated a relationship between song development and gene expression changes in the motor circuit (Soderstrom and Tian, 2006; Wada et al., 2006; Balmer et al., 2009; Matsunaga et al., 2011). Thus, we first examined the induction level of Egr1 (Zenk), a neural activity-induced and neural plasticity-related gene (Jones et al., 2001), that was shown previously to be more highly induced by singing in the motor nucleus RA of juveniles compared with that of adult birds (Jin and Clayton, 1997b). As reported previously, Egr1 expression in the RA was significantly higher after singing during the juvenile plastic song phase (dph 47–59) than that during young adult (dph 104–147) or older adult (dph 327–1715) crystallized song phases in the intact birds (regression analysis, p < 0.05; Fig. 4). In addition, in the early-deafened zebra finches, Egr1 expression dynamics in the motor nuclei HVC and RA were very similar to the intact birds, although the intact and early-deafened birds developed individual song patterns at different rates (Fig. 4), suggesting that expression of neural plasticity-related genes during development is not substantially altered by auditory input.
This result of Egr1 expression motivated us to conduct global transcriptome analysis in both the intact and early-deafened zebra finches because acquisition of complex song pattern should be achieved by orchestral contribution of a large variety of genes and not by a few restricted numbers of genes. Thus, we used a strategy to evaluate gene expression dynamics in the motor circuit nuclei, HVC and RA, during the critical period of vocal learning in both the intact and early-deafened birds. Global transcriptome profiling was performed using the total RNA extracted from the laser microdissected HVC and RA at three developmental stages: (1) juvenile, dph 54–57; (2) young adult, dph 121–135; and (3) older adult, dph 504–787 (n = 3 deafened and intact birds at each developmental stage). We used the songbird oligo-DNA array platform for comprehensive coverage of genes expressed in the zebra finch brain (Warren et al., 2010). After quality-control assessment and normalization (see Materials and Methods), we surveyed 41,525 probes (11,391 gene-symbol identified genes) in HVC and 41,340 probes (11,374 genes) in RA of the 43,552 probe total set during at least one developmental period. We identified 3362 probes in HVC (2120 genes, 8.10% of the total HVC-expressed probes) and 1466 probes in RA (974 genes, 3.54% of the total RA-expressed probes) that were differentially expressed during at least one developmental period (ANOVA, p < 0.05; Fig. 5A). These developmentally regulated genes encoded signal transduction proteins, receptors, ion channels, cell adhesion molecules, neuron differentiation factors, and neuronal transcription factors, including Sox4, Hey1, Znf238, Neurod6, and Mxi1 (Fig. 5B). In the early-deafened birds, there were many fewer significant differences in expression between developmental periods in both HVC [1922 of 2120 genes: 2959 of 3362 probes (88.0%)] and RA [915 of 974 genes: 1357 of 1466 probes (92.6%)]. In contrast, individual variability in expression was much higher as evidenced by the larger CV in the early-deafened birds compared with the intact birds (Fig. 5A,C). This finding suggests that an underlying genetic program of developmental neural maturation continues after deafening, with auditory experience driving the stabilization of gene expression patterns and levels. Nonetheless, examination of gene expression heat maps (Fig. 5A) showed similar gene expression patterns between groups and preservation of age-dependent expression dynamics in the early-deafened birds. Nearly all genes [2120 genes in HVC (100%) and 973 genes in RA (99.9%)] showed qualitatively similar intrinsic expression patterns during development, indicating the audition-independent robustness of gene expression dynamics in the motor circuit during song development. This robustness of gene expression dynamics is also indicated by similar regulation of average of expression levels for most genes between the intact and early-deafened birds at each developmental stage (Fig. 5C).
Moreover, the audition-independent robustness of age-dependent gene expression dynamics in the motor circuit could be demonstrated using the same set of genes (3362 and 1466 probes in HVC and RA, respectively) by applying PCA. We reduced the dimensionality of the gene expression pattern distribution by applying PCA to the collection of feature distributions of the intact and early-deafened birds at three developmental stages (juvenile, young adult, and older adult), and the first three principal components (PC1, PC2, and PC3) were used to obtain the three-dimensional gene expression values. The principal components for gene expression values showed separable clusters for the three developmental stages but not for audition (+/−) in both HVC and RA (Fig. 5D). The core of each cluster was composed of intact individuals within the same age group surrounded by the age-matched early-deafened individuals, indicating audition-independent, age-driven robust stability of gene expression programs but audition-dependent sensitivity that reduced expression fluctuation in the presence of auditory feedback.
To extract additional information on gene expression dynamics from the transcriptome dataset linked to developmental stage (age), audition (+/−), and song phenotype, we performed WGCNA (Zhang and Horvath, 2005; Hilliard et al., 2012), which allows for the identification of “gene modules” of coexpressed genes. Similar to PCA, WGCNA can also overcome a potential problem associated with the small sample size of the DNA microarray experiment because three developmental datasets were combined in both the intact and deafened groups and used as a single dataset for the analysis (n = 18 in HVC and RA, separately). Within a set of genes that were differentially expressed to a significant level during at least one developmental stage in both the intact and deafened birds (6500 probes including 3744 gene-symbol identified genes for HVC and 3800 probes including 2379 genes for RA), we identified seven modules of coexpressed genes in HVC and nine in RA (Fig. 6). All gene modules in HVC (4662 probes, 2874 known genes) and four of nine modules in RA (1664 probes, 1097 known genes) significantly correlated with the age-related parameters: days post-hatching and developmental stage. After Bonferroni's correction, the first principal components of modules 1 and 2 in HVC (Fig. 6, turquoise and blue, respectively) were significantly related to age and motif consistency. The positive correlations of module 1 (2620 probes representing 1752 genes) in HVC indicate upregulation of gene members during development. In contrast, module 2 (2295 probes representing 1353 genes) in HVC indicates a strongly negative correlation to age and thus represents downregulation of gene members during development. Module 1 (Fig. 6, turquoise, 1162 probes representing 816 genes) and module 3 (Fig. 6, brown, 736 probes representing 496 genes) in RA also indicate significantly positive and negative correlations to age, respectively. Some of the gene modules, such as 1 and 2 in HVC and 7 in RA (Fig. 6, black), also have weak but significant correlations with song parameters related to temporal regulation and syllable phonology. However, no gene module in either HVC or RA showed any significant correlation with auditory input. Likewise, no gene modules exhibited any significant correlation with auditory input in either HVC or RA when the data were reanalyzed using all expressed genes for WGCNA (using 41,525 and 41,340 probes for HVC and RA, respectively) instead of only the developmentally regulated subset. These results indicate the existence of audition-independent gene expression programs in the vocal motor circuit during song development at the coexpression network level.
Audition-independent gene expression dynamics associated with inevitable termination of vocal plasticity and acquisition of structured song patterns
The above results indicate a discrepancy between song phenotype variation and stable gene expression dynamics in the vocal motor circuit during development in the absence of auditory input. Song crystallization/stabilization, typically observed at the young adult stage (dph 100–120) in the intact birds, was still observed in the early-deafened birds but at older ages (older than dph 300). However, the global gene expression pattern and level in the vocal motor circuit nuclei were quite similar between the age-matched intact and deafened birds and were clearly different among the three developmental stages within these two groups. Thus, the gene expression pattern observed at the older adult stage (older than dph 300) in both the intact and deafened birds may be a molecular signature representing the “absolute” endpoint of vocal plasticity associated with audition-independent vocal crystallization/stabilization. To examine this idea, we analyzed the long-term song stability after deafening after song pattern crystallization. Deafening young adult zebra finches induces deterioration of crystallized song patterns and results in variable song patterns (Nordeen and Nordeen, 1992; Lombardino and Nottebohm, 2000; Horita et al., 2008). This deterioration and instability of song pattern by adult auditory deprivation is considered to be the result of an output signal from the basal ganglia–forebrain circuit AFP, which detects mismatching of auditory feedback and vocal outputs, to the vocal motor circuit (Brainard and Doupe, 2000). However, instead of focusing on song deterioration, we examined whether unstable and variable song patterns continued or some degree of structured song pattern was eventually reattained because of an inherent decrease in vocal plasticity (i.e., if there is an age beyond which significant vocal plasticity cannot occur). We observed song deterioration and increased variability in song patterns 1 month after deafening in young adult birds (n = 4 at deafening operations were performed at dph 105–157; average dph, 123.5). However, by dph 300–400, vocal variability started to decrease and the song restabilized with a motif structure (Fig. 7). The restabilized song was distinctly different from the first crystallized song pattern and showed unique motif structure across birds (Fig. 7A). Individual song patterns, once stabilized, were repeatedly generated with the same temporal pattern and sequence and phonologically deteriorated to stable syllables, resembling the songs of the early-deafened birds.
Using a set of transcription factors that showed developmentally different regulation in the motor circuit nuclei HVC and RA (Fig. 5B), we examined the gene expression patterns and levels in the young adult-deafened birds during the song deterioration (>1 week after deafening, n = 5 at dph 104–161; average dph, 132.4) and recrystallization phases (n = 6 at dph 1158–1551; average dph, 1274). We then compared these patterns with those of the age-matched intact and early-deafened birds; however, no significant differences in gene expression were observed in HVC and RA for any transcription factor examined among all the age-matched intact, early-deafened, and young adult deafened groups (Fig. 7C). These results supported the idea of a genetic program-associated “absolute” end of vocal plasticity.
Discussion
Sensorimotor development is accelerated during critical periods of behavioral variation corresponding to enhanced neuroplasticity, followed by behavioral stabilization concomitant with reduced neuroplasticity. However, a certain degree of development and complex motor behavior may be achieved without the appropriate sensory inputs and may be driven by intrinsic developmental genetic programs instead. Our study revealed audition-independent vocal development and stabilization in a songbird zebra finch. Neural activity-dependent gene induction and vocal output driven by the AFP was similar throughout development in both the intact and deafened birds. Although our global transcriptome analysis identified >3000 genes differentially regulated in the vocal motor circuit during normal song development, we also observed similar qualitative patterns of developmental gene expression between the hearing-intact and early-deafened birds using PCA and WGCNA. Thus, vocal motor circuit maturation was driven by an audition-independent genetic mechanism. However, we cannot rule out the possibility that discrete effects of deafening on gene expression could be detected using other experimental designs (e.g., larger sample sizes, testing during singing instead of silence, testing at other developmental time points, or testing the AFP song nuclei). Adult deafening reduces the size of dendritic spines and alters spontaneous action potential activity only in Area X projection neurons of HVC (Tschida and Mooney, 2012). Therefore, audition deprivation could alter developmental gene expression in a cell type-specific manner.
A few studies have examined the regulation of neural plasticity-related gene expression in the song system in social isolation-reared songbirds (Jin and Clayton, 1997a; Sakaguchi, 2004); however, to the best of our knowledge, the present study is the first to examine the dynamics of the brain transcriptome in a vocal learner under audition deprivation, a crucial epigenetic factor in vocal learning. It has been reported that several genes associated with song development are expressed in the song system during the critical period of song learning in audition-intact birds (Soderstrom and Tian, 2006; Balmer et al., 2009; Kubikova et al., 2010). In addition, gene manipulation and pharmacological studies of motor circuit genes regulated by development reflect the effects on song development and song pattern generation (Denisenko-Nehrbass et al., 2000; Matsunaga et al., 2011). However, our global transcriptome analysis revealed a discrepancy between the dynamics of gene expression and song ontogeny, particularly during the young adult stage, when gene expression patterns were similar between the intact and deafened birds, but the stability of their song patterns differed considerably. Moreover, during the critical period, singing-driven gene expression of Dusp1, c-fos, and Egr1 was unaltered in the absence of audition input in both AFP and motor circuits, even with long-term auditory deprivation. A large set of neural plasticity-related genes in song nuclei are regulated by singing (Li et al., 2000; Poopatanapong et al., 2006; Wada et al., 2006), which suggests that singing-driven gene expression is also regulated purely by motor activity and not sensory information (hearing input; Teramitsu et al., 2010). Therefore, volitional singing itself (and/or total amount of singing) as a motor activity could be a crucial developmental epigenetic factor that activates the genetic programs to regulate vocal plasticity and develop species-specific vocal patterns, just as the potency of the external environment provides experience-dependent stimulation that regulates the critical periods and neural plasticity of sensory-related circuits (Hensch, 2004; Knudsen, 2004). Recently, it was shown that both neural plasticity-related genes and epigenetic regulators are induced by neural activity. The direct epigenetic regulators, a DNA demethylation regulator Gadd45b (Ma et al., 2009) and a replacement histone H3.3b (Michod et al., 2012), are induced by singing-driven neural activation in song nuclei in both intact and deafened birds (Wada et al., 2006; our unpublished data). Although the data indicating direct relationships between motor-driven epigenetic regulation and developmentally regulated gene expression dynamics are still limited, neural activity-dependent modification of the DNA methylation landscape and subsequent gene expression changes have been demonstrated in the adult brain (Guo et al., 2011a,b). Therefore, audition-independent genetic programs in the song system may be directly or indirectly regulated by activation of the singing (motor)-dependent epigenetic factors. This idea fits with the observation that the intact and early-deafened birds exhibited a similar amount of singing during development (Fig. 2C), which can be examined by an experiment that includes long-term singing inhibition during the critical period of vocal learning, which might determine whether singing-inhibited songbirds showed delayed gene expression dynamics in song circuits.
How do early-deafened birds develop their songs and finally crystallize song patterns without a tutor song and their vocal feedback information? Although collateral afferent feedback from the syrinx and/or respiratory organs may guide neural maturation and song formation in a deafened condition, the AFP has been considered to generate the vocal exploration necessary for audition-dependent song learning to match with a tutor song (Andalman and Fee, 2009; Charlesworth et al., 2012). Therefore, it is speculated that the early-deafened birds continue random exploration of vocal outputs in their life. However, this supposed exploration of vocal outputs was not observed at the old adult stage even if the AFP is active while singing (Figs. 1C, 3C). Song development of the early-deafened birds is delayed, but their song patterns were eventually stabilized. Similarly, songbird pupils do not continue to generate variable vocal outputs until they match the tutor song. Instead, they generate unique song patterns as their crystallized songs when they mature (Tchernichovski and Nottebohm, 1998). These facts suggest that the transfer of functional dominance from the AFP to the motor circuit is crucial for regulating vocal plasticity and stabilization, and this transference would persist under audition deprivation. Song patterns with species-specific song archetype were eventually crystallized in both the birds deafened before song development and the adult-deafened birds (Figs. 1, 7). These results indicate that the inherent endpoint for song stabilization was regulated by audition- and AFP-independent ways and further suggest the existence of intrinsic neural mechanisms for developing song phenotypes with species constraints and diversification.
Based on our finding regarding audition-independent gene expression dynamics and the similar crystallization and recrystallization timings of song of both the early- and adult-deafened birds, we propose a “two modes hypothesis” with “active” and “passive” modes that explains how song crystallization is regulated. Active song crystallization is considered the traditional song crystallization maintained by the AFP using auditory feedback. In the zebra finch, this active crystallization is usually observed at approximately dph 90–120. Although an outward song pattern of a young adult appears stable after song crystallization, the global gene expression patterns in the song circuits sustain a level of neural plasticity that allows changes in vocal patterns at syllable, phonological, and sequential levels. Therefore, the neural plasticity associated with active crystallization was revealed by manipulating auditory feedback, such as the delayed auditory feedback (Leonardo and Konishi, 1999) and deafening at the young adult stage in an AFP-dependent manner (Brainard and Doupe, 2000; Fig. 7). Furthermore, the neural plasticity associated with intrinsic gene expression patterns in a young adult would allow variable song patterns, with intervention of the AFP vocal exploration but without evaluation of auditory feedback. An active crystallized song may contribute to mating for young adults because it acts as an early deceptive sign of individual maturation, just as the female-directed song is more stable than the undirected song and is therefore considered more attractive (Woolley and Doupe, 2008). In contrast, late passive song crystallization would be independent of audition but associated with age and/or singing experience, which emerges in the zebra finch at approximately dph 300. A previous experiment using adult-deafened zebra finches with a wide range of ages revealed a song deterioration that was more rapid in younger adults, whereas later-deafened adults showed few long-term changes in song structure (Lombardino and Nottebohm, 2000). The early- and adult-deafened birds revealed song stabilization and restabilization at approximately dph 300 (Figs. 1, 7). Furthermore, developmentally regulated gene expression patterns are similarly maintained in HVC and RA among the age-matched intact, early-deafened, and adult-deafened birds (Figs. 5, 7). These results indicate the existence of an age-associated and/or singing experience-associated passive crystallization driven by audition-independent gene expression dynamics in the vocal motor circuit. Although several deafening experiments have been performed in many songbird species, little attention has been paid to the timing of song crystallization and stabilization after auditory deprivation. Therefore, the existence of active and passive crystallization in other species remains to be examined, particularly in open-ended learners such as canaries that can learn new songs.
There is no doubt about the importance of genomic information as the molecular basis for gene expression dynamics in acquiring species-specific vocal patterns. Although it is crucial to use auditory information to control song development, auditory input is not the main driver of developmental gene expression dynamics in the motor circuit nuclei associated with audition-independent vocal crystallization in the zebra finch. Therefore, the audition-independent gene expression dynamics would provide a molecular signature for revealing an intrinsic state of neural plasticity in the song system. However, it remains necessary to examine the possible contribution of neural activity-driven epigenetic regulation of vocal development and the critical period in vocal learners.
Footnotes
This work was supported by Grant-in-Aid for Japan Society for the Promotion of Science Fellows 11J07071 (C.M.) and the Uehara Memorial Foundation, the Asahi Glass Foundation, Grant for Basic Science Research Project from the Sumitomo Foundation, and Grants-in-Aid for Scientific Research 21681024, 21115501, and 25290063 (K.W.). We thank Dr. A. Watanabe for advice on the deafening operation; Drs. H. Horita, W.-C. Liu, S. Yanagihara, and S. Kojima for providing critical comments; Dr. M. Soma for statistical advice; Dr. K. Suzuki for advice on analysis using K-L distance; and A. Kamimura for advice on DNA microarray experiments.
- Correspondence should be addressed to Kazuhiro Wada, Faculty of Science, Department of Biological Sciences, Hokkaido University, Room 910, Building 5, North 10, West 8, Kita-ku, Sapporo, Hokkaido 060-0810, Japan. wada{at}sci.hokudai.ac.jp