Abstract
The striatum's complex microcircuit is made by connections within and between its D1- and D2-receptor expressing projection neurons and at least five species of interneuron. Precise knowledge of this circuit is likely essential to understanding striatum's functional roles and its dysfunction in a wide range of movement and cognitive disorders. We introduce here a Bayesian approach to mapping neuron connectivity using intracellular recording data, which lets us simultaneously evaluate the probability of connection between neuron types, the strength of evidence for it, and its dependence on distance. Using it to synthesize a complete map of the mouse striatum, we find strong evidence for two asymmetries: a selective asymmetry of projection neuron connections, with D2 neurons connecting twice as densely to other projection neurons than do D1 neurons, but neither subtype preferentially connecting to another; and a length-scale asymmetry, with interneuron connection probabilities remaining non-negligible at more than twice the distance of projection neuron connections. We further show that our Bayesian approach can evaluate evidence for wiring changes, using data from the developing striatum and a mouse model of Huntington's disease. By quantifying the uncertainty in our knowledge of the microcircuit, our approach reveals a wide range of potential striatal wiring diagrams consistent with current data.
SIGNIFICANCE STATEMENT To properly understand a neuronal circuit's function, it is important to have an accurate picture of the rate of connection between individual neurons and how this rate changes with the distance separating pairs of neurons. We present a Bayesian method for extracting this information from experimental data and apply it to the mouse striatum, a subcortical structure involved in learning and decision-making, which is made up of a variety of different projection neurons and interneurons. Our resulting statistical map reveals not just the most robust estimates of the probability of connection between neuron types, but also the strength of evidence for them, and their dependence on distance.
Introduction
As the input of the basal ganglia circuit, the striatum has been ascribed key computational roles in action selection (Redgrave et al., 1999; Gurney et al., 2001a; Liénard and Girard, 2014), decision-making (Bogacz and Gurney, 2007; Ding and Gold, 2010, 2012; Yartsev et al., 2018), and reinforcement learning (Reynolds et al., 2001; Samejima et al., 2005; Bornstein and Daw, 2011; Khamassi and Humphries, 2012; Gurney et al., 2015). Within the striatum is a microcircuit comprising the GABAergic spiny projection neurons (SPNs), which make up to 97% of striatal neurons in the rat (Oorschot, 2013), and at least five species of predominantly GABAergic interneurons (Burke et al., 2017; Tepper et al., 2018). These SPNs divide into two populations that express either the D1 or D2 type of dopamine receptors (Gerfen et al., 1990; Gerfen and Surmeier, 2011). Projections from the D1 and D2 SPN populations, respectively, form the striatonigral and striatopallidal pathways (Gerfen et al., 1990; Kreitzer, 2009; Gerfen and Surmeier, 2011), through which they influence dynamics throughout the basal ganglia and beyond. The microcircuit's connections onto the D1 and D2 SPNs are then a potentially major actor in sculpting the output of this nucleus, and thus the computations ascribed to it.
One key to understanding the role of the microcircuit in the computations of striatum is knowing the relative influence of one neuron type on another (Alexander and Wickens, 1993; Hjorth et al., 2009; Humphries et al., 2009; Lau et al., 2010; Ponzi and Wickens, 2010; Klaus et al., 2011; Damodaran et al., 2014). Two broad influences of this microcircuit on the output of SPNs are well known: the feedforward inhibition by GABAergic interneurons, and feedback inhibition by lateral connections between the SPNs (Plenz, 2003; Tepper et al., 2004, 2008; Humphries et al., 2010). But to understand how all elements of the striatum's microcircuit influence its output requires a full account of the microcircuit's wiring, which we currently lack. To address this problem, here we synthesize data from pairwise intracellular recording studies to generate a statistically rigorous and comprehensive map of the wiring probabilities between the key neuron species of the mouse striatum.
A key issue in estimating connection probabilities from intracellular recording data is that recording studies report a single probability for each connection type, given by the rate of successful connections between two types of neuron, without providing any measures of uncertainty. In this paper, we solve this problem by introducing a Bayesian approach to estimating the probability of connection between neuron types using pairwise intracellular recording data, which allows us to draw rigorous conclusions about the strength of evidence for claims about the microcircuit. Using this approach on data from the mouse striatum, we show that the previously reported asymmetry between the rates at which D1 and D2 neurons make connections is robust, with D2 SPNs having roughly twice the connection rate of D1 SPNs; but contrary to previous claims, we also show there is no evidence for an asymmetry in the rates at which they receive connections, and so there is no preferential target for D1 or D2 SPNs. We then demonstrate a new method for using single measurements of connection rates to estimate distance-dependent probabilities and their uncertainty. Using these methods to analyze both SPN and interneuron connectivity, we complete our Bayesian map of the connectivity of the mouse striatum. Finally, we demonstrate how our Bayesian approach lets us quantify and test changes to that microcircuit map: we test the claim that D1 SPN connections are altered in a mouse model of Huntington's disease, and find no evidence for it; and, using recent data from Krajeski et al. (2019), we show the selective asymmetry of D1 and D2 SPNs appears at different stages during development. Our Bayesian approach thus simultaneously evaluates the probability of connection between neuron types, its dependence on distance, and the strength of evidence for it, creating a solid foundation for theories of striatal computation.
Materials and Methods
Data processing
We extracted data on pairwise connections from intracellular recordings of striatal neurons from a database of studies. The full set of data we extracted is given in Table 1. Because of the way Taverna et al. (2008) gave their results, namely, reporting the number of connected pairs and specifying whether any were bidirectional instead of reporting the number of connections, the number of tests for nonmixed pairs we use in this paper is doubled compared with the original study. For instance, when Taverna et al. (2008) say they found 5 connected pairs of 19 pairs of D1 neurons, we interpret this as 5 connections of 38 tests, to be consistent with the mixed D1 and D2 pairs, which by necessity are unidirectional (a D1 → D2 connection can only be tested in one direction or it becomes a D2 → D1 connection). This was also the case for the data of Cepeda et al. (2013) on SPN connections in wild type and Huntington's disease model mice.
Bayesian inference of connection probabilities
A single experimental test for determining whether one neuron connects to another will yield either a positive or negative result, so that it is equivalent to a Bernoulli test with a success rate p, the unknown probability of connection we are trying to infer. When analyzing a whole study consisting of several of these tests, we assume that each test is independent and shares the same success rate p with the others. Thus, the study as a whole can be described using a binomial distribution as follows:
Our goal is to estimate this p, the probability of connection, and the uncertainty of that estimate. According to Bayes theorem, the posterior distribution for p can be determined by the following:
In other words, to determine the posterior, we simply have to add the number of successful tests k to a, and the number of unsuccessful tests n – k to b.
Consequently, obtaining the posterior distribution is a single line of code. In MATLAB, this is posterior = betapdf (p, a + k, b + n – k) with p a vector of probabilities of connection for which we want the corresponding probability density value, and a and b the shape parameters of the initial prior.
Design of the prior based on previous literature
In Results, we test a set of standard priors for the beta distribution, the uniform prior (a = b = 1), the Jeffreys prior (a = b = 0.5), and the Haldane prior (a = b = 0). We also test a prior based on previous literature of connections between SPNs, which we derive here. Knowing its mean μ and variance v, the shape parameters of a beta distribution are as follows:
Previous studies that did not differentiate the D1 and D2 subtypes have shown that SPNs connect to one another at a mean rate of 0.12 (Czubayko and Plenz, 2002; Tunstall et al., 2002; Koos et al., 2004; Taverna et al., 2004), leaving us with a decision to make about the desired variance of the prior. Despite their thoroughness (325 tested pairs in Koos et al., 2004), we could not directly use a beta distribution based on the number of pairs in the initial studies, as the resulting variance, which would reflect uncertainty attached to the measurement of the average connection rate between all types of pairs, would be so small that the new evidence with SPN subtype distinction would be unable to significantly affect the posterior. Indeed, the desired variance should reflect the fact that the average connection rate of 0.12 masks the potential existence of four distinct connection rates for each pair. We were unable to find a principled way of deriving this desired variance; and for this reason, different values of variance were tested before settling for 0.005, which gives the corresponding beta distribution a shape that makes such a prior both sufficiently informative as to be interesting without being completely insensitive to the addition of new data. Setting μ = 0.12 and v = 0.005, we find a = 2.56 and b = 18.12.
Inferring distance-dependent probabilities of connection from point estimates
Intracellular recording studies typically report a maximum distance of pairwise recording, so our point estimate p of the probability of connection is then actually an integral over any distance-dependent probability of connection. We show here how we can derive estimates for the distance-dependent probability of connection from these point estimates, using simple models.
We assume that the probability of connection from a source neuron to a target neuron at distance r away is an exponentially decreasing function of distance as follows:
Our goal here is to estimate the length-scale of the decay of connectivity, particularly so that we may compare the scales between different types of connection, rather than find a detailed model of the distance dependence decay of the probability of connections. Finding the most accurate models would require both having the exact distances between all pairs of sampled neurons (e.g., all pairs of D1 SPNs sampled), which are often not readily available, and solving a range of issues, including the following: finding suitable models to fit the data; finding appropriate methods to fit the models to the data; determining whether the data have sufficient power to fit each model; determining whether the data have sufficient power to decide between different models; and determining whether the data have sufficient power to accurately recover the parameters of each model. The specific distance-dependent model of a particular type of connection in the striatum is thus a considerable piece of work, beyond our scope here. Moreover, it is unlikely in any case to markedly change the estimated length-scale over which the probability of connection decays, as we expect distance dependence to decay exponentially: models of connectivity in the striatum derived from overlapping models of dendrites and axons (Humphries et al., 2010; Hjorth et al., 2020) and data from cortical slices (Levy and Reyes, 2012) and cultures (Barral and Reyes, 2016) all show that probabilities of connection between neurons exponentially decrease with distance. Our exponential model is thus a reasonable choice.
Given our model for
Taking a central neuron as a reference point, we start by looking for a distribution for r, the distance between that central neuron and other neurons chosen for testing. By definition of a probability density function, fsamp(r) must be such that the probability that r is found between two arbitrary values r1 and r1 + Δr is as follows:
This probability distribution for distance depends on how the experimenters sampled their pairs. We consider two models for fsamp(r). Our first model is that, given a starting neuron, experimenters are equally likely to sample any target neuron within their maximum recording radius — we call this model fequi. Our second model is that, from the starting neuron, experimenters will sample its nearest neighbor — we call this model fNN. Next, we derive the fequi model and describe the fNN model below.
The equiprobable sampling model
For a given distance r1 from a central neuron, the probability of selecting a neuron within the volume bounded by r1 and r1 + Δr is also equal to the ratio of the expected number of neurons found within it over the expected number of neurons in the total volume as follows:
If we now combine the general definition of a probability density function (Eq. 9) with this particular equiprobable sampling assumption, we now have the following:
So that finally, by plugging Equation 14 into Equation 8, we obtain the following:
The nearest-neighbor sampling model
To derive the nearest-neighbor model, we will consider the case where experimenters only patched pairs of neurons, ignoring the fact that Planert et al. (2010) would also patch triplets or quadruplets, and we will assume that they always patched the closest neuron within the maximum distance they set themselves. This means that we are looking for the density function for the nearest neighbor, fNN(r).
Because information about the nearest neighbor distribution was hard to find, we reproduce its derivation here, basing ourselves on Krider and Kehoe (2004), in case such a derivation would be of interest to others. Such a density function must satisfy the following:
According to Leibniz' rule, we get on the one hand the following:
We can also substitute the term in brackets in the second half of Equation 17, thanks to the following rewriting of Equation 16 as follows:
After these substitutions and factoring by fNN, we can now rewrite Equation 17 as follows:
As previously, we can now combine these probabilities of sampling a neuron at a given distance with the probability of connection given distance (Eq. 8) to find the overall probability of connection as follows:
Unlike the equiprobable model, the nearest-neighbor model depends on both the density of neurons N and the depth of the sampling plane h. Given that we have collapsed probabilities of connection based on the nature of the presynaptic neuron, we simply use an overall SPN density in the mouse brain of 80,500 per mm3 following the convention of Hjorth et al. (2020) who chose this number based on the work of Rosen and Williams (2001) (and which is close to the estimated density of 84,900 per mm3 in the rat brain) (Oorschot, 1996). As for h, the experimenters tell us that neurons were sampled in the same field of focus that would correspond to a height with an order of magnitude of a tenth or even a hundredth of micrometer. However, given that for a neuron to be in the same field of focus as another, it suffices that some part of its soma, whose diameter is between 10 and 20 μm in mice according to Gagnon et al. (2017), intersects this very small volume, we can expect h to be much larger in practice. Because of this uncertainty, we used three different values of h to get three different nearest-neighbor distributions: 0.1, 1, and 10 μm.
Transformation of posterior distributions
Having obtained the mapping p(β) between p and β, we can go a step further and find a full distribution (a posterior) for β, by transforming the posteriors we have previously obtained for p, fp(p), into posteriors for β, fβ(β). By definition of a density function, for any possible values a and b of β, we have:
Thanks to the mapping from β to p (which is monotonically decreasing), we can also write:
Finally, integration by substitution tells us:
Hence, by identification with Equation 28:
In order to draw fβ, we converted regularly interpolated values of β into the corresponding values of p using Equation 15 for the equiprobable sampling model or Equation 27 for the nearest-neighbor model.
Obtaining the derivative of p with respect to β in Equation 31 depended on the sampling process. In the case of equiprobable sampling, after an integration by parts of Equation 15, we arrive at the following expression of p(β):
However, in the case of the nearest-neighbor distribution, we were unable to find a closed-form for
Code availability
All code used for this work was written with MATLAB. The code necessary for the Bayesian analysis of p, the transformation of fp(p) to fβ(β), and the Monte Carlo simulations depicted in Figure 4C, D is available on the Github account of the M.D.H. laboratory: https://github.com/Humphries-Lab/Bayesian-map-of-striatum-circuitry.
Results
Patch-clamp data on connection rates between SPNs
We begin by reviewing key data on the connections within and between the D1 and D2 type SPNs, which we will also use to motivate our Bayesian approach. Previous studies by Taverna et al. (2008) and Planert et al. (2010) collected data on pairwise connections between SPN subtypes in slices obtained from both the dorsal and ventral striatum of mice. The subtype of the SPNs was determined by targeted expression of EGFP under the control of either a D1 or D2 receptor promoter sequence for Taverna et al. (2008) and of only a D1 receptor promoter for Planert et al. (2010). Nonlabeled SPNs were then assumed to belong to whichever group was not meant to be labeled in this particular animal, and electrophysiological criteria were used to exclude interneurons. After choosing a pair of neighboring cells, no further apart than 50 μm, Taverna et al. (2008) detected connections in current-clamp mode by injecting a depolarizing current step in the first (potentially presynaptic) neuron of the pair, and measuring depolarizing postsynaptic potentials in the second one. This procedure was then repeated the other way around. Planert et al. (2010) recorded up to four neighboring neurons simultaneously, with a larger maximum intersomatic distance of 100 μm, and detected connections by stimulating one neuron with a train of depolarizing pulses to generate action potentials in the presynaptic neuron and recording the potential postsynaptic responses of the other neurons.
In both studies, the ratio of successful tests to the total number of tests (Table 1) was then reported as an estimate
However, typically, intracellular recording studies do not report any estimate for the uncertainty surrounding their measurements of
Bayesian inference of connection probabilities
As we have just explained, frequentist inference gives us a single point estimate
As we show in Materials and Methods, for these data, Bayesian inference turns out to be simple. Given the number of pairwise tests n, and the number of successful connections k, the posterior for p is a beta distribution with updated shape parameters as follows:
For instance, if we had initial values of a and b equal to 1, and we were looking at the data concerning D1 → D1 connections obtained by Taverna et al. (2008) (Table 1) with n = 38 tests and k = 5 connections found, then we would obtain a = 6 and b = 34, and the resulting posterior would be the one depicted in Figure 1C (light blue curve). Depending on our assumptions, different values of a and b can be used to give the prior a desired shape. We begin with the common choice of the uniform distribution in which p could be anywhere between 0 and 1 with equal probability, achieved by setting a = b = 1 as in the example just given.
Using this prior in combination with the data of Taverna et al. (2008) gives us the posterior curves shown in Figure 1C. Once obtained, we can revert, if necessary, to a more frequentist standpoint by extracting from these density functions a single point estimate
The fact that we used the same prior for all pairs of neuron types reflects our initial belief that there is no difference in the probability of connection between pairs. To overcome this belief requires a sufficient amount of evidence, and we can start comparing the different probabilities of connection visually by looking at how much the different posteriors overlap. Based on Figure 1C, for example, it seems that, in the data of Taverna et al. (2008), probabilities of connection segregate depending on the nature of the presynaptic neuron in the pair. The posteriors involving presynaptic D1 neurons overlap considerably with one another, and their region of highest density is lower than for presynaptic D2 neurons who also show great overlap, while there is much less overlap between pairs with different presynaptic neurons. This becomes even more obvious when looking at the 95% credibility intervals drawn underneath the curves that show more or less overlap depending on the nature of the presynaptic neuron: the credibility intervals for pairs with a presynaptic D1 neuron share an overlapping interval roughly covering probabilities of 0.05-0.15, while the overlapping interval for connections with a presynaptic D2 SPN ranges between probabilities of ∼0.20-0.28. A similar pattern repeats itself in the data of Planert et al. (2010) (Fig. 1D), although the exact values of the overlapping regions are shifted toward 0 compared with Taverna et al. (2008), an effect that is potentially because of the maximum distance of sampling as explained later. This opens the possibility that there is indeed an asymmetry in terms of probability of connection that is dependent on the subtype of the presynaptic neuron, with no or little effect of the postsynaptic target subtype, something we will explore more thoroughly later.
One of the main advantages of Bayesian inference is that it forces researchers to be explicit about their priors and gives them the opportunity to choose appropriate ones. In order to illustrate this, we applied three further priors to the experimental data. First, the so-called noninformative Jeffreys prior sets a = b = 1/2. An intuitive way of understanding this prior is to picture ourselves at the very beginning of the experiment, waiting for the result of the very first paired stimulation and recording. This test will either be successful or not, meaning that the shape of the prior should give most and equal weight to these two outcomes (Fig. 1E, inset). Figure 1E, F shows the posteriors that result from using this prior, and we can see how they are practically identical to the posteriors obtained with a uniform prior. This was also the case when using the Haldane prior for which a and b equal 0 (not shown).
Our third prior is based on prior data, for Bayesian inference also provides us with a principled way of integrating previous knowledge into the prior. Earlier work (Taverna et al., 2004; Czubayko and Plenz, 2002; Koos et al., 2004) quantified the rate of lateral connections between SPNs without distinguishing SPN subtypes and concluded that lateral connections occurred at a rate of ∼0.12. Using this information, we can design a beta distribution with a mean of 0.12 and an arbitrary variance of 0.005 (see Materials and Methods), which serves as our third and final prior shown in the inset of Figure 1G. Although the posteriors are more clearly different from the ones obtained with the uniform and Jeffreys prior, they still look very similar. Indeed, we find that the MAP values for a given type of connection (Table 2) are very close whatever the choice of prior, and the previous observation that the curves seem to segregate according to the subtype of the presynaptic neuron is valid in every case. On the other hand, because this prior is more informative than the two previous ones, uncertainty is reduced, as witnessed by the smaller credibility intervals. Given this robustness to the different priors, we shall henceforth exclusively use the prior based on previous literature when analyzing connections between SPNs.
D1 neurons make fewer connections than D2 neurons
We previously observed that D1 neurons seem to make fewer connections than D2 neurons without necessarily targeting one subtype over the other, based on how the posterior distributions appear to segregate by presynaptic subtype in Figure 1. We can go beyond this qualitative analysis by calculating a density function fΔ for the difference between two probabilities of connection. For instance, if we are interested in the difference in the probability of connection between D1 → D1 and D1 → D2 pairs (Fig. 2A), using the posterior distributions fD1→D1(p) and fD1→D2(p), we can find the density function for Δ(D1→D1)– (D1→D2) by the following:
We applied this method to answer the question: do SPNs of either kind preferentially target SPNs of a certain subtype? In particular, based on their point estimates of connection probabilities, Taverna et al. (2008) have claimed that D1 neurons prefer to connect to other D1 neurons than to D2 neurons, a claim used in the computational study by Burke et al. (2017). As evidenced in Figure 2A, E, which plots the density functions for Δ(D1→D1)– (D1→D2) according to the data by Taverna et al. (2008) and Planert et al. (2010), respectively, this is not the case as both density functions include 0 among their most likely values. Indeed, the probability that Δ(D1→D1)– (D1→D2) is smaller than 0 is equal to 0.19 and 0.30 in Taverna et al. (2008) and Planert et al. (2010), respectively. Similarly, we do not find any difference when looking at whether D2 neurons have a preference for a particular postsynaptic neuron subtype (for the density functions of Δ(D2→D2)– (D2→D1), see Fig. 2B,F): for Taverna et al. (2008), the probability that Δ(D2→D2)– (D2→D1) is larger than 0 is 0.16, and the probability that it is smaller than 0 is 0.17 for Planert et al. (2010). We thus find no evidence that SPNs of one subtype (D1 or D2) preferentially target a certain subtype.
Having established that rates of connection are not different for a given presynaptic neuron subtype, we can collapse the data according to the subtype of the presynaptic neuron to answer another question: are D1 neurons more or less likely to make connections overall than D2 neurons? To do this, we simply add up the total number of tested pairs and connections found for the same presynaptic neuron type (e.g., for D1 SPNs:
Probability of connection as a function of distance
So far, when considering data on SPN connections from Taverna et al. (2008) and Planert et al. (2010), we have been careful to analyze each study separately, resisting the temptation of combining the two into a more powerful dataset. We were justified in being so careful since the two experiments used different maximum intersomatic distances between neurons, namely, 50 μm in the study of Taverna et al. (2008) and 100 μm in that of Planert et al. (2010). Given that probability of connection between neurons typically decreases with distance (Hellwig, 2000; Humphries et al., 2010), sampling within a larger area around a neuron will probably cause a decrease in the ratio of connected pairs. Indeed, if we compare the probability of D1 or D2 neurons making lateral connections between the two experiments, we find that these probabilities tend to be larger in Taverna et al. (2008), which has the smaller sampling area, as illustrated by the corresponding density functions shown in Figure 3A. We thus introduce here a method for estimating the distance-dependent probability of connection between neuron types from data on neuron pairs recorded between some known maximum separation; our first use of this method is then to see whether the distance dependence is consistent between the two studies of SPN connectivity.
To do this, we start by positing that this decrease obeys a simple exponential decay function as follows:
A simple model for fsamp would be that, given a certain volume surrounding a central neuron, the probability of sampling any given neuron in that volume is equiprobable for all neurons (Fig. 3B): we call this model fequi. With this assumption, we obtain the following solution (see Materials and Methods):
Probability of connection decreases faster for D1 than for D2 neurons
We apply this method to the posteriors for the probabilities of connection collapsed according to the subtype of the presynaptic SPN and obtain the posteriors for the decay rate β shown in Figure 4A, B for D1 and D2 neurons, respectively. Despite not being perfect, there is a good level of agreement between the two studies, which give estimates in the same ballpark. In both cases, although the posteriors do not overlap much, they do indeed lie quite close to each other providing us with a continuous, albeit broad, range of possible values. In the case of D1 neurons, the decay rate is expected to be in a region between 0.03 and 0.13 μm–1. The exponential decay curves representing the probability of connection as a function of distance for a decay rate equal to the MAP of each study (which are given in Table 3) are also shown in the inset of Figure 4A, and it is evident that they are extremely close to one another. As for D2 neurons (Fig. 4B), the decay rate is smaller, as expected given that we have already shown that the overall probability of connection is higher for these neurons, ranging between 0.02 and 0.07 μm–1.
To get a better idea of how consistent the results are between the Taverna et al. (2008) and Planert et al. (2010) datasets, we used Monte Carlo simulations to try and recover the number of observations made by each experiment with its own value of R using the β value that seems most likely given the two posterior curves, that is, the intersection of the two posterior curves (Fig. 4A,B, black dotted lines). We ran 10,000 virtual experiments by generating random distances between pairs of neurons (according to Eq. 14) and the maximum distance R used by that study, and then deciding whether they were indeed connected according to the probability of connection of Equation 36. We generated the same number of pairs as tested in each study and then reported the number of times we obtained the exact same number of positive results (Fig. 4C,D, red bars in the histograms). For instance, setting an intermediate decay rate of 0.075 μm–1 for D1 neurons, and generating 10,000 replications of the experiment of Taverna et al. (2008), which recorded 85 pairs of SPNs with a D1 presynaptic neuron, we obtained >500 simulations where exactly 8 pairs were connected, which is the number originally reported (Fig. 4C, left). In all 4 cases, we managed to replicate the original results relatively often, proving that the best estimates for β are reasonable. As a sanity check, we also used the decay rate MAPs for one subtype to try and replicate the results of the other subtype and found it much harder, if not impossible, to replicate the results, verifying that the decay rates have to be different for the two types of SPNs (results not shown).
If we compare the estimates between the two datasets more critically, there is a clear bias for the posterior curves extracted from Taverna et al. (2008) to be shifted to the right compared with the posterior curves from Planert et al. (2010) (Fig. 4A,B). Indeed, if we refer to the insets in Figure 4A, B, the best estimate of β according to the study by Taverna et al. (2008) would predict a 50% drop in probability of connection every 8 μm for D1 neurons versus every 13 μm according to the study by Planert et al. (2010). In the case of D2 neurons, the difference between the two exponential curves is even greater, with a half distance of 13 μm versus 21 μm according to Taverna et al. (2008) and Planert et al. (2010), respectively. Furthermore, the density functions for the difference in posteriors between the two studies both lie predominantly in the positive domain (Fig. 4E,F), and the probability that the decay rate is larger in the Taverna than Planert data is 0.967 and 0.996 for βD1→SPN and βD2→SPN, respectively. Although the disagreement between the datasets is small, it is nonetheless consistent.
Biased neuron sampling could explain differences between datasets
One potential explanation is that the sampling of neuron pairs was more complex than the equiprobable sampling model we first assumed. In this section, we explore this possibility by considering a model for fsamp where, rather than choose neurons equiprobably within a visible area surrounding a neuron, experimenters preferentially tested neurons that were closest to one another, to maximize the probability of detecting connections.
To explore this model, we used the same probability of connection given distance (Eq. 36) in combination with a new density function fNN for the probability of the distance to the nearest neighbor, to derive a new mapping from p to β. We found the resulting mapping to be (see Materials and Methods):
The first column of Figure 5 shows the resulting posteriors for D1 neurons. The picture for h = 0.1 μm is not so different from that obtained under the equiprobable sampling hypothesis, but for greater values of h, the posteriors overlap far more. In particular, for h = 1 μm, the posterior curves practically coincide. Similarly, for D2 neurons, h = 0.1 μm does not much improve the agreement between the two studies, but greater values of h do (Fig. 5, second column). This approach successfully illustrates how a tendency to select neurons closer together might account for the discrepancy observed in estimates of the decay parameter using the simpler equiprobable sampling model. Moreover, this analysis shows how the details of data sampling matter when estimating connectivity statistics from intracellular recording data.
Fast spiking (FS) interneurons preferentially connect to D1 SPNs
We now turn to completing our Bayesian map of the striatal microcircuit by evaluating the connections of different species of interneurons to the SPNs and to each other; we present the full map in Discussion (see Fig. 10). Three main types of interneurons are commonly documented (Kreitzer, 2009): FS, persistent low threshold spiking (PLTS), and cholinergic (ACh) interneurons. We will also include in this list TH and NPY-expressing NeuroGliaForm (NGF) interneurons because of their relationship to ACh interneurons, which is crucial to the function of the cholinergic component of the circuitry (English et al., 2011; Ibáñez-Sandoval et al., 2011; Dorst et al., 2020). We took data on pairwise intracellular recordings of these interneuron types from the range of studies listed in Table 1, and determined Bayesian posteriors for p as previously. Unlike for the SPNs, we did not have prior studies of the interneuron connections to help us design a prior, so we relied on a uniform prior instead.
We focus first on FS interneurons that project to the SPNs, using intracellular recording data from Planert et al. (2010) and Gittis et al. (2010). The posteriors we obtain for Planert et al. (2010) (Fig. 6B) are consistent with quite high connection probabilities: the MAPs are 0.67 and 0.89 for connections to D2 and D1 neurons, respectively. However, the small size of the samples means that the range of possible values is also broad (95% credibility intervals: D2, [0.35, 0.88]; D1, [0.56, 0.98]). Fortunately, the study of Gittis et al. (2010) is based on a much larger sample resulting in narrower posteriors shown in Figure 6A. Thanks to these narrower posterior curves, it is possible to conclude that FS interneurons preferentially target D1 neurons. Indeed, when we integrate ΔFS→D1– FS→D2 (Fig. 6A, inset), we find the probability that FS interneurons prefer to connect to D1 neurons is >0.99.
The two studies seem to disagree as Planert et al. (2010) gives much higher estimates of p, but this is resolved by taking into account the maximum distance used by the two studies: 100 μm for Planert et al. (2010) and 250 μm for Gittis et al. (2010). Indeed, if we convert the posteriors for p into posteriors for the exponential decay rate β of the probability of connection given distance, with the assumption of equiprobable sampling as previously explained, we obtain posteriors that very largely overlap (Fig. 6C,E; Table 3), thus reconciling the two studies. In line with the already discussed overall smaller rate of connection to D2 neurons, the probability of connection drops much faster as distance increases for connections to D2 neurons (dropping to 50% after ∼100 μm; see Fig. 6F) than for connections to D1 neurons (50% connection rates occurring at a distance of at least 200 μm; see Fig. 6D). Consequently, there are at least two length-scales in the striatal microcircuit, with connections between SPNs falling to 50% probability within a few tens of micrometers (Fig. 4A,B), but connections to SPNs from FS interneurons falling to 50% probability at ≥100 μm (Fig. 6D,F).
PLTS interneurons make few local connections in striatum
We turn now to the connections that FS interneurons make on other interneurons of the striatum. To assess these, we analyzed data from Gittis et al. (2010) on connections FS interneurons make to PLTS, cholinergic, and other FS interneurons (Fig. 7A). Their data on connections between FS and PLTS interneurons were pooled with the data on the same connections from Szydlowski et al. (2013): We checked that the data from the two studies were in agreement by calculating posteriors separately for each study and found the density functions for the difference between the posteriors (Δ) for both directions (FS →PLTS and PLTS →FS) included 0 among their most likely values (results not shown).
We see from these data that the probability of connection from FS interneurons to PLTS interneurons is low (
The combined data of Gittis et al. (2010) and Szydlowski et al. (2013) (Fig. 7C) show no evidence that PLTS interneurons connect locally to other interneurons (
Further evidence that the effect of cholinergic interneurons onto SPNs is mediated by GABA interneurons
A recent study (Dorst et al., 2020) reported intracellular recording data on connections between cholinergic interneurons and the subtype of GABAergic interneurons that express TH (Table 1). When we apply our Bayesian method to this dataset (Fig. 7D), we find that TH and cholinergic interneurons connect reciprocally to one another quite frequently and with practically equal probabilities (
The activity of cholinergic interneurons indirectly affects SPNs via at least one type of GABAergic interneuron (English et al., 2011). To examine this route, we combine pairwise intracellular recording data from English et al. (2011) on connections between cholinergic interneurons and NPY-NGF interneurons, with data from Ibáñez-Sandoval et al. (2011) on connections from NPY-NGF interneurons to SPNs. Our analysis (Fig. 7E) reveals that cholinergic neurons connect frequently to NPY-NGF interneurons, which in turn connect very frequently to SPNs, making them an effective relay of cholinergic signals; this relay may also be regulated by the NPY-NGF interneurons frequent feedback connections on cholinergic interneurons. Furthermore, given that Ibáñez-Sandoval et al. (2011) used a maximum distance between neurons of 100 μm, it is possible to transform the posteriors for the probability of NGF → SPN into posteriors for the exponential decay rate β of probability of connection given distance. The MAP of β is the lowest we have found at ∼0.002 μm–1 (Fig. 7F; Table 3). This means that, even at a distance of 100 μm from an NGF interneuron, an SPN still has a 0.8 probability of receiving a connection from this interneuron, which partly explains the effectiveness of the cholinergic system in regulating SPN activity.
Evidence used to compare SPN subtype connection rates in WT and Huntington's disease mice is insufficient
To this point, we have used our Bayesian approach to evaluate the probability of connection, the evidence for it, and (where possible) its dependence on the distance between neurons for every unique connection within the striatal microcircuit (for which there are extant data). We turn now to showing how our Bayesian approach lets us not just construct a map of the microcircuit, but also quantitatively test evidence for changes in the microcircuit. To do so, in this section, we evaluate evidence that connections between SPNs change in a mouse model of Huntington's disease (Cepeda et al., 2013); in the next section, we evaluate evidence for how connections between SPNs change over development.
The study of Cepeda et al. (2013) used smaller samples of identified SPN pairs than the (Taverna et al., 2008) and (Planert et al., 2010) studies of SPN connectivity (Table 4); consequently, the resulting posteriors are notably impacted by the choice of the prior (Fig. 8A,D). Indeed, the posterior curves obtained from the data of Cepeda et al. (2013) with a uniform prior or the prior based on previous literature look very different, contrary to those of Taverna et al. (2008) (compare Fig. 1C and Fig. 1G) and Planert et al. (2010) (compare Fig. 1D and Fig. 1H). In particular, the posteriors for WT mice obtained from the prior based on the previous literature look very similar to the initial prior (Fig. 8D) simply because of the small number of samples. Independently of the choice of prior, the posteriors overlap so much that it is not possible to confirm the rates of connection differ between any pairs of SPNs in the WT mice (Fig. 8A,D). Given how broad the posteriors are, we infer that this lack of difference is because of insufficient data rather than a true absence of difference.
Crucially, this lack of data is also true when comparing connection rates between the WT and Huntington's model mice (Fig. 8A,B or Fig. 8D,E). We find no evidence to support one of the conclusions reached by the authors that D1 → D1 connections are more likely in the Huntington's model than WT mice. Indeed, if we plot the density functions for the difference in probabilities of connection for D1 → D1 pairs between the two animal groups, we can see that it is very probable for this difference to be 0, but also any other value ranging from ∼−0.3 to 0.3 if we use a uniform prior (Fig. 8C) or a slightly more conservative −0.2 to 0.15 using the prior based on previous literature (Fig. 8F).
D2 SPN connection asymmetries appear during development
The development of connections between SPNs and their asymmetry can be tracked through postnatal development thanks to a recent study by Krajeski et al. (2019) who measured the probability of connection for different SPN pairs at three different stages of postnatal mouse development. The researchers reported that D1 neurons established lateral connections earlier than D2 neurons (a reproduction of these results with our added Wilson CIs is shown in Fig. 9A–C). We used our Bayesian approach to check this conclusion against the uncertainty in the experimental data, and provide further details of the development of the striatal microcircuit.
In order to apply our Bayesian method to these data, we resort to the uniform prior, since we have no particular expectation about these connection rates at these stages of development, and obtain the posteriors for each combination of neurons shown in Figure 9D–F. We have also tested the Jeffreys prior and obtained practically identical results (not shown). We can see that D1 neurons have already made some connections in the first 3-6 d of postnatal development (P3-P6; Fig. 9D), but it is hard to say from inspecting the posteriors at each subsequent developmental stage (P9-P12 and P21-P28) whether the connections made by D1 SPNs continue to develop or have already finished by P3-P6 (Fig. 9E,F). If we instead look at the difference in posteriors (fΔ) between consecutive developmental stages (Fig. 9G,H), we see some evidence that connections made by D1 neurons continue to develop, with respective probabilities of 0.81 (P3-P6 to P9-P12) and 0.77 (P9-P12 to P21-P28) that the connection density of D1 neurons increases (probabilities again found by integrating fΔ between 0 and 1). Comparing the earliest (P3-P6) and latest (P21-P28) stages gave a similar probability of 0.85 that D1 connections increased (data not shown).
For presynaptic D2 neurons on the other hand, it is clear that no or very few connections are present at P3-P6, and that they gradually appear later (Fig. 9D–F). Computing the difference in posteriors (fΔ) for P(D2 → D2) and P(D2 → D1) between consecutive stages (Fig. 9G,H), we find the probability that D2 SPNs increase their connection density is 0.95 between P3-P6 and P9-P12 and 0.92 between P9-P12 and P21-P28, indicating a gradual development of these connections up to postnatal days 21-28. By also calculating
Our finding of strong evidence that D1 neurons are less likely to receive connections from SPNs than D2 neurons, both in adults (Fig. 2) and at later stages of development (Fig. 9), implies a role for active wiring processes in the developing striatum. We considered a simple model of a passive wiring process in which the contact of an axon from a first neuron onto the dendrite of a second neuron is determined only by the probability that an axon segment and a dendritic segment simultaneously occupy the same location (Liley and Wright, 1994; Kalisman et al., 2003; Humphries et al., 2010). For SPNs, we have the repeated observation that D1 SPNs have denser dendritic trees for the same volume as D2 SPNs (Gertler et al., 2008; Fujiyama et al., 2011; Gagnon et al., 2017). A passive wiring model would thus predict that D1 SPN dendrites receive more axonal contacts than D2 SPN dendrites from the same presynaptic type of SPN.
If this were true, we would expect to find in our analyses here that the asymmetry of connection rates would depend on the type of the postsynaptic neuron, when we instead find it depends on the type of presynaptic neuron; and we would expect D1 → D1 connections to be quite numerous when indeed these are quite rare. Together with our confirmation that the data of Krajeski et al. (2019) show D1 and D2 neurons develop their connections at a different rate, our analyses thus suggest that there is an active wiring process in striatal development that causes either an underexpression of connections to D1 SPNs or overexpression of connections to D2 SPNs.
Discussion
We presented a Bayesian inference approach to analyzing connectivity using intracellular recording data, and applied it to reconstruct the microcircuit of the striatum from an exhaustive survey of data from pairwise intracellular recordings. None of these data have had any assessment of the uncertainty in their connection estimates or of the strength of evidence they provide. Our new approach allows us to now draw rigorous conclusions about the strength of evidence for claims about the microcircuit, and in turn synthesize these data into as complete a map as the data allow.
A Bayesian map of the striatal microcircuit in mice
Figure 10 synthesizes the complete map we obtained of the striatal microcircuit in mice. It emphasizes our key results: first, there is strong evidence of a connection asymmetry that depends on the type of presynaptic SPN, namely, that D2 SPNs are roughly twice as likely to contact another SPN as D1 SPNs, but no evidence of an asymmetry that depends on the type of postsynaptic SPNs; second, that there is strong evidence for FS interneurons preferentially connecting to D1 SPNs; third, that there is strong evidence of dense projections from NPY-NGF interneurons to SPNs, likely as dense or denser than those from FS interneurons; and, finally, that connections between SPNs occur on much shorter-length-scales than the connections made by interneurons.
For ease of interpretation, Figure 10 summarizes each connection probability as the best point estimate we can obtain (i.e., the MAP of the corresponding posterior distribution). But we now have the full posterior distributions underlying these estimates, and some of these are broad; for example, the connection probability from NGF to ACh interneurons has a 95% credible interval twice as wide as its best (MAP) estimate (Table 1). Our posterior distributions thus reveal that a wide range of potential striatal wirings are consistent with current data.
From this, it follows that any model of the striatum should sample its connection probabilities from these posteriors to understand the robustness of the results. It is now well established that parameters of neural models fall into two classes: those whose precise values are critical to the resulting predictions of a model, and those to which a model is not sensitive (Panas et al., 2015; Ponce-Alvarez et al., 2020). And it is likely that striatal dynamics are indeed sensitive to variations in the probabilities and distances of connections (Humphries et al., 2010; Spreizer et al., 2017). Thus, we propose that computational researchers change their usual practice of setting a single value for connection parameters, and instead sample from the posterior distribution on each run of their model; to this end, we give the complete form of all our posteriors for p in Table 1 and our credibility intervals for the decay parameter β in Table 3.
Extending the microcircuit map
Constructing our map also revealed or reemphasized further research questions. First, because the available experimental data are drawn from across the striatum, the map is silent on anatomical issues, such as whether connection probabilities differ between the patch and matrix compartments of the striatum or between different regions of the striatum. Second, we lack data on the connectivity of some types of striatal interneuron thought distinct to those examined here, including the calretinin-expressing interneurons and rare subtypes of 5HT3a-expressing interneurons (Tepper et al., 2018). Also omitted are the known connections from ACh interneurons to SPNs. As these synapses use muscarinic receptors, and so are metabotropic, they do not evoke postsynaptic currents detectable by the simultaneous stimulation and recording technique used in the studies relied on here. However, new techniques, such as induced overexpression of G-protein activated ion channels (Mamaligas and Ford, 2016), may allow the future quantification of connection rates. An advantage of our approach is that any new data on pairwise recording data in the striatum can build directly on our analyses, by either updating the posteriors we arrived at or by estimating new posteriors for connections that lack data at present.
Third, we emphasize that this is a map of the local microcircuit, for the connections from a source neuron to other types of neurons in its neighborhood. Our distance-dependent probability model assumes that connection probability falls monotonically with distance within the neighborhood. However, subtypes of interneurons that send axons longer distances (Tepper et al., 2018) could violate this assumption, such as reports of PLTS interneurons with an axon that spans a distance of >1 mm making infrequent bouquets of terminals (Kawaguchi, 1993). More detailed knowledge of these long-distance connections would allow for a more complete map of striatal connectivity.
Finally, this is a map of connectivity: full knowledge of the influence of one neuron type on another requires data on the strength of the different connections, which may in turn depend on where on the target neuron they fall (Oorschot et al., 2013; Du et al., 2017).
Implications for theories of the striatum
Ever since the paper by Jaeger et al. (1994), which reported finding no functional lateral connections among SPNs, computational modeling of the striatum and wider basal ganglia moved away from earlier lateral inhibition models implementing a winner-takes-all strategy where these connections took center stage (Groves, 1983) toward a predominantly feedforward view (Plenz, 2003; Tepper et al., 2004). According to these feedforward models (e.g., Mink, 1996; Gurney et al., 2001b; Frank, 2005; Humphries et al., 2006; Leblois et al., 2006), the output of the striatum is entirely determined by the pattern of its cortical inputs modulated by the strength of the different cortico-striatal synapses.
Our Bayesian map of the striatal microcircuit provides further evidence that this feedforward model is limited because the striatum's internal circuit is crucial in shaping its outputs. In the classic direct-indirect pathway model of the basal ganglia (Alexander and Crutcher, 1990), the D1 and D2 SPNs respectively form the populations originating the direct and indirect pathways to the basal ganglia output nuclei, D1 SPNs sending direct axonal projections, and the D2 SPNs projecting first to the globus pallidus, which relays to the output nuclei. We have shown that data on connections between D1 and D2 SPNs (Taverna et al., 2008; Planert et al., 2010) provide strong evidence for D2 SPNs making more connections to other SPNs than D1 SPNs. Some models and theories have interpreted these data as showing that the indirect pathway will dominate the direct pathway output (Bahuguna et al., 2015; Burke et al., 2017). However, that the same data provide no evidence of an asymmetry in the preferred targets of D1 or D2 SPNs further suggests that there is not a selective inhibition of the D1 SPNs by D2 SPNs, but that the D2 SPNs are as equally likely to inhibit themselves as D1 SPNs. As such, the nature of the interaction of the direct and indirect pathways, and hence their response to cortical and thalamic inputs, remains unclear.
Microcircuit mapping can be used anywhere in the brain
We showed a range of advantages that our Bayesian approach has over more traditional frequentist approaches. One is that it replaces a single point estimate surrounded by a flat CI by a posterior distribution covering all the possible values, so that overlaps between connection rates become immediately apparent. Second, as argued by Dienes (2014), when differences between connection rates are nonsignificant, Bayesian methods allow us to distinguish between cases where the data are insufficient to draw a conclusion from cases where there really is no difference. For instance, the posteriors for connection rates in the study of Cepeda et al. (2013) strongly overlap (Fig. 8); but because the posteriors are so broad, we know this is because of insufficient data rather than evidence of no difference. On the contrary, when we failed to find a difference in connection rates for different postsynaptic targets of the D1 and D2 SPNs, the posteriors are sufficiently narrow for us to confidently conclude that such a difference is either absent or quite small (Fig. 2). Third, when making comparisons between probabilities of connection, we can use the full posteriors to compute an explicit probability that the difference is less than or greater than zero. A final advantage of Bayesian inference is the use of priors to incorporate past results, as we did for the connections between SPNs, or test our starting assumptions, as we did by showing our SPN connection results were robust to the choice of prior distribution.
Our approach can easily be applied to any brain regions where paired recording experiments have taken place, such as the recent study of Ellender et al. (2019) on how the embryonic origin of cortical neurons influences their connection probabilities. Indeed, obtaining the posterior distributions given k positive tests of connection and n – k negative tests requires a single line of MATLAB or Python thanks to built-in functions (see Materials and Methods). Consequently, not only are these Bayesian methods easily applicable to intracellular recording data from any brain region, but also may be a rare case where it is easier to be Bayesian than frequentist.
Footnotes
This work was supported by the Medical Research Council Grants MR/J008648/1, MR/P005659/1, and MR/S025944/1. We thank Robert Schmidt and Benoît Girard for helpful comments on a draft of this paper.
The authors declare no competing financial interests.
- Correspondence should be addressed to François Cinotti at francois.cinotti{at}gmail.com or Mark D. Humphries at mark.humphries{at}nottingham.ac.uk