Abstract
There has been a growing interest in understanding the role of the lateral habenula (LHb) in reward processing, affect regulation, and goal-directed behaviors. The LHb gets major inputs from the habenula-projecting globus pallidus and the mPFC, sending its efferents to the dopaminergic VTA and SNc, serotonergic dorsal raphe nuclei, and the GABAergic rostromedial tegmental nucleus. Recent studies have made advances in our understanding of the LHb circuit organization, yet the precise mechanisms of its involvement in complex behaviors are largely unknown. To begin to address this unresolved question, we present here emerging cross-species perspectives with a goal to provide a more refined understanding of the role of the LHb circuits in reward and cognition. We begin by highlighting recent findings from rodent experiments using optogenetics, electrophysiology, molecular, pharmacology, and tracing techniques that reveal diverse neural phenotypes in the LHb circuits that may underlie previously undescribed behavioral functions. We then discuss results from electrophysiological studies in macaques that suggest that the LHb cooperates with the anterior cingulate cortex to monitor action outcomes and signal behavioral adjustment. Finally, we provide an integrated summary of cross-species findings and discuss how further research on the connectivity, neural signaling, and physiology of the LHb circuits can deepen our understanding of the role of the LHb in normal and maladaptive behaviors associated with mental illnesses and drug abuse.
- anterior cingulate cortex
- cognition
- globus pallidus
- lateral habenula
- punishment
- reward
- rostromedial tegmental nucleus
Introduction
The habenula is a complex anatomical region nestled in the epithalamus that is comprised of the medial (MHb) and lateral (LHb) nuclei. In particular, the LHb has received substantial research attention because of its involvement in processing of reward and aversive information, cognitive flexibility, and emotion (Hikosaka et al., 2008; Zhao et al., 2015). Dysfunctions of the LHb have been implicated in mental illnesses associated with maladaptive processing of positive and negative valence stimuli (Proulx et al., 2014); thus, it is important to understand molecular, cellular, and circuit interactions of the LHb circuits that may contribute to these disorders.
The LHb receives inputs from several forebrain limbic circuits involved in motivation and value processing and sends outputs to brainstem centers involved in prediction of aversive events and behavioral inhibition (Kim and Lee, 2012). The LHb is activated by aversive stimuli and reward omission and is inhibited by unexpected rewards, with the activation preceding inhibition of dopaminergic (DA) neurons (Matsumoto and Hikosaka, 2007). In an elegant set of experiments, exposure of mice to aversive stimuli was shown to augment the LHb excitatory inputs onto rostromedial tegmental nucleus (RMTg) neurons, whereas optogenetic activation of this pathway promoted behavioral avoidance (Stamatakis and Stuber, 2012). Furthermore, the specific RMTg projections to the VTA are activated following exposure to aversive stimuli and inhibited by those that signal reward (Jhou et al., 2009a), supporting the role of LHb-RMTg-VTA connections in aversive behaviors. In another study using viral-mediated tracing with electrophysiology and optogenetic manipulations, LHb inputs to the VTA were also shown to elicit aversive behaviors (Lammel et al., 2012). These observations in rodents were extended by findings that nonhuman primate brains have a similar structure to rat RMTg that receives excitatory input from the LHb and exhibits negative reward-prediction errors (PEs) (Hong et al., 2011). This further suggests that the LHb-RMTg circuit is involved in transmission of aversive signals and motivational behavior across species. However, emerging evidence that will be explored here suggests important functional differences between the LHb and RMTg responsiveness to aversive stimuli and learning.
In contrast to the LHb projections to midbrain, the behavioral consequences of the neural activation of LHb afferents are less understood. The VTA sends rich projections to the LHb and plays a role in reward, motivation, learning, and aversion (Deutch and Roth, 1990; Roeper, 2013), which have been linked to release of DA in the LHb by the VTA afferents. However, contrary to this longstanding hypothesis, recent studies have found that these mesohabenular neurons do not release DA because they lack vesicular monoamine transporter 2 and cannot package DA inside synaptic vesicles (Root et al., 2015). These authors show that electrical stimulation of the LHb induces release of norepinephrine that can activate postsynaptic DA receptors in the LHb of rats and lead to the expression of reward phenotype. In addition, it has been shown in mice that optogenetic stimulation of VTA efferents to the LHb does not induce release of DA but instead causes release of GABA (Stamatakis et al., 2013), which suggests that the lack of a large pool of vesicular DA in the projections from VTA to LHb may be a common characteristic of rodent brain.
The heterogeneity in neuronal phenotypes of the VTA has been associated with VTA's diverse behavioral roles. For example, activation of VTA-GABA neurons is aversive (Tan et al., 2012), whereas activation of VTA-GABA afferents to LHb induces suppression of LHb firing and promotes reward-related behaviors (Stamatakis et al., 2013). Activation of VTA-VGluT2 neurons that innervate LHb (Hnasko et al., 2012) also elicits aversion (Root et al., 2014).
In addition to VTA afferents, the LHb receives major glutamatergic and GABAergic inputs from the globus pallidus interna (GPi), lateral hypothalamus (LH), basal forebrain, and ventral pallidum (Hong and Hikosaka, 2008; Shabel et al., 2012; Golden et al., 2016; Stamatakis et al., 2016). Neurons in the GPi and LH corelease glutamate and GABA as their neurotransmitters with the net effect that they can increase LHb firing when activated (Shabel et al., 2012, 2014; Stamatakis et al., 2016). In both cases, neurons in the LH and GPi are modulated by the cues that predict the availability of reward and aversion (Hong and Hikosaka, 2008; Bromberg-Martin et al., 2010) and can drive aversive behavior (Shabel et al., 2012; Stamatakis et al., 2016). Recent experiments have shown that shifted corelease of glutamate and GABA is a crucial mediator of aversive states, such as depression and cocaine withdrawal (Shabel et al., 2014; Meye et al., 2016).
In addition to representing signals associated with negative outcome, evidence is emerging to support the involvement of the LHb in goal-directed behaviors and behavioral flexibility (Kawai et al., 2015; Baker et al., 2016). Studies in nonhuman primates suggest that the anterior cingulate cortex (ACC) and the LHb both signal the negative outcome of choice and subsequently the adjustment of choice behaviors, albeit in distinct ways (Kawai et al., 2015). Although the nature of neural signals that reach the ACC is not known, these data suggest that the ACC and the LHb may act jointly to mediate information processing and decision making. Recent results from the LHb inactivation experiments in rodents (Baker et al., 2016), discussed in this review, may facilitate interpretation of findings from macaque studies and offer insight into the possible mechanisms by which the LHb registers ongoing states and determines appropriate behavioral response.
By bringing together findings from anatomical, molecular, physiological, and behavioral studies, this review seeks to explore in-depth critical components of the larger LHb circuits with emerging cross-species perspectives with a goal to formulate more refined hypotheses regarding the role of the LHb in complex behaviors. Because disturbances in the neural mechanisms involved in processing of positive and negative valence and cognition are common across mental and drug abuse disorders, detailed investigation of the role of the LHb in reward and aversion as well as behavioral flexibility should facilitate understanding of how the LHb dysfunctions may be implicated in these disorders.
Dissecting the role of the habenula-projecting globus pallidus in behavioral reinforcement
The ability to choose appropriate actions in the face of potential reward and punishment is essential for survival. Such behavioral choice is subserved by the brain reward circuitry encompassing the basal ganglia and the mesocorticolimbic dopamine system (Floresco et al., 2008; Graybiel, 2008). The LHb has recently emerged as a key nucleus in this circuitry (Lecourtier and Kelly, 2007; Proulx et al., 2014) as neurons in this nucleus are excited by the expectation of unpleasant events or when an outcome is worse than expected, and are inhibited by the expectation of reward or when an outcome is better than expected (Matsumoto and Hikosaka, 2007, 2009; Hong et al., 2011; Lawson et al., 2014). In other words, LHb neurons encode motivational values and PEs (Hikosaka, 2010; Proulx et al., 2014), in an opposite manner compared with dopamine neurons in the VTA and SNc (Schultz, 1998). Notably, activation of the LHb suppresses dopamine neurons via the inhibitory RMTg (Christoph et al., 1986; Ji and Shepard, 2007; Jhou et al., 2009a), and LHb lesions impair the PE signals in dopamine neurons (Tian and Uchida, 2015). These results suggest that the LHb provides, at least in part, the PE signals to dopamine neurons.
A major input to the LHb arises from the GPi, also called the entopeduncular nucleus in rodents (Rajakumar et al., 1993; Parent et al., 2001; Hong and Hikosaka, 2008). Neurons in the habenula-projecting globus pallidus, hereafter named as GPh, are distinct from the canonical GPi neurons. Whereas the latter population is inhibitory and controls motor function through projections to the brainstem and thalamic motor areas (Graybiel, 2008), the former uses glutamate as a major neurotransmitter (Shabel et al., 2012, 2014) and projects exclusively to the LHb (Rajakumar et al., 1993; Parent et al., 2001; Hong and Hikosaka, 2008; Stephenson-Jones et al., 2016). Interestingly, GPh neurons respond to reward and punishment, as well as the expectation of reward and punishment, in a manner similar to that of LHb neurons (Hong and Hikosaka, 2008, 2013). In addition, the response of GPh neurons precedes that of LHb neurons, and stimulation of the GPh area excites LHb neurons (Hong and Hikosaka, 2013). These findings indicate that the GPh may provide a major source of the value and PE signals to the LHb.
Recent studies have also examined the behavioral effects of GPh-LHb circuit manipulations. Activation of neurons in the GPh-LHb-VTA pathway elicits aversive responses (Lammel et al., 2012; Shabel et al., 2012; Stamatakis and Stuber, 2012). These observations, together with the findings that both GPh (Hong and Hikosaka, 2008, 2013) and LHb (Matsumoto and Hikosaka, 2007, 2009; Hong et al., 2011; Lawson et al., 2014) neurons encode negative PE signals, have led to a prevailing hypothesis that the primary function of the GPh-LHb circuit is controlling aversive or negative motivational responses. However, this hypothesis was recently challenged as inhibition of the LHb prevented mice from forming a preference for large risky rewards over small certain rewards (Stopper and Floresco, 2014). This suggests that the LHb does not just carry the information about the cost of an action but rather contributes to the evaluation of both the cost and benefits of a given decision.
Information about the cost and benefit of a choice may arise from the GPh as recent in vivo recordings in mice have demonstrated that, as in primates (Hong and Hikosaka, 2008, 2013), GPh neurons signal both how good or how bad an outcome is through graded inhibition or excitation, respectively (Stephenson-Jones et al., 2016). These bidirectional signals have an opposing influence on reinforcement, as mimicking them with optogenetic inhibition or excitation can reinforce or discourage choices, respectively (Stephenson-Jones et al., 2016). Moreover, synapse-specific manipulations have recently revealed that excitation of the GPh is essential for discouraging unrewarded actions, whereas the inhibitory signals are essential for reinforcing rewarded outcomes (Stephenson-Jones et al., 2016). Consequently, the GPh does not just provide an aversive signal to the LHb; rather, it is a key site where information of opposing valence is integrated to signal whether a decision was better or worse than expected.
The evaluation function of the GPh-LHb pathway is likely mediated through the bidirectional control of dopamine neurons. The GPh is well placed to bidirectionally influence dopaminergic activity as it provides tonic excitatory input to the LHb (Shabel et al., 2012; Stephenson-Jones et al., 2013; Stephenson-Jones et al., 2016), which in turn regulates dopamine neurons (Ji and Shepard, 2007; Matsumoto and Hikosaka, 2007, 2009), disynaptically via the GABAergic RMTg (Jhou et al., 2009a; Hong et al., 2011). Indeed, bidirectional changes in LHb firing have opposing effects on motivation and dopaminergic firing, with activation of the LHb leading to aversion and a phasic decrease in dopamine (Ji and Shepard, 2007; Matsumoto and Hikosaka, 2007; Stamatakis and Stuber, 2012), whereas inhibition of the LHb is rewarding and leads to an increase in dopaminergic activity (Stamatakis et al., 2013). Furthermore, lesions of the LHb disrupt negative and impair positive reward PE coding in dopaminergic neurons (Tian and Uchida, 2015), which is critical for reinforcement learning. Consequently, we propose that an increase in GPh activity, when an outcome is worse than expected, increases the excitatory drive onto the LHb to inhibit dopamine neurons and discourage repeating the action, whereas decreases in GPh activity, when an outcome is better than expected, remove the tonic excitation of the LHb to increase dopaminergic activity and reinforce the choice (Fig. 1).
Influence of GPh and lateral habenula on downstream structures
Although the exact influences of the LHb on behavior are still being elucidated, a major part of this influence likely occurs via the LHb's powerful inhibitory influence on midbrain dopamine neurons by way of the RMTg (Jhou et al., 2009a, b; Balcita-Pedicino et al., 2011; Hong et al., 2011). This opposing relationship between LHb and dopamine suggests an elegant symmetry between opposing motivational systems in the brain, as dopamine release is reinforcing (Tsai et al., 2009), whereas activation of LHb or RMTg is aversive (Stamatakis and Stuber, 2012; Jhou et al., 2013). Notably, drugs of abuse, such as cocaine, activate both of these opposing systems at different times. For example, single doses of cocaine initially increase dopamine levels but, after an ∼15 min delay, begin to activate LHb neurons instead (Jhou et al., 2013). During the latter period, place preference studies show that the animal is actually in an aversive state, reflecting a postdrug “crash” (Ettenberg et al., 1999; Jhou et al., 2013). Furthermore, lesions of the LHb, or optogenetic inactivation of the RMTg occurring 15–25 min after cocaine delivery, greatly impair avoidance conditioning driven by the aversive phase of cocaine. Additional studies from Mameli and colleagues further implicate LHb activation in longer-term cocaine-induced depressive-like states, as these states seem to depend on glutamate receptor trafficking in the LHb itself (Meye et al., 2015), as well as on synaptic changes in the balance of neurotransmitter signaling in GPh axons that corelease GABA and glutamate (Shabel et al., 2014; Meye et al., 2016). Hence, the functional connections from the GPh to LHb and then to RMTg may be involved in aversive effects of cocaine withdrawal and possibly other addictive drugs, such as opioids (Sanchez-Catalan et al., 2016). Notably, delayed aversive effects of drugs had previously been described in so-called “opponent process” models of motivation, which had been postulated decades earlier to describe the negative hedonic state of drug withdrawal (Solomon and Corbit, 1974; Koob et al., 1989), raising the possibility that one of the functions of the LHb and RMTg is to drive the negative affective states occurring during drug withdrawal.
Although the above studies imply that the LHb and RMTg function in tandem, a combination of anatomical, electrophysiological, and behavioral findings suggest that there are also major differences in function. In primates, a subset of RMTg neurons discriminate between rewarded and nonrewarded cues a few milliseconds earlier than the LHb, despite the RMTg neurons being downstream from the LHb (Hong et al., 2011). This finding is consistent with anatomic evidence that the RMTg receives multiple convergent inputs, not only from the LHb, but also from structures such as cortex and extended amygdala, that likely carry information about motivationally relevant events (Jhou et al., 2009b; Kaufling et al., 2009; Gonçalves et al., 2012). This raises the question about how signals from the LHb are integrated into the larger networks of which it is a part. Different behaviors differ greatly in the degree to which they depend on either the LHb or RMTg. For example, as noted above, both RMTg and LHb inactivations greatly reduce conditioned aversive responses to cocaine. However, only RMTg lesions, and not LHb lesions, produce striking losses of freezing behavior in a conditioned fear paradigm (Heldt and Ressler, 2006; Jhou et al., 2009a). In addition, we have found that rats with RMTg lesions are highly resistant to punishment (i.e., will continue to seek rewards despite receiving large footshocks), whereas LHb lesioned rats are not different from fully intact rats (P.J. Vento et al., submitted). Hence, even though LHb activation appears sufficient to drive aversion, it is not always necessary. The exact reasons why some behaviors are more dependent on LHb function than others is unknown, and represents a major area of future inquiry. Given the wide distribution of afferents to the RMTg, considerable effort will be needed to determine which afferents are active at which times, and for what purposes. As will be detailed further in the following section, one area where the LHb may play a particular role is in the flexibility to drive behaviors that are different from ongoing or default behaviors. For example, it was noted above that LHb lesions do not abolish conditioned freezing in a fear paradigm. However, LHb lesions may abolish the enhancement of fear induced by isolation stress, without removing the baseline fear (Murphy et al., 1996), consistent with a role for the LHb in flexible adjustment of behavior beyond what is expected by default.
The lateral habenula in behavioral flexibility
The LHb is interconnected with forebrain and midbrain areas required for working memory and behavioral flexibility performance (Baker et al., 2015; Quina et al., 2015), often regarded as the ability to switch behaviors when external contingencies change. Refinement of early lesion work and electrophysiological recording in freely behaving rodents suggests the LHb plays a broad role in behavioral flexibility. For example, the ability to reverse learned behaviors relies on an intact LHb in both appetitive and aversively motivated tasks (Nielson and McIver, 1966; Thornton and Evans, 1984; Thornton and Davies, 1991; Baker et al., 2016). Specifically, habenula lesioned rats trained to run to a maze arm and press a lever to end foot shock continue to enter the original arm when the lever contingencies are reversed (Nielson and McIver, 1966). When required to reverse behavior from one reward arm to the opposite to receive reinforcement on a t-maze, LHb inactivation resulted in increased trials to criterion that was marked by increased lose-shift and decreased win-stay behavior, suggesting that rats were no longer sensitive to reward outcomes when making arm choices (Baker et al., 2015). In a subsequent study, rats were also insensitive to cues that signaled the appropriate upcoming arm choice and instead engaged in an ineffective egocentric response strategy (Baker et al., 2016). In the Morris water maze task, inactivation or blockade of excitatory AMPA receptors within the LHb using CNQX resulted in impaired learning and retrieval, but not consolidation of a hidden platform location (Mathis et al., 2015). The rats engaged in thigmotaxis when either drug was injected, suggesting that rats may be engaging in default behavior, which would prevent them from effectively finding the platform and readily acquiring the task.
This deficit is not likely due to decreased motivation but rather an inability to engage in alternative strategies, as thigmotaxis represents an active attempt to escape from the water. A possible explanation for findings across both appetitive and aversive tasks is that the LHb is critical for overcoming default behavior to engage in either exploratory or alternative learned behaviors. Somewhat speculatively, this could also account for findings from optogenetic stimulation of LHb neurons, which resulted in active, passive, and conditioned avoidance behaviors in mice (Stamatakis and Stuber, 2012; but see Jhou et al., 2009a), in which passive defenses were selectively impaired over active ones after RMTg disruption in rats. Namely, activation of these neurons could spur animals to engage in nonspecific alternative behaviors, which manifest as avoidance of neural stimulation.
In vivo electrophysiological recordings can reveal the nature of neural computations during performance of behaviors (e.g., adaptive navigation) previously identified with lesion and pharmacological or optogenetic studies of the LHb. To date, few studies have examined neural correlates of behavior in unrestricted animals. Two such studies have identified LHb neurons in freely navigating rats that correlate with the speed of the animal (Sharp et al., 2006; Baker et al., 2015). These speed cells displayed stability across changing contexts (Baker et al., 2015), suggesting that the LHb provides ongoing behavioral state information that interacts with other signals related to changing environmental contexts. Similar context-insensitive speed neurons have been described in the entorhinal cortex, and these are thought to provide important behavioral information to an overall cognitive map needed for accurate navigation (Kropff et al., 2015). Similarly, speed cells in the LHb may provide a stable readout of behavior, which is integrated with task relevant cues, such as reward, punishment, or cues that can be used to guide behavior when the ongoing or default behavior is not the optimal choice. As of yet, the limited number of electrophysiological studies in unrestricted, freely navigating animals hampers attempts to create a comprehensive account of LHb function in behavioral flexibility.
Together, these results suggest that the LHb broadly contributes to strategy generation and/or response selection when alternative strategies or responses are required. The neural responses that contribute to strategy generation and response selection within the LHb, and how various input structures contribute to neural codes, remain key components of understanding the role of the LHb in behavioral flexibility.
Negative reward signals in the lateral habenula regulate functions of the PFC
The LHb may interact with the PFC, to regulate aspects of executive function. Because the LHb does not send projections to the PFC, its signals do not directly reach this region. However, the LHb is known to regulate the activity of monoaminergic cell groups, such as dopamine (Christoph et al., 1986; Ji and Shepard, 2007) and serotonin neurons (Wang and Aghajanian, 1977) that project to the PFC. Pharmacological inactivation of the LHb increases dopamine concentration in the PFC (Lecourtier et al., 2008). Thus, the LHb could affect processing in the PFC by transmitting its signals via monoaminergic cell groups. Consistent with this view, Lammel et al. (2012) reported that manipulation of signals transmitted from the LHb to the mPFC via dopamine neurons in the VTA alters an avoidance behavior in rodents. Using optogenetics, these authors demonstrated that phasic stimulation of the pathway from the LHb to the VTA in a conditioned place preference assay causes a strong conditioned place aversion. They also found that LHb neurons synapse primarily on dopamine neurons in the VTA projecting to the mPFC and that injection of a D1 dopamine receptor antagonist into the mPFC prevents the conditioned place aversion induced by the optogenetic stimulation of the LHb-VTA pathway. These findings suggest that LHb signals reach the mPFC via VTA dopamine neurons where they can affect information processing and cognitive functions mediated by this area.
As described above, the interaction of the LHb with the PFC seems to be important for regulating its functions. To address how these brain structures cooperate, Kawai et al. (2015) recently investigated what signals are shared and what roles are divided between the LHb and the PFC, especially a ventral part of the PFC called the ACC (Kawai et al., 2015). The ACC is known for its crucial roles in monitoring negative outcome (Ito et al., 2003; Quilodran et al., 2008) and adjusting behavior using the outcome information (Shima and Tanji, 1998; Williams et al., 2004). Kawai et al. (2015) recorded neuronal activity from the LHb and ACC in monkeys performing a reversal learning task in which choosing one of two options was followed by a liquid reward in a probabilistic manner while choosing the other option was followed by no reward. The reward contingency was frequently reversed without any instruction. To maximize reward gain, the monkey was required to store information about negative outcomes in several past trials and to use that information to adjust subsequent choice behavior. Their findings show that ACC neurons store outcome information from several past trials, whereas LHb neurons mainly represent current negative outcome with shorter latencies. Furthermore, ACC neurons, but not LHb neurons, signaled a behavioral adjustment after negative outcome experiences. These findings suggest that the LHb provides signals about ongoing negative outcome, whereas the ACC accumulates this outcome information across several past trials and uses this information to adjust subsequent choice behavior. Consistent with the idea that the ACC receives negative outcome information from the LHb, dopamine neurons that mediate the LHb-ACC circuit interaction also encode negative outcome information by their inhibitory response. Such an inhibitory dopamine signal has been proposed to be transmitted to the ACC by disinhibiting ACC neurons (Holroyd and Coles, 2002). Indeed, pharmacological blockade of dopaminergic transmission reduces the event-related potential evoked by negative outcome in the ACC (Vezoli and Procyk, 2009). Together, LHb neurons might transmit negative outcome signals via dopamine neurons to the ACC that acts as a source of the outcome information accumulated across trials. Further studies will determine the causal relationship between the LHb and ACC signals and their role in performance monitoring.
In conclusion, recent discoveries from rodents and nonhuman primate studies implicate the LHb circuits in the encoding of motivational values and PEs, which are critical signals for learning and flexible decision making when faced with potential reward and/or punishment. Although the detailed mechanisms that underlie the role of LHb in these behaviors still remain unclear, the findings described above support a conceptual framework where, through distinct neural populations and synaptic connections, the LHb inputs and outputs act both cooperatively and distinctively to guide complex behaviors. The studies that examined the role of LHb-RMTg circuits in aversive behaviors suggest that these regions may act both synergistically and distinctively to drive negative affective states. Given that RMTg also receives inputs from other brain regions that exhibit reward-dependent modulation, this could account for temporal differences in LHb and RMTg activity and in representation of state value. Moreover, the GPh-LHb circuits play a role not only in representation of negative motivational valence or promotion of aversive responses, but also in the evaluation of positive and negative outcomes, which are the bases of flexible decision making. The results from nonhuman primate studies are congruent with the notion that the LHb plays a role in decision making, possibly by acting as a source of neural signal necessary for the prefrontal brain areas, such as ACC, to signal behavioral adjustments following negative or positive outcome experiences. Recent studies in rodents further reveal that a specific population of cells within the LHb represents the information relevant to ongoing behavior that is not linked to reward specific responses. These data suggest that the LHb tracks the current behavioral state of the animal to inform action selection within a given context. Together, these results suggest that neural heterogeneity within the LHb serves as a source of diversity of behavioral phenotypes associated with this circuit. These findings also raise open questions about the cellular identity and anatomical connectivity of cellular ensembles that play a role in cognition and positive and negative valence that need to be investigated more extensively.
An additional open question for lateral habenula research is whether neural signals that play a role in learning develop stable patterns of activity over the course of learning or whether the neural code remains relatively stable from naive to stable behavior. Thus far, evidence suggests that individual neurons and theta power of local field potentials remain relatively stable when environmental or reward contingencies change (e.g., light to dark or reward omission) (Baker et al., 2015). This seems to contrast with evidence suggesting that the LHb is important for performing learned behaviors or switching between behavior (Stopper and Floresco, 2014; Baker et al., 2016). It may be that population ensembles order themselves in unique ways under learned contexts to provide efferent connections of the specific context of the animal as is seen in prefrontal cortical areas (Kobak et al., 2016; Ma et al., 2016). Ongoing experiments are attempting to address this possibility.
Future studies using cell type- and projection-specific manipulations, electrophysiological recordings, and new methods to study signaling dynamics should aid in better understanding of the nature of neural computations in the LHb circuit that guide cognitive functions and reward and aversive-related behaviors. Given that learning about what predicts rewarding or aversive outcomes is adaptive, studies addressing molecular, cellular, and circuit properties of the LHb circuits will offer insights into the mechanisms underlying dysfunctions of these domains in mental disorders and drug addiction ultimately guiding the development of more effective treatments.
Footnotes
This work was supported by National Institute of Drug Abuse Grant T32 DA07278–20 to P.M.B., National Institute of Mental Health Grant R01MH108924 to B.L., National Institute of Drug Abuse Grants 1R01DA037327 and National Institute on Drug Abuse 1R21DA032898 to T.J., Cabinet Office, Government of Japan Funding Program for Next Generation World-Leading Researchers LS074, Ministry of Education, Culture, Sports, Science and Technology of Japan Grants-in-Aid for Scientific Research 26120707, Takeda Science Foundation, Inamori Foundation, and Uehara Memorial Foundation to M.M., National Institute of Mental Health Grant R01MH58755 and University of Washington Research Royalty Fund Grant to S.J.Y.M., and National Alliance for Research on Schizophrenia and Depression and EMBO Long-Term Fellowship Awards to M.S.-J.
The authors declare no competing financial interests.
- Correspondence should be addressed to Dr. Aleksandra Vicentic, Division of Neuroscience and Basic Behavioral Science, National Institute of Mental Health, Rockville, MD 20892. vicentica{at}mail.nih.gov; or Dr. Bo Li, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724. bli{at}cshl.edu