Abstract
Anticipating rewards is fundamental for decision-making. Animals often use cues to assess reward availability and to make predictions about future outcomes. The gustatory region of the insular cortex (IC), the so-called gustatory cortex, has a well-established role in the representation of predictive cues, such that IC neurons encode both a general form of outcome expectation and anticipatory outcome-specific knowledge. Here, we used pavlovian–instrumental transfer (PIT) in male rats to assess if the IC is also required for predictive cues to exert both a general and specific influence over instrumental actions. Chemogenetic inhibition of IC impaired the ability of a reward-predictive stimulus to energize instrumental responding for reward. This deficit in general transfer was evident whether the same or different outcomes were used in the pavlovian and instrumental conditioning phases. We observed a similar deficit in specific PIT, such that rats with IC inhibition failed to use a reward-predictive stimulus to guide choice toward actions that deliver the same food reward. Finally, we show that rats with IC inhibition also fail to show outcome-selective reinstatement. Together, these data suggest a crucial role for IC in the representation of appetitive outcomes and particularly in using this representation to guide instrumental action.
Significance Statement
Animals use environmental cues to anticipate future rewards. The influence of these predictive cues on behavior can be studied using pavlovian–instrumental transfer (PIT), in which pavlovian outcome expectancies can energize (general PIT) or selectively guide (specific PIT) instrumental actions. In the current study, we show that chemogenetic inhibition of the gustatory region of the insular cortex (IC) attenuates both general and specific transfer, as well as the selectivity of outcome-induced reinstatement. These results demonstrate a critical role for the IC in the representation of appetitive outcomes and significantly contribute to a broader understanding of the cortical bases of PIT.
Introduction
Environmental cues are often used to infer the availability of rewards and to make predictions about future outcomes. Our behavior can thus be shaped by the presence of stimuli that we have learned signal particular outcomes (Colwill and Rescorla, 1988). This influence of predictive stimuli on decision-making is exemplified by pavlovian–instrumental transfer or PIT (Walker, 1942; Estes, 1943; Colwill and Rescorla, 1988; Holmes et al., 2010), which refers to the capacity of pavlovian outcome expectancies to affect instrumental action. Two forms of transfer have been observed. General transfer reveals that a pavlovian stimulus can invigorate instrumental responding via an increase in appetitive arousal. That is, the presence of a reward-predictive stimulus is thought to activate a broad motivational drive which can invigorate instrumental behavior for reward (Estes, 1943, 1948; Rescorla and Solomon, 1967; Dickinson and Dawson, 1987; Holland and Gallagher, 2003). In specific transfer, the predictive stimulus selectively facilitates performance on an action that delivers the same reward as the stimulus, but not other actions earning different rewards, on the basis of a shared, detailed outcome representation (Trapold and Overmier, 1972; Kruse et al., 1983). Thus, pavlovian stimuli can elevate (general PIT) or selectively guide (specific PIT) instrumental action.
The gustatory region of the insular cortex (IC), the gustatory cortex, has a well-documented role in the representation of predictive stimuli. In an elegant series of studies, Fontanini and colleagues (Maffei et al., 2012; Samuelsen et al., 2012, 2013; Gardner and Fontanini, 2014; Kusumoto-Yoshida et al., 2015; Vincis and Fontanini, 2016; Vincis et al., 2020) demonstrated that IC neurons show robust responses not only to auditory stimuli predicting the general availability of taste (Samuelsen et al., 2012, 2013) but also to stimuli signaling-specific taste outcomes (Gardner and Fontanini, 2014; Vincis et al., 2020; see also Saddoris et al., 2009). These anticipatory IC signals are acquired with learning and extinguish, suggesting that they track the predictive status of pavlovian stimuli (Gardner and Fontanini, 2014).
While the neural substrates of PIT have been extensively studied (Cartoni et al., 2016), the potential involvement of the IC in transfer remains unknown. This is surprising given the clear evidence that the gustatory region of IC encodes information about anticipated outcomes, including outcome value, to guide instrumental action (Balleine and Dickinson, 2000; Saddoris et al., 2009; Parkes and Balleine, 2013; Parkes et al., 2015, 2018). The IC is also anatomically well connected to several regions known to mediate both general and specific PIT, including the nucleus accumbens (Allen et al., 1991), mediodorsal thalamus (Wright and Groenewegen, 1996), ventral tegmental area (Ohara et al., 2003), orbitofrontal cortex (Barreiros et al., 2021; Chen et al., 2022; Mathiasen et al., 2023), and basolateral and central nucleus of the amygdala (Sripanidkulchai et al., 1984; Yamamoto et al., 1997; Yamamoto, 2006; Gehrlach et al., 2020). Thus, the IC appears ideally placed to contribute to PIT.
Here, we examined the functional involvement of the gustatory region of the IC in general and specific PIT. We first show that general transfer is impaired under chemogenetic inhibition of IC when the same outcome is used during pavlovian and instrumental conditioning. However, while it is largely accepted that such experimental designs indeed elicit general transfer (Hall et al., 2001; Holland and Gallagher, 2003; Corbit and Balleine, 2005, 2011), it has been noted that these “single outcome” designs might allow for mixed transfer effects (Cartoni et al., 2016). Therefore, to ensure that our results reflected deficits in general and not specific PIT, we replicated these findings using two distinct outcomes for pavlovian and instrumental conditioning and again found that IC inhibition attenuated general PIT. We then demonstrate that the IC is also required for specific PIT, as well as outcome-selective reinstatement. Together, these results indicate that IC plays a critical role in the representation of appetitive outcomes and inform our understanding of the cortical bases of PIT.
Materials and Methods
Subjects
Subjects were male Long–Evans rats, aged 3–4 months. Rats were housed in pairs within plastic boxes in a climate-controlled room, maintained on a 12 h light/dark cycle. All behavioral observations occurred during the light phase of this cycle. Rats were handled daily for 5 d prior to the behavioral procedures. Two days before behavior, rats were food restricted to maintain their weight at ∼90–95% of their ad libitum feeding weight. All experimental protocols adhered to French (Council Directive 2013-118, Feb. 1, 2013) and European (Directive 2010-63, Sept. 22, 2010, European Community) legislations and received approval from the local ethics committee.
Viral vectors
An adeno-associated viral vector carrying the inhibitory hM4Di designer receptor exclusively activated by designer drugs (DREADDs; Armbruster et al., 2007; Rogan and Roth, 2011) was obtained from Addgene [AAV8-CaMKIIa-hM4D(Gi)-mCherry; 1.46 × 1013 gc/ml; Addgene plasmid #50477; http://n2t.net/addgene:50477; RRID:Addgene_50477; gift from Bryan Roth]. A control vector lacking the hM4Di receptor was also used (AAV8-CaMKII-EGFP; 2.1 × 1013 gc/ml Addgene viral prep #50469-AAV8; http://n2t.net/addgene:50469; RRID:Addgene_50469; gift from Bryan Roth). To activate the inhibitory receptor, the exogenous ligand deschloroclozapine (DCZ; MedChemExpress HY-42110) was dissolved in dimethyl sulfoxide (DMSO) to obtain a final concentration of 50 mg/ml, which was aliquoted and stored at −80°C (stock solution). This stock solution was further diluted in physiological saline to obtain a final concentration of 0.1 mg/ml, which was injected at 1 ml/kg (i.e., 0.1 mg/kg). DCZ was always handled in dim light conditions and fresh solutions were prepared each day. Rats were injected with DCZ or vehicle (0.2% DMSO in physiological saline) intraperitoneally (i.p.) 30 min before behavior. We, and others, have previously demonstrated the efficacy of this ligand (Nagai et al., 2020; Nentwig et al., 2022; Cerpa et al., 2023).
Surgery
Rats were anesthetized with isoflurane (5% induction, 1–2% maintenance) and placed on a stereotaxic apparatus (Kopf). The incision site was disinfected with Betadine and subcutaneously injected with a local anesthetic mixture of lidocaine and ropivacaine. Vaseline was applied to the eyes and a heating pad was placed under the rat. Rats were rehydrated with subcutaneous injections of warm saline (0.9%, 10 ml/kg/h) throughout the surgery. The viral vector was injected using a 10 µl Hamilton syringe connected to a microinjector (UMP3 UltraMicroPump II with Micro4 Controller, World Precision Instruments). One microliter of AAV was injected at a rate of 0.2 µl/min at two sites in each hemisphere of IC. The coordinates were as follows (in mm from the bregma): AP +0.3, ML ±5.5, DV −7.3, and AP +1.3, ML ±5.5, DV −7.3 (Paxinos and Watson, 2014). After surgery, rats were subcutaneously injected with a nonsteroidal anti-inflammatory drug (meloxicam, 2 mg/kg), and sutures were covered with aluminum spray. Rats were then individually housed in a warm cage with facilitated access to food and water and continuously monitored for 2 h. Finally, rats were returned to their collective home cage for postoperative care and were allowed a minimum of 8 d to recover before behavioral procedures began. Injections of vehicle or DCZ occurred 4–5 weeks after surgery.
Behavioral apparatus
Behavioral procedures were conducted in eight operant cages (40 cm width × 30 cm depth × 35 cm height, Imetronic) that were individually enclosed in sound-resistant shells. Each chamber was equipped with two pellet dispensers that delivered 45 mg grain (Bio-Serv) or sugar pellets (TestDiet) into a food port. Cages also contained two retractable levers that could be inserted to the left and right of the food port, speakers that provided auditory stimuli (3 kHz tone or 10 Hz clicker), and a house light.
Behavioral procedures
Experiment 1a: general pavlovian–instrumental transfer—single outcome
Rats first received a habituation session to the two conditioned stimuli (CS+ and CS−, tone and clicker, counterbalanced). This session was identical to the pavlovian conditioning sessions except no outcomes were delivered. Pavlovian conditioning began the following day. Rats received eight pavlovian training sessions, each lasting 1.5 h. During each session, rats were presented with 2 min presentations of the two CSs, one paired with grain pellets (CS+) and the other with no outcome (CS−). Grain pellets were delivered on a random time 30 s schedule throughout the CS+ (i.e., 4 pellets per CS+ presentation). During each session, the CS+ was presented nine times and the CS− was presented three times in a pseudorandom order with a variable intertrial interval (ITI) that averaged to 5 min (ranging from 3 to 7 min). Food port entries were recorded during CS+, CS−, and a 2 min interval before each CS presentation (preCS).
Following pavlovian conditioning, rats underwent 12 sessions of instrumental training (two sessions per day). During each session, the left and right levers were available and responding on one lever (active lever) earned grain pellets (i.e., the same outcome used for pavlovian conditioning), whereas responding on the other lever (inactive lever) was unrewarded. Responding on the active lever was continuously reinforced on the first session and then shifted to a variable interval 30 s schedule of reinforcement for sessions 2–12. We used interval schedules of reinforcement as they are commonly employed in other general PIT studies (Hall et al., 2001; Cardinal et al., 2003; Holland and Gallagher, 2003; Lex and Hauber, 2008; Lingawi and Balleine, 2012; Halbout et al., 2019); however, random ratio schedules of reinforcement can also be used to produce general transfer (Corbit and Balleine, 2005; Holmes et al., 2010). The identity of the active and inactive lever (left vs right) was counterbalanced. Rats could earn a maximum of 30 pellets per session and the maximum session duration was 30 min.
The next day, all rats underwent instrumental extinction during which the two levers were extended for 30 min and no outcomes were delivered. Rats were then given a pavlovian extinction session, which was identical to the CS habituation session previously described. The aim of these two extinction sessions was to (1) reduce the baseline rate of instrumental responding and (2) reduce competition between the pavlovian and instrumental responses (Holmes et al., 2010).
Twenty-four hours after the extinction sessions, rats were injected intraperitoneally with vehicle or DCZ and, 30 min later, they were given a PIT test. At the beginning of the test, the two CSs were each presented twice in the absence of the levers. Then, both levers were extended and instrumental responding was extinguished for 8 min (in the absence of the CSs) to reduce baseline instrumental responding. Subsequently, each CS was presented twice in a pseudorandom order while both levers were extended. Each CS lasted 2 min with a fixed 4 min ITI. Food port entries and lever pressing were recorded throughout the session and responses were separated into preCS, CS+, and CS− periods (2 min each). No outcomes were delivered during the PIT test.
Experiment 1b: general pavlovian–instrumental transfer—two outcomes
The procedure was identical to that described for Experiment 1a except that two distinct outcomes were used for pavlovian conditioning and instrumental training. That is, half of the rats were trained with sugar pellets during the pavlovian phase and grain pellets during the instrumental phase and vice versa for the remaining half. For all rats, CS+ was the clicker and CS− was the tone, and all rats were injected with DCZ prior to the general PIT test.
Experiment 2: specific pavlovian–instrumental transfer
Rats first received a single habituation session to the two CSs (tone and clicker). During this 90 min session, each CS was presented six times in a pseudorandom order, during which no outcomes were delivered. The duration of each CS was 2 min and the average ITI was 5 min (range, 3–7 min). Then, all rats received eight sessions of pavlovian conditioning (one session per day) during which each CS (tone and clicker) was associated with one of the food outcomes (grain or sugar pellets, counterbalanced). Each session consisted of four tone and four clicker presentations. During each 2 min CS, the associated reward was delivered on a 30 s random time schedule, resulting in four pellet deliveries during each CS presentation. CSs were delivered pseudorandomly with an average ITI of 5 min (range, 3–7 min). Food port entries were recorded and separated into CS period and a 2 min period before each CS presentation (preCS).
Rats then received instrumental training to perform two actions to earn two distinct food rewards (e.g., left lever earns a grain pellet and right lever earns a sugar pellet, or vice versa). These food rewards were the same outcomes that were used for pavlovian conditioning. Rats were trained under continuous reinforcement (CRF) for the first 3 d, then under a random ratio 5 schedule (RR5) on Days 4–6 and, finally, an RR10 schedule on Days 7–9. A random ratio schedule of reinforcement was used in line with other several other studies on specific PIT (Corbit and Balleine, 2003; Ostlund and Balleine, 2007a; Corbit and Janak, 2010; Bradfield et al., 2015, 2018; Sias et al., 2021). During these sessions, each lever was presented twice for a maximum of 10 min each or until 20 outcomes were earned. The ITI between lever presentations was 2.5 min. Hence, each subject could earn a maximum of 40 grain and 40 sugar pellets during each session.
The next day, rats were given an instrumental extinction session followed by a pavlovian extinction session, as described above for the general PIT paradigm. Twenty-four hours later, all rats were injected with DCZ and were then given an outcome-specific PIT test 30 min later. The test procedure was identical to the general PIT test. Performance was evaluated under three conditions: lever pressing during the preCS period, and lever pressing during the CS periods on the lever that earned the same outcome as the presented cue (same condition), and on the lever that earned the different outcome (different condition). No outcomes were delivered during the PIT test.
Experiment 3: outcome-induced reinstatement
Rats were first given instrumental training, as described for Experiment 2. The training lasted for 7 d with continuous reinforcement (CRF) for the first 3 d, then an RR5 on Days 4–5 and, finally, an RR10 schedule on Days 6–7. On Day 8, rats were given an outcome-induced reinstatement test. The test began with a 15 min extinction period to reduce baseline instrumental responding. During this period, both levers where available but no outcomes were delivered. This was followed by four reinstatement trials with an ITI of 7 min. Each trial consisted of a single outcome delivery (either the grain or sugar pellet) in the following order: grain, sugar, sugar, grain. Responding was measured during the 2 min periods immediately before (Pre) and after (Post) each outcome delivery (Ostlund and Balleine, 2007b; Bradfield et al., 2015, 2018; Abiero et al., 2022). Responding during the reinstatement test was not rewarded. Rats were then given 1 d of rest and received a second reinstatement test on Day 9 with the order of outcome deliveries counterbalanced: sugar, grain, grain, sugar. Data are presented as an average of these two tests.
Finally, rats were given an additional RR10 retraining session and then received a reinstatement test with a novel outcome (chocolate pellet) that had not been associated to either action. This session was identical to that described, with four chocolate pellets delivered with a fixed ITI of 7 min.
Tissue processing and immunohistochemistry
After behavioral testing, rats received an intraperitoneal injection of xylazine (20 mg/kg, i.p.) followed by an intraperitoneal injection of pentobarbital (Euthasol; 200 mg/kg, diluted in saline). Rats were then perfused transcardially with 4% paraformaldehyde in 0.1 M phosphate buffer. Subsequently, 40 µm coronal sections were cut using a VT1200S Vibratome (Leica Microsystems). Every fourth section was collected to form a series and immunoreactivity was performed for mCherry (hM4Di) but not for GFP as the endogenous fluorescence was sufficient to detect infected cells.
For hM4Di-injected rats, free-floating sections were rinsed several times in 0.1 M phosphate-buffered saline (PBS), permeabilized with 0.3% Triton-X in 0.1 M PBS (PBS-T), blocked for 1 h (blocking solution: 4% normal goat serum in PBS-T), and placed in 1:1,000 rabbit anti-RFP (CliniSciences, PM005) at room temperature for 24 h. Sections were then washed in PBS-T and incubated in 1:500 biotin goat anti-rabbit (Jackson ImmunoResearch, 111-065-144) diluted in PBS-T for 2 h at room temperature. Sections were then rinsed with PBS and incubated with 1:400 A594-Streptavidin (Jackson ImmunoResearch, 016-580-084) in PBS. Finally, sections were rinsed in PB 0.1 M, then mounted, and coverslipped with Fluoroshield with DAPI (Invitrogen, 00-4959-52).
Slides were scanned using an upright fluorescent microscope system (Leica DM5500B) attached to a Jenoptik ProgRes MF Cool (#D-07739 Jena) that was equipped with a motorized stage (Märzhäuser #W215885) driven by the open-source acquisition software Micro-Manager (MMStudio V1.4.10). Acquired mosaics (10× objective) were stitched to reconstruct images, using the “Stitching - Grid/Collection” plugin of the Fiji freeware (Preibisch et al., 2009). Manual tracing of injection sites for each subject was performed using the mosaic images. These hand-traced injection sites were then superimposed onto corresponding atlas sections (Paxinos and Watson, 2014) and combined to visually convey the degree of overlap in virus expression across rats.
Experimental design and statistical analyses
All experiments used a mixed method design. In Experiment 1a, the between-subject factor was group (GFP vehicle, GFP DCZ, hM4Di vehicle, hM4Di DCZ) and within-subject factors were lever (active vs inactive), CS presentation (CS+ vs CS− or preCS, CS+, and CS−), and session/time. The final group sizes, after histology and outlier exclusions, were the following: group GFP vehicle, n = 14; group GFP DCZ, n = 14; group hM4Di vehicle, n = 11; and group hM4Di DCZ, n = 11. For the between-subject analysis, we used a planned orthogonal contrast analysis to compare group hM4Di DCZ with the control groups (GFP vehicle, GFP DCZ, and hM4Di vehicle). We then compared the GFP DCZ group to the two vehicle groups and, finally, the two vehicle groups (GFP vehicle vs hM4Di vehicle).
In Experiment 1b, the between-subject factor was group (GFP: n = 7 vs hM4Di: n = 8) and within-subject factors were lever (active vs inactive), CS presentation (CS+ vs CS− or preCS, CS+ and CS−), and session/time. All rats were injected with DCZ. For Experiment 2, the between-subject factor was again group (GFP: n = 13 vs hM4Di: n = 12) and within-subject factors were CS presentation (same vs different) and session/time, and all rats were injected with DCZ. Finally, in Experiment 3, the between-subject factor was group (GFP: n = 12 vs hM4Di: n = 12) and the within-subject factors were lever (reinstated or nonreinstated), period (pre- versus post-outcome delivery), and session/time. Again, all rats were injected with DCZ.
The dependent variables were the rate of lever presses or food port entries (FPE). Data were analyzed using a mixed-model ANOVA followed by simple effects analyses to establish the source of any significant interactions. Statistical significance was set at p ≤ 0.05. Data are presented as mean ± SEM, and individual data points are supplied on histograms. Rats were excluded if they failed to properly learn the pavlovian associations, defined as greater responding during CS− than during CS+ during the final 2 d of training for general PIT or greater responding during the preCS period versus the CS period during the final two training days for specific PIT. Four rats were excluded based on this criterion, two from general PIT and two from specific PIT.
Results
Histology
Schematics of viral expression in IC for each rat are illustrated in Figure 1A along with representative images (Fig. 1B). In total, two rats were excluded due to unilateral viral expression, one in Experiment 1a and one in Experiment 2.
A, Schematics of the viral expression in IC (in mm from bregma) for rats injected with the inhibitory DREADD (hM4Di) or control virus (GFP) for all experiments. B, Representative images from a rat in group GFP and a rat in group hM4Di (mCherry). Scale bars represent 1 mm in the map image (10×) and 50 µm in the magnified inset image (20×).
Experiment 1a: general transfer is impaired following IC inhibition
We first assessed the impact of chemogenetic inhibition of the IC on general PIT using a single outcome in pavlovian conditioning and instrumental training. Rats received bilateral IC injections of a virus carrying the inhibitory DREADD (hM4Di) or a control virus (GFP). One rat was excluded due to unilateral viral infection and two rats were excluded because they failed to properly discriminate between CS+ and CS− during pavlovian conditioning. This yielded the following between-subject group sizes: GFP vehicle: n = 14, GFP DCZ: n = 14; hM4Di vehicle: n = 11, hM4Di DCZ n = 11.
Following recovery, rats underwent the general PIT procedure with a single outcome, as shown in Figure 2A. They first received pavlovian conditioning during which CS+ was paired with the food reward and CS− was not. Rats were then given instrumental training whereby responding on the active lever earned the food reward while responding on the inactive lever was unrewarded. The food reward was the same for pavlovian and instrumental phases. Finally, all rats received a transfer test that assessed lever pressing in the presence of the two stimuli (CS+ and CS−). Rats were injected intraperitoneally with vehicle or DCZ 30 min before the general PIT test.
IC inhibition impairs general PIT when a single outcome is used. A, Outline of behavioral procedure, vehicle or DCZ was injected during the transfer test. B, Mean (±SEM) rate of food port entries (FPE) during CS+ (filled plot) and CS− (open plot) across pavlovian conditioning for GFP (blue) and hM4Di (red) rats. Calculated as FPE rate during the 2 min CS minus the FPE rate during the 2 min before each CS onset (preCS). Vehicle (veh) groups are shown in circles and DCZ groups are in squares (veh*; DCZ* indicates treatment to be during the general PIT test). C, Lever pressing rate (mean ± SEM) across instrumental training on active (filled plot) and inactive levers (open plot) for GFP (blue) and hM4Di (red) rats. D, Performance on the active lever during CS+ (filled bars) and CS− (open bars) in the general PIT test for GFP and hM4Di rats treated with vehicle or DCZ. Calculated as the pressing rate during each 2 min CS minus the pressing rate during the 2 min before each CS onset (preCS). E, Performance on active lever during the preCS (averaged across all preCS periods), CS+, and CS− in the transfer test. *statistical significance.
Pavlovian conditioning proceeded smoothly for all groups (Fig. 2B) such that the rate of food port entries was higher in the presence of the rewarded stimulus (CS+) than the unrewarded stimulus (CS−). There was also a main effect of session (F(1,46) = 133.67, p < 0.01), and a significant session × CS period interaction (F(1,46) = 41.82, p < 0.01) indicating that responding during both CS+ and CS− increased across training but this increase was greater for CS+ (F(1,46) = 107.52, p < 0.01) than CS− (F(1,46) = 25.19, p < 0.01). There was no main effect of group (largest F(1,46) = 0.32, p = 0.57) or significant interactions between group and any other factor (largest F(1,46) = 3.59, p = 0.064).
Rats also successfully acquired instrumental responding (Fig. 2C) with a main effect of lever (F(1,46) = 523.19, p < 0.01), session (F(1,46) = 307.28, p < 0.01), and a lever × session interaction (F(1,46) = 337.97, p < 0.01). Simple effects analyses conducted on the session × lever interaction indicated that lever pressing increased on both levers across training but this increase was greater on the active (F(1,46) = 332.49, p < 0.01) than the inactive lever (F(1,46) = 6.09, p = 0.02). There was no main effect of group (largest F(1,46) = 1.54, p = 0.22) or interactions between group and any other factor (largest F(1,46) = 2.28, p = 0.14).
To reduce baseline instrumental responding and pavlovian/instrumental response competition, all rats underwent a 30 min instrumental extinction session followed by a pavlovian extinction session on the day preceding the transfer test (data not shown). We detected a significant main effect of lever (F(1,46) = 376.33, p < 0.01), time (F(1,46) = 223.576, p < 0.01), and a significant lever × time interaction (F(1,46) = 161.48, p < 0.01). Simple effects analyses indicated that responding on both the active (F(1,46) = 205.75, p < 0.01) and inactive lever (F(1,46) = 15.31, p < 0.01) decreased across the session, although this decrease was greater for the active lever. There was no effect of group or any interactions between group and the other factors (largest F(1,46) = 0.63, p = 0.43). During pavlovian extinction, we observed a main effect of CS (F(1,46) = 7.26, p = 0.01) but no effect of time (F(1,46) = 1.994, p = 0.17) or group (largest F(1,46) = 1.29, p = 0.26). One significant group × CS interaction was detected (F(1,46) = 5.64, p = 0.02) such that group hM4Di vehicle responded more during CS+ than CS− (F(1,46) = 7.18, p = 0.01) but group GFP vehicle did not (F(1,46) = 0.31, p = 0.58).
Performance on the active lever during the general PIT test is shown in Figure 2D. Statistical analyses revealed a main effect of CS (F(1,46) = 11.67, p < 0.01), no main effect of group (largest F(1,46) = 0.56, p = 0.46), but, importantly, a significant CS × group interaction (F(1,46) = 4.47, p = 0.04). Simple effects indicated that groups GFP vehicle (F(1,46) = 4.42, p = 0.04), GFP DCZ (F(1,46) = 3.84, p = 0.056), and hM4Di vehicle (F(1,46) = 9.24, p < 0.01) pressed more during CS+ than CS− but this was not the case for group hM4Di DCZ (F(1,46) = 0.03, p = 0.86). We also examined responding during the CSs versus baseline (preCS; Fig. 2E). Here, there was no main effect of group (largest F(1,46) = 3.08, p = 0.09) but significant main effect of period such that, overall, rats pressed more during the CSs than during preCS (F(1,46) = 18.82, p < 0.01) and more during CS+ than CS− (F(1,46) = 12.56, p < 0.01). Unfortunately, the CS period × group interactions failed to reach significance (largest F(1,46) = 3.57, p = 0.065) precluding simple effects analyses on each group.
The rate of food port entries during the transfer test (Fig. 3A) was higher during CS+ than CS− (F(1,46) = 4.34, p = 0.04), and there was no effect of group (largest F(1,46) = 1.75, p = 0.19) or CS × group interaction (largest F(1,46) = 2.05, p = 0.16).
Food port entries during the general transfer tests. Mean number of food port entries (FPE) during CS+ and CS− during the transfer tests in Experiment 1a (A) and Experiment 1b (B). Due to potential response competition between FPE and lever presses during the transfer test, we plotted and analyzed total FPE during the four CS alone presentations prior to lever extension and the four CS presentations while the levers were present (i.e., 8 min total for CS+ and CS−). Data are presented as the total number of FPE during each CS period minus the number FPE during the 2 min period before each CS onset (preCS). *indicates statistical significance.
Finally, a subset of the same rats used for general PIT underwent a progressive ratio (PR) task to assess motivation to lever press under IC inhibition (GFP vehicle n = 6; GFP DCZ n = 6; hM4Di vehicle n = 5; hM4Di DCZ n = 5). Rats were injected with vehicle or DCZ 30 min prior to PR. During the PR, rats were required to press the lever an increasing number of times on a fixed ratio (FR) 3 to obtain successive food pellet rewards (+3 increments, e.g., 1 press for the first pellet, 4 presses for the second pellet, 7 presses for the third pellet, etc.). The breakpoint, defined as the last fixed ratio completed (number of lever presses to obtain the final pellet), was recorded. Statistical analyses revealed that the breakpoint was significantly lower for GFP rats injected with vehicle compared with GFP rats injected with DCZ (F(1,10) = 6.31, p = 0.03; Fig. 4A); however, the breakpoint did not differ for hM4Di rats injected with vehicle versus those injected with DCZ (F(1,8) = 1.86, p = 0.21; Fig. 4B).
Performance in a progressive ratio task. Breakpoints, defined as the last fixed ratio (FR) completed (i.e., number of lever presses to obtain the final pellet), for GFP (A) and hM4Di rats (B) that were injected with vehicle (white bars) or DCZ (black bars) 30 min prior to the progressive ratio task.
Experiment 1b: IC inhibition also impairs general transfer when two distinct outcomes are used
The procedure was identical to that described for Experiment 1a except that half of the rats were trained with sugar pellets during the pavlovian phase and grain pellets during the instrumental phase and vice versa for the remaining half of the rats (Fig. 5A). One rat was excluded for responding > 2 standard deviations from the mean during CS+ at the end of the pavlovian extinction session (>20 entries per min during final two extinction trials). As in Experiment 1a, rats received bilateral IC injections of the inhibitory hM4Di virus (n = 8) or the control GFP virus (n = 7), and all rats were injected with DCZ 30 min prior to the general PIT test.
IC inhibition impairs general PIT when different pavlovian and instrumental outcomes are used. A, Outline of the behavioral procedure, DCZ was injected during the transfer test. B, Mean (±SEM) rate of food port entries (FPE) during CS+ (filled plots) and CS− (open plots) across pavlovian conditioning for GFP and hM4Di rats. Calculated as the rate of entries during the 2 min CS minus the rate of entries during the 2 min before CS onset (preCS). C, Mean (±SEM) lever pressing rate across instrumental training on the active (filled plots) and inactive lever (open plots) for GFP and hM4Di rats. D, Performance on the active lever during CS+ (filled bars) and CS− (open bars) in the general PIT test for GFP and hM4Di rats under DCZ injection. Performance is calculated as the pressing rate during each 2 min CS minus the pressing rate during the 2 min before each CS onset (preCS). E, Performance on active lever during the preCS (averaged across all preCS periods), CS+, and CS− periods in the transfer test. *statistical significance.
Across pavlovian conditioning (Fig. 5B), food port entries were greater during CS+ than during CS− (F(1,13) = 39.38, p < 0.01) and entries increased across training sessions (F(1,13) = 38.27, p < 0.01) but this increase was greater for CS+ (CS × session interaction: F(1,13) = 9.83, p < 0.01; simple effect of session for CS+: F(1,13) = 23.15, p < 0.01; and CS−: F(1,13) = 7.87, p = 0.02). GFP and hM4Di groups did not differ (F(1,13) = 0.29, p = 0.60) and there were no interactions between group and any other factors (largest F(1,13) = 1.4, p = 0.26).
Instrumental training (Fig. 5C) was also successful with statistical analyses revealing a main effect of session (F(1,13) = 56.28, p < 0.01), lever (F(1,13) = 165.10, p < 0.01), and a lever × session interaction (F(1,13) = 66.88, p < 0.01). Simple effects analyses revealed that responding increased on the active (F(1,13) = 62.33, p < 0.01) but not inactive lever (F(1,13) = 0.98, p = 0.34). There was no main effect of group or any interactions between group and the other factors (largest F(1,13) = 2.18, p = 0.16).
Lever pressing decreased across the instrumental extinction session (data not shown), with a main effect of time (F(1,13) = 117.67, p < 0.01), lever (F(1,13) = 146.52, p < 0.01), and a time × lever interaction (F(1,13) = 109.03, p < 0.01). Simple effects analyses indicated that responding decreased across time on the active (F(1,13) = 125.31, p < 0.01) but not the inactive lever (F(1,13) = 2.92, p = 0.11). No main effect of group was detected (F(1,13) = 0.63, p = 0.44) but the three-way interaction approached significance (F(1,13) = 4.29, p = 0.06), suggesting that the decrease in responding on the active lever was perhaps greater in group GFP than group hM4Di. During pavlovian extinction (data not shown), food port entries were greater during CS+ than during CS− (F(1,13) = 6.21, p = 0.03), and there was no main effect of group, time, or any significant interactions (largest F(1,13) = 2.83, p = 0.12).
Figure 5D shows responding on the active lever during the general PIT test.
Statistical analyses revealed a main effect of CS (F(1,13) = 11.98, p < 0.01) and group (F(1,13) = 8.19, p = 0.01), indicating that pressing was greater during the CS+ and that GFP rats pressed more than hM4Di rats. Importantly, there was also a significant CS × group interaction (F(1,13) = 5.37, p = 0.04), and simple effects analyses indicated that group GFP responded more during CS+ than CS− (F(1,13) = 15.65, p < 0.01) but this was not the case for group hM4Di (F(1,13) = 0.7, p = 0.42). Similarly, when comparing responding during the CSs versus baseline (preCS; Fig. 5E), we observed a main effect of group (F(1,13) = 6.8, p = 0.02) and period (preCS vs CSs: F(1,13) = 14.0, p < 0.01; CS+ vs CS−: F(1,13) = 15.41, p < 0.01) as well as significant group × period interactions (smallest F(1,13) = 6.51, p = 0.02). Simple effects analyses indicated that, overall, group GFP pressed more during the CSs than during preCS (F(1,13) = 8.19, p = 0.01) and, importantly, they pressed more during CS+ than during CS− (F(1,13) = 19.66, p < 0.01). In contrast, the presence of the CSs did not impact lever pressing in group hM4Di with no significant difference in pressing observed during the CSs compared with preCS (F(1,13) = 0.41, p = 0.53) or during CS+ versus CS− (F(1,13) = 1.01, p = 0.33). Food port entries did not differ between groups with no main effect of CS, group, or CS × group interaction detected (largest F(1,13) = 2.48, p = 0.14; Fig. 3B).
Experiment 2: specific transfer is impaired following IC inhibition
We next examined if IC is also required for specific PIT. Rats received bilateral IC injections of the inhibitory hM4Di virus (n = 14) or the control GFP virus (n = 14). One rat was excluded due to misplaced viral expression and two rats were excluded because they failed to learn pavlovian conditioning. This yielded the following between-subject group sizes: GFP: n = 13, hM4Di: n = 12. All rats were injected intraperitoneally with DCZ before the specific PIT test.
Following recovery from surgery, rats underwent the specific PIT protocol (Fig. 6A). They first received pavlovian conditioning during which two distinct conditioned stimuli (CS1 and CS2) were associated with two different food outcomes (O1 and O2). Rats were then trained to perform one action (e.g., left lever) to earn one outcome (e.g., O1) and another action (e.g., right lever) to earn a different outcome (e.g., O2). Finally, all rats were injected with DCZ and then received a transfer test that assessed responding on both levers during the presence of the two stimuli (CS1 and CS2).
IC inhibition abolishes specific PIT. A, Outline of the behavioral procedure. B, Mean (±SEM) rate of food port entries (FPE) during CSs (clicker and tone averaged) across pavlovian conditioning for GFP (circles) and hM4Di rats (squares). Calculated as the rate of entries during the 2 min CS minus the rate of entries during the 2 min before CS onset (preCS). C, Mean (±SEM) lever pressing rate across instrumental training (left and right lever averaged) for GFP (circles) and hM4Di rats (squares). D, Lever pressing during the CS that shared the same outcome (Same; filled bars) and the different outcome (different; open bars) in the specific PIT test for GFP and hM4Di rats under DCZ injection. Performance is calculated as the rate of pressing during the 2 min CS minus the rate of pressing during the 2 min before CS onset (preCS).
As shown in Figure 6B, pavlovian conditioning was successful for both GFP and hM4Di groups with rats increasing their food port entries across sessions (F(1,23) = 23.69, p < 0.01). There was no difference between GFP and hM4Di groups (F(1,23) = 0.01, p = 0.92) and no significant interaction (F(1,23) = 0.47, p = 0.50). Rats also acquired instrumental responding, with a main effect of session (F(1,23) = 248.23, p < 0.01) but no effect of group or session × group interaction (Fig. 6C; largest F(1,23) = 0.9, p = 0.35).
All rats were then given an instrumental extinction session followed by a pavlovian extinction session on the day preceding the transfer test (data not shown). Instrumental responding decreased across the extinction session (F(1,23) = 133.42, p < 0.01), and there was no significant difference between groups or a significant group × time interaction (largest F(1,23) = 2.63, p = 0.12). Food port entries across the pavlovian extinction session did not differ between groups (F(1,23) = 0.66, p = 0.43) and there was no effect of time or group × time interaction (largest F(1,23) = 0.85, p = 0.37).
The results from the specific PIT test are shown in Figure 6D. Data are plotted as the rate of lever presses (collapsed across A1 and A2) during the stimulus (CS1 or CS2) that shared a common outcome (same condition) or a different outcome (different condition). Statistical analyses revealed no main effect of group (F(1,23) = 0.03, p = 0.86) or lever (F(1,23) = 3.96, p = 0.06) but a significant group × lever interaction (F(1,23) = 4.5, p = 0.045). Simple effects conducted on the interaction indicated that GFP rats responded significantly more on the same lever than on the different lever (F(1,23) = 8.81, p = 0.007); however, hM4Di rats did not (F(1,23) = 0.008, p = 0.93). Specific PIT can manifest as increased responding on the lever associated with the same outcome as the presented CS or as a suppression of responding on the lever associated with a different reward from the presented stimulus (Delamater and Holland, 2008; Holmes et al., 2010). Inspection of Figure 6D suggests that the specific PIT effect in GFP rats is driven by the latter. One sample t tests indeed confirmed that responding on the same lever did not differ significantly from zero for both GFP (p = 0.14) and hM4Di rats (p = 0.49). However, the comparison of responding on the different lever to zero approached significance for GFP (p = 0.06) but not hM4Di rats (p = 0.63). Thus, IC inhibition may have impaired the suppression of responding on the different lever, which appears to be driving the specific transfer effect in this case.
Finally, we observed that food port entries were greater during the CSs than during the preCS periods (F(1,23) = 11.06, p < 0.01), with no difference between groups (F(1,23) = 0.09, p = 0.77) or CS period × group interaction (F(1,23) = 0.21, p = 0.65; Fig. 7).
Food port entries during the specific transfer test. Mean food port entries (FPE) during the CSs and the preCS period (Pre) during the specific transfer test. Due to potential response competition between FPE and lever presses during the transfer test, we plotted and analyzed total FPE during the four CS alone presentations prior to lever extension and the four CS presentations while the levers were present. Data are presented as the total number of FPE during each CS period minus the number FPE during the 2 min period before each CS onset (preCS). *statistical significance.
Experiment 3: IC inhibition abolishes outcome-selective reinstatement
The previous experiments showed that IC inhibition caused deficits in both general and specific PIT. Importantly, PIT relies on the ability of a stimulus-induced outcome representation to guide instrumental action. In this final experiment, we asked if similar deficits would be observed if the outcome itself was physically present. That is, would IC inhibition also impair the ability of an appetitive outcome to reinstate responding on extinguished actions. To achieve this, we used outcome-induced reinstatement, whereby a previously extinguished instrumental action is reinstated by “free” delivery of its associated outcome (Fig. 8A). Rats received bilateral IC injections of the hM4Di (n = 12) or the control GFP virus (n = 12) and no rats were excluded. All rats were injected intraperitoneally with DCZ 30 min prior to the reinstatement test.
IC inhibition impairs the selectivity of outcome-induced reinstatement. A, Behavioral procedure. B, C, Lever pressing rate (±SEM) across instrumental training and the extinction period that preceded the reinstatement test, respectively. D, Lever pressing rate during the reinstatement test before (Pre; averaged across levers) and after outcome deliveries on the nonreinstated (NR) and reinstated lever (Reinst). E, Food port entry (FPE) rate pre- and post-outcome delivery during the test. F, Behavioral procedure for the outcome-induced reinstatement test with a novel outcome (O3). G, Lever pressing rate (±SEM) across the extinction period that preceded the reinstatement test. H, Lever pressing rate pre- and post-outcome deliveries, averaged across levers. I, FPE rate pre- and post-outcome deliveries.*,*** statistical significance.
Following recovery from surgery, rats received instrumental training with two A–O associations (A1–O1; A2–O2). This training proceeded smoothly (Fig. 8B), with responding increasing across days (F(1,22) = 110.70, p < 0.001) and no significant difference between groups or significant group × session interaction (largest F(1,22) = 1.14, p = 0.30). Figure 8C shows lever pressing across the 15 min extinction session that immediately preceded the outcome-induced reinstatement test. There was a main effect of group, such that hM4Di rats pressed less than GFP rats (F(1,22) = 15.736, p = 0.001) and a main effect of time (F(1,22) = 62.361, p < 0.001) as well as a group × time interaction that approached significance (F(1,22) = 4.22, p = 0.052).
The results from the outcome-induced reinstatement test are shown in Figure 8D. Overall, GFP rats responded significantly more than hM4Di rats (F(1,22) = 4.42, p = 0.047). There was also a significant main effect of responding before (pre) versus after outcome deliveries, averaged across the nonreinstated (NR) and reinstated (Reinst) levers, such that lever pressing increased following outcome deliveries (F(1,22) = 36.40, p < 0.001). There was also a significant interaction with group such that the difference in instrumental responding before versus after outcome delivery was greater in the GFP group than in the hM4Di group (F(1,22) = 7.40, p = 0.01). However, simple effects analyses confirmed that responding was indeed higher after outcome delivery for both groups (GFP: F(1,22) = 38.31, p < 0.001; hM4Di: F(1,22) = 5.49, p = 0.03), indicating successful (nonselective) outcome-induced reinstatement in GFP and hM4Di groups.
We then assessed if outcome-selective reinstatement was intact by examining the difference in responding on the nonreinstated versus reinstated lever immediately following outcome delivery. While no significant overall difference was detected (F(1,22) = 0.77, p = 0.39), there was a significant interaction with group (F(1,22) = 4.43, p = 0.047). Simple effects analyses indicated that group GFP responded more on the reinstated than the nonreinstated lever (F(1,22) = 4.45, p = 0.047), but hM4Di rats did not (F(1,22) = 0.76, p = 0.39). That is, in the 2 min following each outcome delivery, GFP-injected rats performed the action that was trained with the reinstating outcome more than the action trained with the nonreinstating outcome. Thus, while both GFP and hM4Di rats showed nonselective outcome-induced reinstatement (albeit less so in hM4Di rats), only GFP rats showed outcome-selective reinstatement. Food port entries (Fig. 8E) during the 2 min period after outcome deliveries were greater than during the 2 min prior to outcome deliveries (F(1,22) = 78.95, p < 0.001), with no difference between groups or significant period × group interaction (largest F(1,22) = 0.61, p = 0.44).
Given the differences in overall rates of lever pressing during the extinction period, we also expressed responding pre- and post-outcome delivery as a percentage of baseline responding during the extinction period (data not shown). Note that the baseline period was calculated as average responding for each lever during the first 13 min; the last 2 min were excluded as they correspond to the pre-outcome period for the first pellet delivery. Statistical analyses revealed no main effect of group (F(1,22) = 0.02, p = 0.9) and confirmed a significant main effect of responding before (pre) versus after outcome deliveries, averaged across the nonreinstated (NR) and reinstated (Reinst) levers, such that lever pressing increased following outcome deliveries (F(1,22) = 25.56, p < 0.001). There was no significant interaction with group (F(1,22) = 1.58, p = 0.22), which confirms that nonselective outcome-induced reinstatement was intact in both GFP and hM4Di groups. Moreover, group GFP responded more on the reinstated lever than the nonreinstated lever but this was not the case for hM4Di-injected rats (no main effect of lever: F(1,22) = 1.54, p = 0.29; group × lever interaction: F(1,22) = 4.18, p = 0.05; simple effect for GFP group: F(1,22) = 5.39, p = 0.03; but not hM4Di group: F(1,22) = 0.32, p = 0.58).
Finally, to confirm that IC inhibition did indeed leave nonselective outcome-induced reinstatement intact, rats received one session of instrumental retraining and then underwent an outcome-induced reinstatement test with a novel food reward (O3; Fig. 8F). During the extinction period (Fig. 8G), overall responding significantly decreased (F(1,22) = 29.25, p < 0.001) and there was no main effect of group or group × time interaction (largest F(1,22) = 0.87, p = 0.36). The outcome-induced reinstatement results are shown in Figure 8H, averaged across the two levers. There was a main effect of period (F(1,22) = 7.76, p = 0.01), indicating that lever pressing was greater post-outcome delivery and there was no effect of group or group × period interaction (largest F(1,22) = 2.59, p = 0.12). Thus, the novel food reward successfully reinstated instrumental responding in both GFP and hM4Di rats. Food port entries (Fig. 8I) were again greater during the 2 min period after outcome deliveries than during the 2 min prior to outcome deliveries (F(1,22) = 23.74, p < 0.001), with no difference between groups or significant period × group interaction (largest F(1,22) = 0.85, p = 0.37).
Discussion
We examined the role of the gustatory region of the IC in the general and specific forms of PIT. Our findings revealed that chemogenetic inhibition of the IC during the transfer test attenuated general and specific PIT. It should be noted that we did observe a slightly different pattern of results in our two general transfer experiments. In the single outcome design, IC inhibition abolished general PIT when we directly compared CS+ to CS− but, when we compared with baseline, the group with IC inhibition did not differ significantly from the control groups. However, in the two outcome design, IC inhibition impaired general transfer both when comparing CS+ to CS− and when responding during the CSs was compared with baseline. Importantly, IC inhibition did not diminish the rats' motivation to perform an instrumental action in the progressive ratio task and the expression of pavlovian associations appeared intact, at least to some extent. Indeed, IC-inactivated rats visited the food port more during the rewarded stimulus than during the unrewarded stimulus in general transfer and visited the food port more frequently during the reward-predictive stimuli than during the baseline period in specific transfer. However, we cannot be sure if this food port behavior in specific transfer reflects an outcome-specific expectancy, as only a single food port was used for delivering the outcomes predicted by both stimuli. Therefore, the pavlovian conditioned responses observed could result from either a general or specific reward expectancy.
Our transfer results indicate that IC is required to retrieve both the general rewarding and sensory-specific representation of an absent outcome to guide responding. These results are consistent with findings showing that inhibition of gustatory IC renders rats unable to retrieve the value of an absent outcome in devaluation studies (Balleine and Dickinson, 2000; Parkes and Balleine, 2013; Parkes et al., 2015, 2018). In contrast, when the outcome is experienced (as in outcome reinstatement), we found that rats with IC inhibition could use the general rewarding properties of the outcome to energize instrumental responding but not its sensory-specific features to selectively guide responding. This failure to show outcome-selective reinstatement is unlikely the result of reduced outcome discriminability, as perturbing IC leaves sensory-specific satiety intact such that rats consume less of a sated food outcome compared with a nonsated food outcome (Parkes and Balleine, 2013; Parkes et al., 2015, 2018).
The involvement of IC in both general and specific transfer is consistent with evidence that IC neurons respond to pavlovian cues predicting the availability of multiple tastants (Samuelsen et al., 2012, 2013) and show selective responses to auditory cues predicting distinct outcomes (Gardner and Fontanini, 2014; Vincis and Fontanini, 2016; Vincis et al., 2020). This anticipatory activity in IC is also functionally relevant. Vincis et al. (2020) used a perceptual discrimination task in which head-restrained mice were trained to lick a central spout to receive one of four taste cues during a sampling epoch. The mice then had to wait through a short delay before deciding whether to lick left or right for a water reward. Each taste cue signaled that the left or right lick would be rewarded: sweet sucrose or bitter quinine indicated that mice should perform a left lick for the water reward, while sweet maltose or bitter sucrose octaacetate indicated a right lick. Optogenetic inactivation of IC during sampling of the tastant had no impact on performance; however, the same manipulation during the delay epoch decreased the number of correct choices (Vincis et al., 2020). Moreover, consistent with the current results, immediate early gene activation is increased in IC following presentation of a pavlovian reward-predictive cue (Dardou et al., 2006, 2007; Kerfoot et al., 2007; Saddoris et al., 2009), and this activation appears to be outcome specific (Saddoris et al., 2009).
The associative cybernetic model of instrumental conditioning is currently the most comprehensive theoretical account for transfer effects (Dickinson and Balleine, 1994; Balleine and Ostlund, 2007; Cartoni et al., 2013). A key feature of this model is that transfer is mediated by a stimulus-induced outcome representation, which serves as a stimulus to trigger the representation, and subsequent performance, of the associated action. The emergence of this SO–A chain results notably from the ability to use the backward A–O association (i.e., the O–A association), where the representation of the outcome (SO) leads to a mental representation of the corresponding action, resulting in the activation of motor responses (S1O1–A1; S2O2–A2), thus generating specific PIT. In addition, the model proposes that reward-predictive stimuli can prompt a general expectancy for reward, which in turn excites motor responses through a Sreward–A chain.
Thus, the model proposes that both forms of PIT rely on the use of a S–O–A chain through an O–A association (Balleine and Ostlund, 2007). Based on this model, the lack of transfer effects observed in rats with IC inhibition may result from either a failure to represent the outcome as a stimulus (SO), which is required to activate the representation of the associated action (A), or an inability to use the triggered A–O representation in a backward manner. In both cases, there appears to be a failure to use the SO–A associative chain within S–A memory, reflecting an inability of stimulus-induced outcome expectations to influence instrumental actions. It must also be noted that the impairment in specific PIT could instead result from a failure to recall the specific S–O associations. While rats with IC inhibition entered the food port more during the reward-predictive stimuli than during the preCS period, we are unable to conclude if these rats were recalling the specific (S1–O1 and S2–O2) associations or simply a general expectation of reward during the CSs (S-reward).
While we have shown that the IC contributes to the expression of both forms of PIT, distinct insular pathways are likely at play. The IC is anatomically connected to the basolateral amygdala (BLA; Sripanidkulchai et al., 1984; Yamamoto et al., 1997; Yamamoto, 2006; Gehrlach et al., 2020), the OFC (Barreiros et al., 2021; Chen et al., 2022; Mathiasen et al., 2023), and the nucleus accumbens shell (Allen et al., 1991), which have all been functionally implicated in specific PIT. One might therefore hypothesize that communication between the IC and the BLA, accumbens shell, and lateral or medial OFC could support specific, but not general, PIT. Consistent with this suggestion, previous studies have shown that pharmacological inactivation of the BLA reduces the neural excitatory responses to predictive stimuli in the IC (Samuelsen et al., 2012) and evidence from outcome devaluation studies proposes that the BLA updates and encodes information about the value of the instrumental outcome and then sends this information to IC, where it is retrieved to guide behavior (Parkes and Balleine, 2013).
By contrast, general PIT may require interaction between the IC and the central amygdala (CeA; Gehrlach et al., 2020) and/or NAc core (Wright and Groenewegen, 1996; Reynolds and Zahm, 2005), as these regions are involved in mediating the general excitatory influences of reward-predictive stimuli on instrumental action (Hall et al., 2001; Holland and Gallagher, 2003; Corbit and Balleine, 2005, 2011; Lex and Hauber, 2008; Lingawi and Balleine, 2012). This is supported by the crucial role of CeA in generating stimulus-induced general motivation for rewards (Warlow and Berridge, 2021) and evidence showing the IC→CeA pathway contributes to taste-related choice behavior (Schiff et al., 2018). We propose that a double dissociation may therefore exist at the cortico-striatal and cortico-amygdalar level, such that IC→core and IC→CeA may support general (but not specific) transfer, and IC→shell and IC→BLA potentially mediate specific (not general) PIT. While research on the cortical regions mediating general transfer has remained relatively limited in comparison with specific PIT, it has been shown that the neural activity of medial PFC (prelimbic region) and lateral OFC neurons is selectively modulated during the presentation of the CS+ compared with the CS− and baseline periods during a general PIT test (Homayoun and Moghaddam, 2009). Thus, communication between the IC and the OFC may be required for both general and specific transfer.
Our results provide the first demonstration that the IC mediates the ability of predictive cues to exert both a general and specific influence over instrumental responding, likely via the retrieval of outcome representations to guide behavior. Moreover, we show that rats with IC inhibition are unable to use the specific sensory features of an experienced outcome to retrieve its associated action. This suggests a deficit in the use of backward A–O associations. These findings significantly contribute to our understanding of the cortical bases of general and specific PIT and add to the growing body of literature investigating the broad role of the IC in decision-making.
Footnotes
This work was supported by the French National Agency for Scientific Research (CoCoChoice ANR-19-CE37-0004-07) and the French government in the framework of the University of Bordeaux’s IdEx “Investments for the Future” program/GPR BRAIN_2030. We thank Y. Salafranque for animal care as well as Gilles Courtand and Angélique Faugère for their technical assistance. We also thank Dr. Etienne Coutureau for comments on an earlier version of this manuscript.
The authors declare no competing financial interests.
- Correspondence should be addressed to Shauna L. Parkes at shauna.parkes{at}u-bordeaux.fr.