Abstract
The cerebral cortex, cerebellum and basal ganglia are essential for flexible learning in mammals. Although traditionally thought to operate under different learning rules, recent evidence suggests that both the basal ganglia and the cerebellum may employ reinforcement learning mechanisms. This raises the question of how these structures coordinate when a common reward prediction error mechanism is active. To address this issue, we first examined output signals from the basal ganglia and cerebellum following the activity of the cerebral cortex. We recorded single-neuron activity from the output regions of the cerebellum and basal ganglia - the cerebellar nuclei (CN) and substantia nigra pars reticulata (SNr) - in both male and female ChR2 transgenic rats. Neurons in the CN and SNr exhibited distinct temporal response patterns; notably, the fast excitatory response in the CN, driven by mossy fiber input, was synchronized with the inhibitory response in the SNr, mediated via the direct pathway. Using these experimental findings together with connectome data, we developed both a semi-realistic spiking network model and a reservoir-based reinforcement learning model. In the latter model, successful learning depended on synaptic plasticity in both the cerebellum and basal ganglia with a temporal precision on the order of 10 ms. Furthermore, cortical β-oscillations enhanced learning and optimal reinforcement learning occurred when the output of cerebellar and basal ganglia signal phase-locked at the frequency of cortical oscillation. Taken together, our results suggest that the coordinated output of the cerebellum and basal ganglia, driven by tightly tuned cortical input, underlies brain-wide synergistic reinforcement learning.
Significance Statement The cerebral cortex, cerebellum, and basal ganglia support learning. Recent research suggests that both the basal ganglia and cerebellum use a similar learning process called reinforcement learning, which involves predicting rewards. To understand how these brain regions work together, we recorded brain activity in rats while photo-stimulating the cerebral cortex. We found that two types of responses in the cerebellum and basal ganglia were synchronized, which might help activate the cerebral cortex. A computer model showed that precise timing of signals from both the cerebellum and basal ganglia is important for learning. This timing was important only when the cerebral cortex worked in a specific frequency range. Our findings suggest that coordinated brain activity enhances learning.
Footnotes
The authors have no conflicts of interest to declare.
We thank T. Shimada, R. Mizuno, Reiko Hira for animal husbandry and genotyping, M. Kawabata, A. Rios, T. K. Fujita, Sakairi, S.L. Smith, J.N. Stirman, S. Aoki, A. Funamizu K. Ishizu, S. Tsutsumi, M. Morishima, T. Ishikawa, T. Yamazaki, H. Mori for technical advices and discussion. We also thank the editor and reviewers for helpful comments. This work was supported by JP22wm0525007 (RH), JP24wm0625405 (RH), JP19dm0207089 (YI) from AMED, JP24H02156 (RH), JP22H02731 (RH), JP20K22678 (RH), JP21B304 (RH), JP21H05134 (RH), JP21H05135 (RH), JP16H06276 (YI), JP21H0524 2(YI), JP19H03342 (YI), JP23H02589 (YI), and JP20H05053 (YI) from MEXT/JSPS, JPMJCR1751 (YI) from JST, Nakatani Foundation (RH), Shimadzu Foundation (RH), Takeda Science Foundation (RH), Takeda Science Foundation (YI), The Precise Measurement Technology Promotion Foundation (RH), Tateishi Science and Technology Foundation (RH), and Research Foundation for OptoScience and Technology (RH).
↵*These authors equally contributed to this work.