PROTECT YOUR DNA WITH QUANTUM TECHNOLOGY
Orgo-Life the new way to the future Advertising by Adpathway-
Loading metrics
Open Access
Peer-reviewed
Research Article
- Tomas G. Aquino,
- Robert Kim,
- Nuttida Rungratsameetaweemana
x
- Published: July 2, 2026
- https://doi.org/10.1371/journal.pbio.3003831
Abstract
Flexible behavior requires the ability to modulate sensory processing based on task context, yet the circuit-level mechanisms supporting this capacity remain poorly understood. Here, we combine recurrent neural network modeling and neural recordings from mouse visual cortex to investigate how task context shapes sensory coding. Networks trained on an instruction-based discrimination task develop a disinhibitory interneuron-to-interneuron motif that dynamically gates task-relevant sensory information. Perturbation and lesion analyses show that this motif is necessary for task performance and for maintaining distinct sensory representations across contexts. We validate key predictions in mouse visual cortex, where interneuron activity patterns exhibit comparable task-dependent modulation. These results identify a biologically plausible circuit motif that supports flexible sensory processing and link recurrent connectivity structure to adaptive context integration in both artificial and biological systems.
Citation: Aquino TG, Kim R, Rungratsameetaweemana N (2026) Disinhibitory signaling enables flexible coding of top-down information in cortical networks. PLoS Biol 24(7): e3003831. https://doi.org/10.1371/journal.pbio.3003831
Academic Editor: Adam Kohn, Yeshiva University Albert Einstein College of Medicine, UNITED STATES OF AMERICA
Received: July 6, 2025; Accepted: May 18, 2026; Published: July 2, 2026
Copyright: © 2026 Aquino et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: The code for the analyses performed in this study is available in Zenodo at https://doi.org/10.5281/zenodo.20148662 (Shared code for Disinhibitory signaling enables flexible coding of top-down information in cortical networks). The trained RNN models used in the study, as well as the analysis data, are available in Zenodo at https://doi.org/10.5281/zenodo.20148306 (Shared data for Disinhibitory signaling enables flexible coding of top-down information in cortical networks).
Funding: This work was funded by the ARL Human Guided Intelligent Systems grant (W911NF-23-2-0067) and the Strengthening Teamwork for Robust Operations in Novel Groups (STRONG) grant (W911NF-22-2-0148). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Introduction
Successfully navigating complex environments requires neural systems to efficiently encode and process incoming sensory inputs alongside contextual cues and task goals [1–3]. This capacity to integrate across different sources of information allows the brain to dynamically convert identical sensory inputs into diverse behavior in a context-dependent manner. Sensory inputs can be understood as bottom-up signals with regard to their encoding within lower cortical areas. In contrast, top-down factors refer to information regarding the behavioral relevance of each stimulus, prior expectations about sensory statistics, and task contexts which provide guidelines for how the encoded sensory information should be interpreted and mapped onto behavior [4–8].
The ability to rapidly adapt to temporally varying top-down factors is a hallmark of cognitive flexibility and has been primarily attributed to the frontoparietal networks and medial temporal lobes [9–15]. However, the extent to which lower cortical areas contribute to this adaptive process is not well understood. Past work has shown that responses within sensory areas could be modulated by top-down signals such as selective attention [16–25], and that nonsensory information such as memory content can be decoded from activity within sensory areas [26–29]. Yet, it remains poorly understood how these neural codes complement computations that take place in higher cortical areas. This knowledge gap highlights the need for deeper insight into the computational principles and circuit mechanisms that enable flexible information processing for adaptive behavior.
In nature, where identical stimuli elicit diverse responses, inhibition may be a key mechanism in averting conflicting behaviors. This ensures the selection of appropriate sensorimotor mapping, aligning behavioral outcomes with the current task contexts rather than being driven solely by the stimuli. This account of inhibition as a modulatory mechanism, crucial for shaping task-optimized behavior, has been supported by past studies demonstrating the role of inhibition for the integration of top-down factors within the cortical hierarchy [17,30–33]. In addition, discoveries of specialized sensory microcircuits that rely on inhibitory-to-inhibitory connections among local interneurons further highlight the importance of inhibitory function in flexible information processing [34–39]. Notably, disinhibitory circuits have been proposed as key in facilitating top-down modulation of sensory coding [40–43] and novelty coding [44,45], highlighting the plastic nature of neural codes within lower cortical areas. This account underscores the flexibility of neural codes along the cortical hierarchy and suggests a potential mechanism for sensory multiplexing [46,47].
Models based on recurrent neural networks (RNNs) of continuous-variable firing rate units have been widely used to propose computational mechanisms for previously observed experimental findings and to explore neural dynamics associated with cognitive functions such as attention and working memory [9,48–55].
Still, further work is required to understand how top-down information and bottom-up stimuli are integrated to efficiently represent current task demands. Importantly, these top-down factors, including selective attention, task contexts, and prior knowledge, can exhibit distinct temporal structures, imposing varied but complementary guiding rules on how sensory input should be processed. This results in temporally rich and highly dynamic decision-making, a phenomenon that can be modeled with artificial neural networks, including RNNs [56]. Neglecting to account for the nuanced dynamics of top-down factors and their temporal influence limits our understanding of how neural systems efficiently integrate bottom-up and top-down signals to consistently produce optimal behavior. Additionally, we can gain better insights into this dynamic process through modifying the architecture of past models to incorporate several fundamental properties inherent in biological neural networks. These constraints, such as heterogeneous neuronal timescales, cortical hierarchies, and the balance between inhibitory and excitatory connections, can offer valuable insights into the functionality of neural networks and the types of computations they can perform [57–60].
To address these knowledge gaps, we designed dynamic decision-making tasks that require seamless integration of bottom-up sensory inputs with top-down factors. These tasks included diverse top-down signaling with varying temporal dynamics as well as orthogonal manipulation of bottom-up and top-down signals. By training biologically plausible RNNs on these tasks and parsing the trained networks, we show that disinhibitory circuit motifs governed by inhibitory-to-inhibitory connections are critical for the intricate interplay between the two essential components of decision processes. By extending our model to incorporate the functional cortical hierarchy, we further demonstrate that feedback signaling from higher-level areas and selective local inhibition enable lower-level areas to differentially process identical sensory inputs in a context-dependent manner. Moreover, we tested our model predictions using publicly available data recorded from the mouse primary visual cortex [61] and confirmed that local inhibitory-to-inhibitory connections within the visual cortex play an integral role in encoding top-down information.
Results
RNN model and dynamic decision-making tasks
To understand the cortical computations required for flexibly and optimally integrating bottom-up and top-down information, we employed a biologically plausible RNN model consisting of continuous-variable firing units (model schematic shown in Fig 1a right; see Methods) and trained the model to perform novel dynamic decision-making tasks (Fig 1a, 1b). We trained the RNNs using a supervised gradient-descent procedure, in which the network learned to adjust its recurrent synaptic weights, output readout weights, and neuron-specific synaptic decay time constants to perform the task accurately (Fig 1; see Methods). To preserve biological interpretability, we enforced Dale’s principle by constraining each unit to be either purely excitatory or purely inhibitory, using a sign-constrained weight parametrization [48]. These constraints ensure that learned network dynamics can be meaningfully related to cortical circuit motifs, enabling mechanistic interpretation of how specific inhibitory circuits support flexible encoding of top-down information.
Fig 1. Dynamic decision-making tasks and model schematic.
a. Schematic of a one-modality DMS task with sequential stimuli separated by a delay period. A task cue, indicating the stimulus-response mapping, is presented either before (early instruction) or after stimulus 1 (late instruction). The recurrent neural networks (RNNs) generate an output of +1 or −1 for matched and mismatched trials, respectively (pro-DMS; and vice versa for anti-DMS). The schematic of the RNN model is shown on the right. b. Schematic of a two-modality DMS task where each stimulus comprises two modalities. In addition to the task cue, an attention cue is presented on each trial, indicating the modality on which the RNNs need to perform the DMS task. The task and attention cues are presented either before (early instruction) or after stimulus 1 (late instruction). The trial timing for both one- and two-modality DMS tasks is depicted in a. c. Testing performance of the trained RNNs on the DMS tasks. The average accuracy of the trained RNNs is presented, with statistical comparisons computed using two-sided Wilcoxon rank-sum tests. Boxplot: central lines, median; bottom and top edges, lower and upper quartiles; whiskers, 1.5 × interquartile range; outliers are not plotted. *P < 0.05; **P < 0.01; ***P < 0.001. Source data can be found in Zenodo at https://doi.org/10.5281/zenodo.20148306.
These RNNs were employed to execute flexible decision-making tasks designed to achieve several distinct objectives. The first aim was to investigate how identical sensory inputs might be processed differently depending on distinct top-down inputs. These tasks also allowed for the independent manipulation of both bottom-up and top-down inputs. Lastly, the task incorporated separate stimulus modalities, each providing independent sources of top-down information. To meet these criteria in task design, we modified the classic delayed match-to-sample (DMS) task, commonly used in experimental studies [62–66], to develop one-modality DMS and two-modality DMS tasks.
For the one-modality DMS task (Fig 1a), the RNN model received one stream of input stimulus signal (bottom-up) containing two sequential stimuli separated by a delay period. Each stimulus was set to either +1 or −1. The model additionally received a task cue signal (top-down), indicating whether the network should employ the pro- or anti-DMS stimulus-response mapping. For the pro-DMS task (a task cue of +1 during the instruction period), the network was required to generate an output of +1 if the signs of the two stimuli matched, and −1 otherwise. For the anti-DMS (a task cue of −1 during the instruction period), the network was required to generate a negative output if the signs of the two stimuli were the same, and a positive output otherwise. The task cue was given either before (early instruction; 150–400 ms) or after the first stimulus (late instruction; 650–900 ms; Fig 1a).
Critically, in nature, not all aspects of sensory input are relevant to the task at hand. To effectively navigate this, organisms rely on additional top-down signals, such as selective attention, alongside direct task signals to optimize decisions. To study this more complex scenario, we developed the two-modality DMS task by incorporating two streams of input stimulus signals (bottom-up; modality 1 and modality 2) and an attention cue (top-down) which instructs the model which stimulus modality is behaviorally relevant on a given trial and thus should be attended to (Fig 1b; see Methods). An attention signal of −1 during the instruction period informs the model to focus on modality 1 while ignoring modality 2, and vice versa for a +1 attention cue signal. The task cue manipulation is identical to that used in the one-modality DMS task. Similar to the one-modality task, the timing of the instruction period was manipulated such that task and attention cues were delivered either before (early instruction) or after the presentation of the first stimulus (late instruction).
For each decision task (one-modality DMS and two-modality DMS), we trained 20 RNNs to perform the task with high accuracy (> 95%; see Methods). For the one-modality DMS, the models performed the task accurately regardless of the timing of the instruction window (Fig 1c; see S1 Fig for example single neuron activity and output signals). The model performance on the two-modality DMS was lower when the top-down cues were provided early (P < 0.001, two-sided Wilcoxon rank-sum test; Fig 1c). This can be attributed to the model’s requirement to retain both the task and attention signals for a longer duration in the early instruction condition.
Dynamic encoding of top-down information by RNNs
After training RNNs to perform the dynamic decision tasks, we next investigated the effects of the top-down factors (task and attention cues) on the network dynamics. To visualize RNN dynamics in response to distinct external inputs of both bottom-up and top-down signals under various conditions, we plotted the kinetic energy landscape (see Methods) of a representative RNN. This was achieved by using two principal components (Fig 2a, 2b; see Methods). The kinetic energy landscape provides a useful guideline for understanding RNNs as dynamical systems, since it offers a visual summary that illustrates how external inputs shape the structure of fixed points and “slow” points (local minima in the energy landscape) as well as neural trajectories within the state space [67–69]. Therefore, seeking to identify these fixed and “slow” points in the two-dimensional energy landscape enabled us to gain insight into how the task and attention signals influenced the RNN network dynamics.
Fig 2. Flexible incorporation of task and attention signals.
a. Heat map of kinetic energy in an example RNN performing the one-modality DMS task during the first instruction period (150–400 ms; Fig 1a). Background heat maps belong to the late instruction condition. The middle cross indicates the local energy minimum during the first instruction period for the late instruction condition (no instruction), while the left and right crosses indicate the new local energy minima for the pro- and anti-DMS conditions, respectively. b. Same as a, for an example, RNN performing the two-modality DMS task. Legend: pro-task modality 1, pro-DMS and first modality attended; pro-task modality 2, pro-DMS and second modality attended; anti-task modality 1, anti-DMS and first modality attended; anti-task modality 2, anti-DMS and second modality attended. c. Individual trial one-modality RNN trajectories in principal component space during the first instruction period. Principal component projections of fixed points are included. Trial top-down cue conditions are divided into pro-DMS (teal), anti-DMS (dark blue) or no instructions (gray). d. Same as c, for a two-modality RNN. e. Latent space one-modality RNN trajectories during the first instruction and first stimulus periods (150-650 ms; Fig 1a) for all combinations of top-down cues and stimulus identities. f. Same as e, for a two-modality RNN, highlighting the attended stimulus modality. g. Same as e, for a two-modality network, highlighting the unattended stimulus modality. h. Latent space two-modality RNN trajectories for pro-DMS trials, splitting trajectories by the identity of attended vs. unattended stimuli. i. Same as h, for anti-DMS trials. j and k. Temporal discriminability for the first stimulus identity of the attended modality when the attention signal is given early (j) or late (k). The task signal was fixed to anti-DMS. l and m. Temporal discriminability for the first stimulus identity of the unattended modality when the attention signal is given early (l) or late (m). The task cue was fixed to anti-DMS. Source data can be found in Zenodo at https://doi.org/10.5281/zenodo.20148306.
We hypothesized that the presence of top-down cues would condition network dynamics even before the arrival of the bottom-up sensory inputs. In addition, we postulated that opposing cues would correspondingly lead to separable shifts in fixed point structure, which we first visualized through the energy landscape of the network during the first instruction period (150–400 ms, see Fig 1a). For the one-modality DMS task, the energy landscape of the example RNN model displayed an area with an inverted U-shape where the kinetic energy was notably low (darker blue area in Fig 2a). The local minimum of the landscape stayed around the center of this U-shaped profile during the period before the first stimulus presentation when the task cue was not yet presented (no instruction). However, when the task cue was presented early (early instruction), the local minimum shifted to the left and to the right for the pro- and anti-DMS condition, respectively (Fig 2a).
For the two-modality DMS task, the energy landscape again revealed a U-shaped region characterized by a low-energy profile (dark blue in Fig 2b). Consistent with what we observed in the one-modality task, the local minimum was situated at the center of the low energy structure during the first instruction period for late instruction trials, where no task or attention cues were yet presented (no instruction). Analyzing different combinations of the task and attention cue signals displayed distinct local minima points along the low-energy profile. For instance, the local minima corresponding to the pro-DMS signal were positioned to the right of the no-instruction minima. Along this pro-DMS “arm” of the energy trajectory, two local minima were observed: one corresponding to the modality-one attention signal and the other to the modality-two attention signal. These results suggest that network dynamics shift as a function of the combination of top-down attention and task signals, in a dissociable manner, even before the sensory inputs arrive.
Subsequently, we performed a quantitative fixed point search analysis [67] for different values of external input to the network during the first instruction period (150–400 ms, Fig 1a). During this time, the network could receive either a pro-DMS, an anti-DMS, or no task cue (i.e., late instruction trials). We plotted network trajectories as well as fixed points discovered for each of these conditions in the principal component space, for the representative RNN performing the one-modality (Fig 2c) and the two-modality DMS tasks (Fig 2d). We observed that the presence and content of top-down signals given to the network not only altered its trajectories in the state space but also affected its fixed point structure. The network performing the two-modality DMS task exhibited a branching pattern that reflected symmetrical top-down coding across attended modalities for each task cue condition (anti- versus pro-DMS).
The energy landscape plots and the low-dimensional trajectories are qualitative ways to visualize which portions of the neural network’s activity landscape are more likely to be stable or have slower trajectories. The intuition behind the energy landscapes is that their local minima correspond to lower trajectory velocity. It is useful to visualize the transformation of these landscapes as a function of external inputs to the network, since that determines how specific instructions reshape the structure of the energy landscapes, and, therefore, the likelihood of each type of trajectory given a stimulus-instruction input pair.
To further examine network dynamics as a function of distinct top-down and bottom-up information, we performed a manifold discovery analysis, utilizing the CEBRA method [70]. This approach enables latent structure discovery and has the advantage of conditioning latent structure discovery over variables of interest. The contrastive learning objective and nonlinear mapping give CEBRA the ability to discover structured, consistent representations that preserve data relationships beyond simple variance maximization along orthogonal axes.
Here, we performed an unsupervised analysis with regard to all task covariates except for time. This ensured that adjacent time points were mapped to closer points in latent space and that smooth latent dynamics were obtained. This method allowed us to determine the latent trajectory bifurcations in the state space following presentation of stimuli and top-down cues in the network performing the one-modality DMS (i.e., one-modality RNN; Fig 2e) and the two-modality DMS tasks (i.e., two-modality RNN; Fig 2f–2i). In the one-modality RNN, the trajectories during the first instruction period were separated into three pathways corresponding to the pro-DMS cue signal (light green and light purple in Fig 2e), anti-DMS cue signal (dark green and dark purple in Fig 2e), and the absence of the task cue instruction (i.e., late instruction condition; gray in Fig 2e). The pro-DMS and anti-DMS pathways were then further bifurcated based on the identity of the first stimulus during the first stimulus period. For the two-modality RNN, we observed similar latent trajectories for the attended modality (Fig 2f). However, the bifurcations driven by the identity of the first stimulus were not as well separated in the unattended modality (Fig 2G). To quantify this effect, we measured the Euclidean distance between latent trajectories corresponding to opposite first-stimulus identities during the stimulus-1 period using repeated random subsampling. The average trajectory separation was significantly greater for the attended modality than for the unattended modality, confirming that stimulus-driven bifurcations are selectively amplified by attention (S2 Fig). This pattern held true across pro- and anti-DMS trials, which exhibited qualitatively similar bifurcations (Fig 2h, 2i). These findings were further confirmed by the cross-temporal discriminability analysis, which revealed that the identity of the first stimulus of the attended modality was more robustly encoded throughout the trial duration compared to the unattended modality (Figs 2j–2m; S3; see Methods). Linear support vector machine (SVM) decoding analysis also revealed that the top-down cue signals were robustly encoded by the RNN models (S4 Fig; see Methods for more details). These results suggest that lower-dimensionality representations of network dynamics sufficiently capture the network’s flexibility in encoding stimuli in a context-dependent manner, depending on stimulus relevance, as flexibly determined by top-down cues.
Inhibitory units play a critical role in both encoding and maintaining top-down information
After establishing the model’s capacity for flexible encoding of top-down information, we next investigated the relative contributions of excitatory and inhibitory units in facilitating this process. Through dissection of the RNNs performing the one-modality and two-modality DMS tasks, we found that the optimized synaptic decay time constants for the excitatory units were significantly larger compared to the time constants of the inhibitory units (P < 0.001, two-sided Wilcoxon rank-sum test, Fig 3a, 3b). Importantly, these optimized decay time constants should be interpreted as effective functional timescales that emerge through training to support the task’s computational demands.
Fig 3. Inhibitory units robustly maintain top-down signals and establish functional connectivity hubs.
a and b. Average synaptic decay time constants of inhibitory (blue) and excitatory (red) units in one-modality RNNs (a) and two-modality RNNs (b). c. SVM decodability of the task signal from inhibitory (blue) and excitatory (red) units from the one-modality RNNs. d. SVM decodability of the task signal from inhibitory (blue) and excitatory (red) units from the two-modality RNNs. Black bars indicate significant differences between excitatory and inhibitory units (P < 0.05, two-sided KS test). e. Average functional connectivity centrality for all inhibitory (blue) and excitatory (red) units, across all one-modality RNNs, in early (solid) or late (dashed) instruction trials. Arrows indicate start and end of the first instruction period. Shaded areas indicate standard error of the mean. f. Same as e, for two-modality RNNs. g. Histogram of functional centrality values for all units in all one-modality networks at the beginning of the first instruction period. h. Same as g, at the end of the instruction period. Boxplot: central lines, median; bottom and top edges, lower and upper quartiles; whiskers, 1.5 × interquartile range; outliers are not plotted. ***P < 0.001 by two-sided Wilcoxon rank-sum test. Source data can be found in Zenodo at https://doi.org/10.5281/zenodo.20148306.
Building on these findings, which indicate distinct roles for excitatory and inhibitory units, we conducted a series of linear SVM decoding analyses. These analyses aimed to decode the task cue identity (anti- versus pro-DMS) from the activity of excitatory and inhibitory units within RNNs engaged in the one-modality DMS task (see Methods). While the task cue identity could be readily decoded from the activities of both excitatory and inhibitory units following the cue presentation on early instruction trials, the SVM decoding accuracy was significantly higher for the inhibitory units throughout the trial duration (P < 0.05, two-sided two-sample KS test, Bonferroni corrected across time; Fig 3c). We observed similar trends for RNNs performing the two-modality DMS task (Fig 3d). In these models, the inhibitory units also exhibited stronger maintenance of the attention signal compared to the excitatory units (S5 Fig). Interestingly, decoding performance for stimulus identity was higher in inhibitory units for both the attended and unattended modalities, when compared to excitatory units (S5 Fig).
Given these findings that highlight the distinct role of inhibitory units on maintaining task-related information, we next investigated their network-level implications through the functional connectivity patterns of inhibitory units. More specifically, we measured the functional closeness centrality of all units in each network across time (see Methods). Assuming that correlations between activation rates over time establish a metric of distance between units, it follows that units with greater centrality emerge as potential candidates for serving as functional hubs in their respective networks. While both excitatory and inhibitory units increased in centrality following the top-down cue presentation (early instruction) for the RNNs performing the one-modality and two-modality DMS tasks, the average centrality was higher for the inhibitory units after the onset of task and attention cue presentation (P < 0.05, two-sided two-sample KS-test, Fig 3e, 3f). Comparing the distribution of the centrality measure from individual units in the one-modality RNNs before (at 145 ms) and after (at 395 ms) the first instruction period on early instruction trials revealed that the centrality values of inhibitory units were distributed more widely post-instructions. This finding indicates the formation of inhibitory functional connectivity hubs following the top-down cue presentation (Fig 3g, 3h). To understand the network structure underlying this increase in inhibitory centrality, we examined the connectivity patterns between rule-selective populations in the one-modality RNNs. Inhibitory neurons preferentially inhibited other inhibitory neurons with the same rule selectivity, with significantly stronger within-selectivity compared to across-selectivity I → I connections (S6 Fig). This structured motif was also observed across connection types, suggesting the emergence of rule-specific inhibitory subnetworks. Taken together, these results suggest a central role for inhibitory units in encoding task-specific top-down information, as well as stimulus identity.
Inhibitory-to-inhibitory connections carry top-down information
Given the crucial role of inhibitory units in encoding and sustaining top-down signals, we next investigated the neural circuitry surrounding these inhibitory units to further elucidate their contribution to the maintenance of task-specific top-down information.
Lesioning inhibitory connections to inhibitory units selective for the anti-DMS task cue (by reducing synaptic weights by 50%; see Methods) in a representative RNN trained to perform the one-modality DMS task resulted in the disruption of anti-DMS task signaling (Fig 4a). Consequently, the network exhibited a tendency to perform the pro-DMS task irrespective of the task cue presented. By visualizing the neural dynamics of this lesioned RNN using CEBRA [70], we confirmed that the neural trajectories were no longer separated by the task cue condition during the first instruction period in early instruction trials (Fig 4b).
Fig 4. Inhibitory-to-inhibitory connections primarily encode task and attention cue signals.
a. Reducing incoming inhibitory synaptic weights to inhibitory units selective to the anti-DMS cue by 50% in an example one-modality RNN led to the network performing the pro-DMS even when the anti-DMS cue was given (bottom). b) CEBRA neural trajectories of the lesioned model shown in a during the first instruction and first stimulus periods. Inset, trajectories from the intact network (identical to Fig 2e). c. Performance of all 20 RNNs trained to perform the one-modality task when task-specific inhibitory-to-inhibitory (; top) or inhibitory-to-excitatory (
; bottom) synaptic weights were reduced by 50%. d. Performance of all 20 RNNs trained to perform the two-modality DMS when task-specific
(top) or
(bottom) synaptic weights were reduced by 50% when modality 1 was attended. e. Performance of all 20 RNNs trained to perform the two-modality DMS when modality-specific
(top) or
(bottom) synaptic weights were reduced by 50% when the task signal was fixed to pro-DMS.
, inhibitory units preferring anti-DMS cue;
, inhibitory units preferring pro-DMS cue;
, excitatory units preferring anti-DMS cue;
, excitatory units preferring pro-DMS cue;
, inhibitory units preferring modality-1 attention cue;
, inhibitory units preferring modality-2 attention cue;
, excitatory units preferring modality-1 attention cue;
, excitatory units preferring modality-2 attention cue. Boxplot: central lines, median; bottom and top edges, lower and upper quartiles; whiskers, 1.5 × interquartile range; outliers are not plotted. *P < 0.05, **P < 0.01, ***P < 0.001 by two-sided Wilcoxon rank-sum test. Source data can be found in Zenodo at https://doi.org/10.5281/zenodo.20148306.
Motivated by these findings, we employed a systematic approach where we lesioned synaptic connections based on task cue selectivity (pro- versus anti-DMS) and unit type (excitatory versus inhibitory) for the one-modality DMS task. We found that disrupting inhibitory-to-inhibitory () connections led to profound impairment in task performance (Fig 4c top; **P < 0.01, ***P < 0.001 by two-sided Wilcoxon rank-sum test). Disrupting similarly tuned inhibitory connections impaired performance in both pro- and anti-DMS trials (Ia → Ia and Ip → Ip in Fig 4c top; **P < 0.01, ***P < 0.001 by two-sided Wilcoxon rank-sum test). However, lesioning oppositely tuned
connections resulted in greater impairment in one type of task trials (Ia → Ip and Ip → Ia in Fig 4c top; **P < 0.01, ***P < 0.001 by two-sided Wilcoxon rank-sum test). We also observed significant changes in task performance when the
connections were lesioned (Fig 4c bottom; **P < 0.01, ***P < 0.001 by two-sided Wilcoxon rank-sum test). Importantly, the disproportionately large behavioral effects observed following lesioning of task cue-specific
connections cannot be explained by a greater number of connections being perturbed, as
synapses constitute the smallest fraction of outgoing connections compared to other synaptic classes (S7 Fig). The effects of lesioning task cue-specific connections were similar in the RNN models trained to perform the two-modality task (Figs 4d; S8). In addition,
connections also carried attention cue-specific information in RNNs performing the two-modality DMS task (Figs 4e; S8). To further assess the relative contributions of excitatory versus inhibitory outputs, we extended our lesion analyses to include selectivity-matched perturbations of excitatory-to-excitatory (
) and excitatory-to-inhibitory (
) connections. Across both one- and two-modality RNNs, perturbation of
connections produced minimal performance changes, while lesioning
connections led to modest impairments that were substantially smaller than those observed following disruption of
connectivity (S9–S13 Figs). These results indicate that, although excitatory projections contribute to shaping inhibitory competition, disinhibitory
circuitry remains the dominant mechanism supporting the maintenance and flexible deployment of task- and attention-related top-down information.
These findings suggest that inhibitory-to-inhibitory connections are essential for integrating context-defining top-down information.
Two-module RNN model with hierarchical organization
Recent empirical studies have shown that inhibitory-to-inhibitory connections within early sensory areas receive strong task-specific and context-based modulation by higher cortical areas [40,61,71].
Based on our results thus far, which underscore the significance of connections in contextual coding, we next investigated whether inhibitory-to-inhibitory connections in low-level sensory areas would emerge as a critical target for task-oriented top-down modulation in our model. To accomplish this, we developed a two-module RNN model inspired by the cortical hierarchy. In this model, the first module (sensory module) is modeled after the early sensory cortex and receives the stimulus input signal only (Fig 5a). The first module subsequently conveys the processed sensory signal to the second, nonsensory module through exclusively feedforward excitatory connections. Given previous findings from animal studies showing that long-range feedforward projections are overwhelmingly excitatory, whereas top-down modulation can additionally engage sparse long-range inhibitory influences [31,33], we trained our two-module RNNs to perform the one-modality DMS task without intermodule feedforward inhibitory projections. The second module of the model, designed after higher cortical areas, receives the task cue signal (Fig 5a). Using this architecture, we trained 20 two-module RNNs to perform the one-modality DMS task (see S14 Fig for example single neuron activity and output signals). On average, the two-module RNNs took longer to learn the one-modality task compared to the one-module networks (mean ± stdev, 38,571 ± 6,306 trials for the one-module RNNs versus 55,221 ± 10,187 trials for the two-module RNNs; S15 Fig).
Fig 5. Dynamic coding and emergence of hierarchical inhibitory organization in two-module RNNs.
a. Schematic of the two-module RNN model trained to perform the one-modality DMS task. The sensory module (red shade) contains 200 units and the non-sensory module (blue shade) contains 800 units. The sensory module receives the input stimulus signal, while the nonsensory module receives the task cue signal. There are no feedforward inhibitory projections from the sensory module to the nonsensory module. b. Heat map of kinetic energy in an example one-modality two-module RNN during the first instruction period. The middle cross indicates the local energy minimum for the late instruction condition (no instruction), while the left and right crosses indicate the new local energy minima for the pro- and anti-DMS conditions, respectively. c. Individual trial two-module RNN trajectories in principal component space during the first instruction period. Principal component projections of fixed points are included. Trial task cues are divided into pro-DMS (teal), anti-DMS (dark blue) or no instructions (gray). d and e. Average synaptic decay time constants of excitatory units (d) and inhibitory units (e) in the sensory and nonsensory modules from 20 two-module RNN models trained to perform the one-modality DMS task. f and g. Comparison of average excitatory (f) and inhibitory (g) synaptic connection strengths within and between the sensory and nonsensory modules. h. Average decoding accuracy in early instruction trials for pro- vs. anti-DMS signal in the sensory module of two-module networks, including either excitatory (red) or inhibitory (blue) units. Shaded areas indicate standard error of the mean. Black bars indicate significant differences between excitatory and inhibitory units (P < 0.05, two-sided two-sample KS test). i. Same, for late instruction trials. Boxplot: central lines, median; bottom and top edges, lower and upper quartiles; whiskers, 1.5 × interquartile range; outliers are not plotted. ***P < 0.001 by two-sided Wilcoxon rank-sum test (d and e) or Kruskal–Wallis test followed by Dunn’s multiple comparison test (f and g). Source data can be found in Zenodo at https://doi.org/10.5281/zenodo.20148306.
To investigate if the two-module RNNs processed the task cue signal in a manner similar to that observed in the one-module RNNs, we characterized the kinetic energy landscape and performed the fixed-point analysis on the two-module networks during the first instruction period (150–400 ms; see Methods). Similar to the landscape and energy minima seen in the one-module RNNs (Fig 2a), we observed an inverted U-shaped energy landscape and displacement of energy minima as a function of the task cue identity (Fig 5b). While the fixed-point analysis suggests a line-like structure as an attractor for the visualized example network (Fig 5c), this is not a consistent pattern across random initializations. The most common observation was for fixed points to cluster along unordered point clouds as opposed to line attractors, similar to the one-module network. Still, these results suggest that network trajectories are conditioned by a systematic shift in fixed point structure, which is brought about by the presence of distinct top-down signals in the form of task instructions.
Characterizing the synaptic decay dynamics of the two-module RNNs revealed slower excitatory dynamics in the nonsensory module compared to the sensory module (Fig 5d; P < 0.001 by two-sided Wilcoxon rank-sum test). The inhibitory units exhibited the opposite trend (Fig 5e; P < 0.001, two-sided Wilcoxon rank-sum test). While further work is required to understand the heterogeneous timescales across excitatory and inhibitory neurons, there are a few possible interpretations for this finding. Longer excitatory time constants in the sensory module may support integration over longer times, beneficial to context integration via top-down feedback to sensory areas. Faster excitation relative to inhibition, on the other hand, could support rapid sensory stimulus encoding. In the nonsensory module, more closely related to the final decision process, it is possible that an inverted dynamic is at play. Decision making often requires integrating evidence over time, which functionally necessitates longer effective integration windows. Slower excitatory synaptic dynamics are often incorporated to ensure that network dynamics smooth out noisy inputs and accumulate information gradually.
Investigating the average synaptic connection strength revealed strong excitatory connections within the sensory module and from nonsensory to sensory modules (Fig 5f; P < 0.001, Kruskal–Wallis test followed by Dunn’s multiple comparison test). Interestingly, the average inhibitory signaling was the strongest within the nonsensory module (Fig 5g; P < 0.001, Kruskal–Wallis test followed by Dunn’s multiple comparison test). Both and
connections within the nonsensory module were significantly stronger than their counterparts in the sensory module (S16 Fig). These findings are consistent with previous empirical evidence suggesting increased inhibitory strength and diversity along the cortical hierarchy [60,72]. A possible interpretation for these findings is that stronger inhibition in the decision areas allows a tighter control of excitation, which supports precise modulation of internal computations that depend on context and goals rather than direct sensory drive. In higher-order modules where computations mainly rely on recurrent internal dynamics (e.g., evidence accumulation, rule maintenance), stronger inhibition may improve stability and prevent excessive activity. Recent literature suggests such a stabilizing role for inhibition in the context of strongly connected excitatory populations [53], expanding the parameter space where decision-making circuits can operate without instability.
Linear SVM decoding analyses in the two-module RNNs showed that the task signals were more readily decodable from the inhibitory units than from the excitatory units in both sensory and nonsensory modules (Fig 5h, 5i; see S17 Fig for the nonsensory module results, P < 0.05, two-sided two-sample KS test with Bonferroni corrected across time). Most notably, the SVM decodability of the task signal decayed more slowly for the inhibitory units compared to the excitatory units in both modules.
Inhibitory-to-inhibitory connections in both sensory and nonsensory modules encode top-down information
Analyzing the neural trajectories of an example two-module RNN during the first stimulus period for early instruction trials revealed trajectory grouping based on the task cue identity (Fig 6a). Lesioning connections in both sensory and nonsensory modules by reducing the weights by 50% abolished this top-down modulation (Fig 6b), mirroring the results observed in the one-module RNN (Fig 4b). Lesioning other synaptic connections (i.e.,
,
,
) did not lead to disruption of the top-down information encoding (S18–S21 Figs). Thus, these results suggest that the hierarchical, two-module RNNs also rely on inhibitory-to-inhibitory connections to encode contextual information.
Fig 6. connections in both sensory and nonsensory modules contribute to task cue encoding.
a. CEBRA neural trajectories during the first instruction and first stimulus periods from an example two-module RNN. b. CEBRA trajectories during the first instruction and first stimulus periods from the same RNN model shown in a, but with connections lesioned. c and d. Performance of all 20 two-module RNNs trained to perform the one-modality DMS task when task-specific
connection strengths within the sensory module (c) or within the nonsensory module (d) were reduced by 50%. e and f. Performance of all 20 two-module RNNs trained to perform the one-modality task when task-specific
connection strengths (e) or
(f) from the nonsensory to the sensory module were reduced by 50%. Boxplot: central lines, median; bottom and top edges, lower and upper quartiles; whiskers, 1.5 × interquartile range; outliers are not plotted. *P < 0.05, **P < 0.01, ***P < 0.001 by two-sided Wilcoxon rank-sum test. Source data can be found in Zenodo at https://doi.org/10.5281/zenodo.20148306.
Performing a systematic lesioning analysis, similar to the one utilized for the one-module RNNs in Fig 4, demonstrated the importance of inhibitory units with opposite tuning to the task cue (i.e., anti- versus pro-DMS) in the sensory module for reliably encoding the task signal (Fig 6c). In addition, disrupting connections within the nonsensory module severely impaired the task performance of all the RNNs, confirming that top-down information is primarily encoded by
connections in the nonsensory module (Fig 6d). Lesioning feedback inhibitory connections had minimal influence on the task performance (Fig 6e, 6f). Lesioning other synaptic connections did not lead to disruption of top-down information encoding (S18–S21 Figs), although perturbations of
connections led to modest performance impairments, supporting a role for long-range excitatory drive in recruiting inhibitory populations in higher-order areas.
These results suggest that the task cue information is encoded by synapses in the non-sensory module. What is even more unexpected is that the top-down information is transmitted to the sensory module via feedback connections and encoded by
connections within the sensory module (Fig 6c).
Disinhibitory circuits in mouse primary visual cortex and RNN sensory module encode top-down information
Building on the findings from the previous section, which indicate the role of sensory areas in encoding both stimulus-driven (bottom-up) and task-related (top-down) information, we proceeded to test our hypothesis. Specifically, we proposed that cortical connections within early sensory processing regions represent top-down information via feedback from higher cortical areas. To investigate this, we analyzed a publicly available experimental data which demonstrated the capability of the mouse primary visual cortex (V1) in representing contextual information [61]. This electrophysiology dataset was collected with a linear array recording electrode covering all layers of V1 and preprocessed with standard filtering and spike sorting pipelines.
In this study, during V1 activity recording, mice were trained to identify the location of a salient grating visual pattern (“figure”) that stood out from the visual background (“ground”) [61]. These patterns were distinguished based on either orientation, phase, or texture (Fig 7a). Crucially, in each of these conditions, the sensory inputs chosen to fall within the exact receptive fields of the recorded neurons remained strictly identical in both the figure and background conditions (see Methods).
Fig 7. Top-down encoding in mouse V1 and RNN sensory module.
a. Task summary. In the orientation condition, mice viewed an orientation pattern as either part of a figure (top left) or as part of the background (top right). In control trials (bottom row), mice must contrast a target against a gray background, which did not require figure-ground segregation. This design ensured that the same sensory information within receptive fields could possess different contextual meanings (figure vs. ground). Green circles represent the receptive field. Reproduced from [61]. b. SVM decoding of top-down figure vs. ground signal from V1 neurons, holding sensory input constant. Black bars indicate significant decoding (P < 0.05, random permutation test). Dashed lines indicate 95% confidence intervals from permutations. c. SVM decoding of top-down signals during optogenetic inhibition of VIP interneurons in V1, which normally mediate disinhibitory inhibitory-to-inhibitory () signaling through inhibition of other interneurons, in laser-on (blue) and laser-off (red) trials. The shaded area indicates laser on time window. P-value indicates significant difference in decoding across conditions (P < 0.05, one-sided t test). d. CEBRA trajectories within the sensory module of an example two-module RNN during the first stimulus period when the task signal was presented early. e. Same as d but when
connections within the sensory module were completely removed. f. Comparison of average decoding accuracy for the task cue identity across 20 trained, two-module RNNs under two conditions: one with intact connections and the other without
connections within the sensory module. Source data can be found in Zenodo at https://doi.org/10.5281/zenodo.20148306.
This allows for probing the role of recurrent connections from higher visual areas into V1 on top-down representations, given previous results which causally link recurrent feedback from higher visual areas and top-down feedback into V1, beyond lateral connections within this area [61,73,74]. This mirrors our approach in configuring our RNNs to process identical bottom-up sensory information under varying contexts informed by top-down cues. Therefore, disparities in neural activity observed across conditions in the mouse dataset could be attributed to the context indicated by stimulus regions outside of the measured neural receptive field.
First, we tested whether the activity of V1 neurons could differentiate between figure and ground conditions, even when the sensory information within their receptive fields remained identical. A successful figure versus ground decodability would be evidence that feedback from regions outside receptive fields projects into the measured V1 sites. Performing a linear SVM decoding analysis on the V1 activity to decode the condition label (figure versus ground) revealed that the V1 responses were modulated by the task context (Fig 7b; P < 0.05, permutation test, Bonferroni corrected). The high decoding accuracy indicates that feedback signals from regions outside the receptive fields robustly modulate V1 activity in response to information about task context.
In sensory cortical areas, including V1, vasoactive intestinal peptide-expressing (VIP) interneurons send inhibitory projections to somatostatin-positive (SST) interneurons. This forms microcircuitry characterized by inhibitory-to-inhibitory connections, similar to the connections observed in the sensory module of our two-module RNN model (Fig 5a). The canonical VIP-SST disinhibitory circuit in cortex is a natural candidate for the biological implementation of the top-down regulatory mechanism elicited by inhibitory-to-inhibitory connections that arose in our artificial models because VIP neurons are known to receive long-range regulatory feedback from frontal areas [75,76]. However, it is important to acknowledge that other types of inhibitory-to-inhibitory connections exist in sensory circuitry, such as SST to parvalbumin (PV) neuron connections [77].
Leveraging the experiments within the dataset where VIP interneurons of mouse V1 were optogenetically silenced, we analyzed the role of VIP neurons in figure-background discrimination, in trials where top-down cues were presented through the orientation of visual grating patterns. Through the SVM decoding analysis, we observed a decrease in decoding accuracy following optogenetic silencing of V1 VIP neurons (P < 0.05, one-sided t test; Fig 7c). These results demonstrate the importance of intact inhibitory-to-inhibitory projections for successfully encoding top-down information in V1 circuits.
We next utilized CEBRA [70] to analyze the neural dynamics of the sensory module in an example two-module RNN model during the first stimulus presentation in early instruction trials of the one-modality DMS task. The recovered latent space revealed distinct neural trajectories delineated by the task cue signal (Fig 7d). The trajectories were initially divided into two arms based on the identity of the task cue (light green and light purple versus dark green and dark purple trajectories in Fig 7d). Within each arm, the trajectories were further separable by the first stimulus identity (+1 or −1). Removing all the inhibitory-to-inhibitory connections within the sensory module abolished the effects of the task signal in the sensory module (Fig 7e). These results highlight a convergence between theoretical models and experimental evidence for the importance of feedback into lower sensory areas in coding top-down information.
Performing a time-resolved linear SVM decoding analysis on the sensory module across all 20 trained two-module RNNs revealed that the task cue identity could be reliably decoded from sensory population activity, providing additional confirmation of top-down modulation imposed by the nonsensory module (Fig 7f; see Methods). When inhibitory-to-inhibitory () connections within the sensory module were removed, task cue decoding accuracy was markedly reduced compared to the intact network (Fig 7f), indicating that these disinhibitory connections play a critical role in enabling reliable top-down information encoding within the sensory circuitry.
Introducing retro-cue condition elicits shift in strategy
To test whether the disinhibitory circuit mechanisms identified above generalize to settings in which attentional relevance is specified after stimulus encoding, we introduced a retro-cue variant of the two-modality task as illustrated in Fig 8a. The task still involves two streams of stimuli (modality 1 and modality 2) along with the task cue (pro- and anti-DMS). In this task version, the task cue was fixed to be given before the first stimulus (see Methods). Notably, in this modified version, the attention cue may be presented before the first stimulus (early instruction), after the first stimulus (late instruction), or after the second stimulus (retro-cue), as depicted in Fig 8a. This modification provides a more nuanced exploration of attentional processes by allowing flexibility in the timing of attention cue presentation.
Fig 8. Retro-cue signal forces the model to maintain both unattended and attended sensory signals.
a. Schematic of a modified version of the two-modality DMS task, which includes a retro attention cue signal. The task cue signal was always given early (before the onset of the first stimulus), while the attention cue was given before the first stimulus (early instruction), during delay (late instruction), or after the second stimulus (retro-cue). b. CEBRA trajectories during the first instruction and first stimulus periods for the attended modality when the attention cue signal was given early (example RNN). c. Same as b, but for the unattended modality. d–f. Average temporal discriminability of the first stimulus identity of the unattended modality when the attention cue was given early (d), late (e), or after the second stimulus (retro; f). g–i. Same format as d–f, but with inhibitory-to-inhibitory () connections lesioned (synaptic weights reduced by 50%). White lines denote task epoch boundaries. Source data can be found in Zenodo at https://doi.org/10.5281/zenodo.20148306.
We trained 20 one-module RNNs to perform this modified two-modality DMS task. We then compared the resulting network dynamics for the case where top-down cues (both task and attention cue signals) were given before the first stimulus against the dynamics observed from the RNNs trained for the original two-modality DMS task (Fig 2). Applying CEBRA to an example RNN trained for the modified two-modality DMS task revealed that the trajectories for the attended modality closely resembled those obtained from the example RNN trained for the original two-modality DMS (i.e., compare Fig 8b with Fig 2f). The trajectories were initially distinguished by the task cue (pro- versus anti-DMS). Subsequently, each task “arm” bifurcated based on the identity of the attended first stimulus (Fig 8b). Interestingly, in the case of pro-DMS, where the unattended first stimulus had an identity of +1, the trajectories diverged based on the identity of the attended first stimulus (illustrated by the green trajectories in Fig 8c). This stands in stark contrast to the trajectories observed in the RNN model performing the original two-modality DMS task, where the trajectories of the unattended modality were unaffected by the attended modality (Fig 2g). In other words, the introduction of the retro-cue condition to the two-modality task forced the RNNs to maintain not only the attended but also the unattended stimuli throughout the trial period. This is further underscored by the robust temporal discriminability of the first stimulus of the unattended modality across the three attention cue conditions (i.e., early, late, and retro cue conditions) in the 20 RNNs trained for the modified two-modality DMS task (Fig 8d–8f; compare these to Fig 2l, 2m). Carrying out systematic lesioning analyses revealed that reducing inhibitory-to-inhibitory () connections led to a pronounced decrease in the temporal discriminability of the first stimulus (Fig 8g–8i), whereas lesioning the other synaptic classes produced substantially weaker effects (S22 Fig). These results demonstrate that, despite the altered task demands imposed by the retro-cue condition, stimulus representations continue to rely critically on disinhibitory
circuitry for their maintenance.
Overall, introducing the retro-cue condition resulted in the RNN model employing a different strategy to flexibly prioritize and maintain relevant information throughout the task execution. Notably, this shift in strategy preserves the same underlying circuit mechanism, with disinhibitory inhibitory-to-inhibitory interactions remaining essential for stabilizing stimulus representations under dynamically changing attentional demands.
Discussion
The present study provides a unique set of cognitive tasks that require flexible stimulus-response mapping and enable investigation into the interaction between bottom-up and top-down signaling. By constructing and training biologically constrained RNNs on these tasks, we propose possible neural mechanisms essential for such flexible information processing and cognitive control. Importantly, we show that selective inhibition plays a central role in shaping network dynamics during context-dependent decision-making. Moreover, through the utilization of hierarchically structured RNN models (i.e., two-module RNNs) and data from the mouse primary visual cortex, we demonstrate the involvement of selective inhibition in encoding top-down information within early sensory areas. Together, these findings offer a complete computational and empirical account for the role of top-down feedback into lower-level sensory areas.
Previous experimental and computational studies have highlighted the significance of disinhibitory circuits governed by inhibitory-to-inhibitory signaling in performing high-level cognitive functions [34,43,78–80]. For instance, disinhibitory effects exerted by VIP and SST interneurons in higher-level cortical areas, such as the prefrontal cortex, have been shown to be critical for working memory and social memory [59,81]. A role for VIP and SST neurons in encoding the novelty of visual stimuli has also been found, suggesting a broader role in top-down regulation [38]. While recent animal studies have started to explore how these mechanisms are utilized for other cognitive functions [75,82,83], the role of inhibitory-to-inhibitory connections in facilitating flexible cognitive switching remains poorly understood. This ambiguity is primarily attributed to the lack of computational models incorporating relevant biological constraints, coupled with scarcity of cognitive tasks that allow for orthogonal manipulation of sensory and temporally varying top-down signals. To bridge this gap, we trained RNNs composed of model neurons to perform cognitive tasks requiring adaptive switching. Through several computational analyses, including lesion studies, we demonstrate that a subset of synapses naturally emerge in our model to adeptly map sensory information into context-dependent decisions. Consistent with our findings, recent animal studies have identified disinhibitory circuit motifs facilitating flexible routing of sensory information [34,40,41,61,83].
Specifically, the canonical VIP-SST disinhibitory circuit in the cortex is a candidate for the biological implementation of the top-down regulatory mechanism elicited by inhibitory-to-inhibitory connections that arose in our artificial models. While VIP neurons are known to receive long-range feedback from frontal areas [75,76], and to be flexibly modulated by task demands [84], one of their roles in cortical circuitry is to inhibit SST-expressing neurons, which are inhibitory neurons themselves [34,35,75]. Since SST neurons target excitatory pyramidal neurons in visual circuitry [85], the end goal of this circuitry may be to allow for a regulated integration of top-down feedback into disinhibited excitatory sensory neurons [86–88]. Above and beyond being utilized for top-down regulation alone, inhibition plays a role in stabilizing cortical activity as a whole, as evidenced by the onset of seizures caused by loss of cortical inhibition [89]. In networks with strong recurrent connections, excitatory-to-excitatory connections amplify responses so that the network is unstable if the firing rates of inhibitory neurons are kept fixed [90], requiring inhibitory-based stabilization to control self-amplification [91]. From the perspective of top-down control, disinhibitory feedback can act functionally as regulatory gating, which modulates bottom-up circuits as opposed to just re-writing sensory information coded by sensory pyramidal neurons, allowing for context-based selection of sensory inputs [92,93]. This type of long-range regulatory gating plays an especially important role in top-down attention-based gating of sensory inputs [17].
Although the retro-cue condition introduces task demands that differ from the standard delayed match-to-sample structure, our additional lesion analyses demonstrate that the same disinhibitory inhibitory-to-inhibitory circuitry remains critical for maintaining stimulus representations under these altered temporal and attentional constraints. This suggests that the emergence of disinhibitory motifs in our models is not specific to a particular task configuration, but instead reflects a more general circuit mechanism for stabilizing and selectively prioritizing internal representations in the presence of dynamically changing top-down demands. In line with this, prior experimental and computational studies have shown that reward expectation and value signals strongly modulate cortical processing through inhibitory interneuron populations, often via disinhibitory pathways that gate sensory or mnemonic representations in a context-dependent manner [43,53,79,83]. Within this broader framework, our results suggest that disinhibitory motifs may constitute a canonical circuit mechanism through which diverse top-down signals can dynamically regulate information flow and representational stability across cortical hierarchies.
Particularly noteworthy are our findings that connections within the sensory module of our two-module RNN model encode top-down information, and that these connections are necessary for context-dependent mapping of the sensory information. Furthermore, we reported that feedback signals from higher-level cortical areas contribute to emergence of these
connections in early sensory areas. These findings indicate a hierarchical organization of inhibitory circuits involved in encoding and processing top-down information. In support of this, Kirchberger and colleagues demonstrated the critical role of feedback inputs from higher visual areas in generating context-dependent signals in the mouse primary visual cortex through optogenetic manipulation [61]. While the authors observed reduced context-encoding upon optogenetic silencing of excitatory neurons in the higher visual cortex [61], the specific circuitry underlying these feedback projections warrants further investigation.
The possibility that aspects of complex computations, typically ascribed to higher cortical areas, may also occur in lower-level areas challenges the prevailing notion of the cognitive limitations of lower-level areas. Building representational redundancies into cortical hierarchies allows for complex computations to occur in both lower sensory areas and higher-level areas [25,46,47]. This mechanism enhances the robustness of representations and facilitates efficient processing, enabled by top-down feedback signals guiding multiplexing in sensory areas.
Building on the insights gained from this study, future directions could explore whether similar circuit mechanisms are employed across various cognitive domains, potentially uncovering universal circuit motifs underlying flexible sensory processing and cognitive control across various contexts. Furthermore, enhancing the biological realism of computational models and RNNs by incorporating spiking model neurons and biologically plausible learning rules could unveil other neural mechanisms based on precise timing-based computations. Finally, validating our experimentally testable hypotheses and predictions using neural data from both nonhuman primates and humans, particularly epilepsy patients with depth electrodes implanted for seizure monitoring and potential intervention, represents a crucial future direction. This step would provide an improved understanding of representational redundancies within cortical networks and help identify compensatory mechanisms that preserve flexible computation in networks disrupted by disorders. Additionally, these insights will inform the development of targeted interventions and therapeutic strategies for neurological disorders that impair sensory processing and cognitive control.
Methods
Continuous-rate RNN model
In this study, we constructed the following continuous-variable RNN model:
where refers to the synaptic time constants,
denotes the synaptic current variable from N units across T time points. By applying a sigmoid nonlinearity, we estimated the firing rates
based on the synaptic current values (
). The connection weights from the time-varying inputs (
;
, the number of input channels) to the network were represented by the weight matrix
. Additionally,
contains connection weights between the N units.
To compute the network output (), we linearly combined all the firing rates specified by the output connection weight matrix,
, along with the constant bias term, b. The network size (N) was set to 1,000 for all the networks. Output weights could be positive or negative.
For the one-modality DMS task, the input signals () contained three channels: two channels for the two sequential stimuli and one channel for the task cue signal (Fig 1a). For the two-modality DMS task,
contained 6 channels, corresponding to two stimulus modalities (two channels for each modality), one channel for the task cue signal, and another one for the attention cue (Fig 1b). For the one-module RNNs,
was projected to all the units in the network. However, for the two-module RNNs, the stimulus signals were injected into the first 200 units, while the task and attention cues were provided to the second module comprising 800 units (Fig 5a).
For the two-module RNN model, the recurrent connectivity weight matrix () was constrained to remove the feedforward inhibitory connections from the first module (first 200 units; sensory module) to the second module (nonsensory module) by applying a binary mask.
Training and task details
The dynamics in Equation 1 were discretized using the first-order Euler approximation method and using the step size () of 5 ms:
where . Given that the units in this network model communicate through differentiable and continuous signals, a gradient-descent supervised method, known as backpropagation through time (BPTT; [94]), was employed to train our RNNs to perform cognitive tasks. Specifically, the trainable parameters of the model included
,
,
, and b. We used Adam (adaptive moment estimation) optimization algorithm to update these parameters. The learning rate was set to 0.01, and the TensorFlow default values were used for the rest of the parameters, including the first and second moment decay rates. To further impose biological constraints, we enforced Dale’s law (uniform neurotransmitter release characteristics within separate excitatory and inhibitory neurons) using methods similar to those implemented in previous studies [48,54,95]. To adhere to empirical findings regarding the ratio of excitatory and inhibitory neurons observed in the brain, each RNN consists of 80% excitatory and 20% inhibitory units (i.e., E–I ratio of 80/20; [30,96,97]).
Importantly, instead of fixing the synaptic decay constant () to a fixed value for all the units, we optimized the parameter for each unit. The parameter was trained to range from 20 ms to 125 ms to model heterogeneous synaptic dynamics of different receptors in the cortex [98,99]. We initialized the synaptic decay time constant parameter (
) using
where is the sigmoid function and
refers to the standard normal Gaussian distribution.
ms and
ms were used to constrain the parameter to range from 20 ms to 125 ms.
Our model training was deemed successful if the following two criteria were satisfied within the first 70,000 epochs: 1) loss value (defined as the root mean squared error between the network output and target signals) < 7; and 2) task performance > 95% for the one-modality DMS task and > 85% for the two-modality DMS task. If the network did not meet the criteria within the first 70,000 epochs, the training was terminated. For each successfully trained RNN, we simulated 400 trials for the one-modality task and 1,000 trials for the two-modality task, randomizing each trial to contain one possible combination of instruction order, top-down cues, and stimulus identities. Using these criteria, we trained 20 RNNs for each task.
One-modality DMS task.
For the one-modality DMS task, the trial duration was set to 350 steps with 5 ms step size, and the input matrix () contained two input channels for the two sequential stimuli (Stim 1 and Stim 2 in Fig 1a). The third channel was used to deliver the task cue signal. The first channel was employed to present Stim 1 (lasting 250 ms) at 400 ms, and the second channel delivered Stim 2 (lasting 250 ms) 250 ms after the offset of the first stimulus. During the stimulus period, the input channel was set to either −1 or +1. For the early instruction condition, the task cue channel was set to either −1 or +1 from 150 ms to 400 ms. For the late instruction condition, the task cue channel was adjusted to −1 or +1 during the delay period, spanning from 650 ms to 900 ms.
Two-modality DMS task.
The input matrix for the two-modality task contained 6 input channels (). The initial two channels were employed to present two sequential stimuli for the first modality, whereas the subsequent two channels delivered stimuli for the second modality. The fifth channel was dedicated to the task cue signal, while the last channel was used to present the attention cue. For the early instruction condition, both task and attention signals were delivered at 150 ms. For the late instruction condition, the task and attention signals were presented at 650 ms.
Modified two-modality DMS task.
For the modified version of the two-modality DMS task (Fig 8), the input matrix closely resembled that of the standard two-modality task (see above). The trial duration for this variant was reduced to 250 steps (1,250 ms), and the stimulus window was also shortened to 125 ms. In this task, the task cue signal was always delivered before the first stimulus at 125 ms. However, the attention cue signal was randomly delivered either before the first stimulus (early instruction), after the first stimulus at 375 ms (late instruction), or after the second stimulus at 625 ms (retro-cue).
SVM decoding and state space analyses
We performed decoding on RNN activity with a 5-fold cross-validated linear support vector machine analysis (SVM; sklearn function SVC). Given a trained RNN model, we first generated firing rate timecourses ( in Equation 3) of 1,000 units across 1,000 simulated trials. Next, the data were randomly sampled for instruction order (early versus late), stimulus identity (+1 versus −1), and top-down task signals (pro-DMS versus anti-DMS; as well as attended versus unattended for the RNNs performing two-modality DMS). The SVM classifier was trained on the firing activities from either excitatory or inhibitory units and then tested on the left-out trials to distinguish each of the signals of interest.
The decoding plots represent single trial decoding accuracy for the signal of interest. All model units of a given kind (inhibitory or excitatory) are included simultaneously in the SVM decoder as features. The measures of variability (standard error of the mean) are included as shaded areas around the decoding accuracy trajectories, which are computed in cross-validated fashion across 5 folds.
To characterize the network dynamics in response to distinct external inputs, we performed a fixed-point analysis [67] using the FixedPointFinder toolbox [68]. This method involves numerically minimizing a proxy energy function derived from the RNN update equations to identify fixed points and subsequently obtaining linearized dynamics around these points to describe their behavior. In our case, this is equivalent to minimizing q(x), defined from the RNN dynamics of Equation 1:
We repeated this search for every set of external inputs u(t), which included null inputs for late instruction trials, task cue (pro-DMS versus anti-DMS), and attention cues (for the two-modality DMS).
For each optimization run, we randomly selected 200 out of the 1,000 simulated trials to create initial conditions for the optimizer. From each trial, one initial condition was generated by taking the 1,000-dimensional system state at the end of the first instruction period. Each optimization run had the following parameters: number of iterations = 30,000; initial learning rate = 0.1; outlier distance scale = 10; unique tolerance = 10. The outlier distance scale served as a cutoff distance for discarding outlier putative fixed points, measured from the centroid of the vector of initial conditions. The unique tolerance is the numerical precision used to determine if two fixed points are unique. Fixed points that were too close to previously identified ones were discarded. Two fixed points were considered unique if their distance exceeded this threshold.
To visualize the fixed points and RNN neural trajectories, we plotted them in principal component space spanned by the top three principal components extracted from the average activities during the first instruction period. In addition, we visualized the entire landscape of the energy function q(x) in the top two principal components during the first instruction period along with their respective energy minima for each external input u(t) condition (Figs 2a, 2b, 5b). For qualitative visualization purposes only, we created a grid of possible (x,y) coordinates in two-dimensional principal component space and performed an inverse PCA transform with sklearn function , which returns the unique orthogonal projection of the PC point onto native space. This function works by multiplying the PCA-space representation by the transpose of the PCA rotation matrix and adding the dataset mean. This implies that any variance that is perpendicular to the principal components is treated as zero and discarded. This is still lossy but is the best approximation possible for our visualization purposes.
CEBRA analysis
To visualize network dynamics and analyze the impact of top-down signaling on RNN dynamics, we applied unsupervised manifold discovery CEBRA [70] on the 1,000 simulated trials of representative RNN models during the first instruction and first stimulus periods. This method is capable of uncovering latent structures in multidimensional dynamical datasets, either unsupervised or conditioned on specific variables of interest. Throughout this study, we conducted unsupervised latent structure discovery based solely on the temporal sequence of data points across the entire dataset, without considering trial conditions, tasks, or stimulus identities. More specifically, we utilized the default model parameters for the ‘offset10-model’ architecture, using a batch size of 256, time offsets of 10, learning rates of , and cosine as the distance measure.
Cross-temporal discriminability analysis
In addition to the SVM decoding analysis (see above), we also quantified the amount of information encoded by each unit within RNNs using cross-temporal discriminability analysis [100–102]. For Figs 2j–2m, the firing rate estimate time courses of each unit for each first stimulus identity were initially divided into two splits (odd versus even trials) and averaged across trials within each split. Subsequently, for each unit in each split, the difference in the average firing rates between the two conditions based on the first stimulus identity (i.e., + 1 and −1) was computed. Next, Pearson’s correlation coefficient between the two splits was calculated. The discriminability score was then obtained by applying Fisher’s z-transformation. This process was carried out separately for the attended modality (Fig 2j, 2k) and the unattended modality (Fig 2l, 2m). For Fig 7f, the first stimulus identity was fixed to −1, and the discriminability of the task cue signal was computed using the two task cue identities (pro- versus anti-DMS). The diagonal values of the discriminability matrices are shown. For Fig 8d–8f, similar procedures to those employed for Fig 2j–2m were implemented.
Centrality measure
We utilized a functional closeness centrality measure to determine the centrality of each unit within RNNs. For this, we first defined a functional distance metric between unit i and j a network, based on the absolute of the Pearson correlation (
) between their synaptic current activity over time, denoted as
and
:
This function ensures that unit pairs that are more predictive of each other’s activity, either via positive or negative correlation, have a distance function closer to 0, while unit pairs with weak correlations have a distance function closer to 1.
Next, we used the complete distance matrix as the edge distances between nodes in a graph and employed the Python package networkx [103] to compute the network closeness centrality measure for each unit i from all n units:
Here, S(i,j) represents the shortest path distance between two units i and j, computed utilizing the distance matrix via Dijkstra’s algorithm with the distance attributes as edge weights. Concretely, a node has high centrality when it is, on average, only a few steps away along shortest paths. Intuitively, a neuron is a good vantage point or information broadcast hub if communication follows shortest paths. We computed centrality for all neurons within the entire network and plotted it separately for excitatory versus inhibitory neurons.
Identifying task and attention cue selective units
To identify units selective for pro-DMS versus anti-DMS task signals, we first generated 400 trials for the one-modality DMS and 1,000 trials for the two-modality DMS. We next extracted firing rate estimates of all units in each of the trained RNNs. Then, the trials and firing rate estimates were sorted based on the early task cue signal, separating them into pro-DMS and anti-DMS groups. For each unit, we averaged the firing rate estimates during the first instruction window for each trial. Subsequently, we performed the one-sided Wilcoxon rank-sum statistical test on the average firing rates from the two task groups for each unit. Units with significantly higher average firing rates in the pro-DMS group compared to the anti-DMS group were classified as pro-DMS selective units. Similar procedures were performed to identify attention-selective units for the two-modality RNNs.
Lesioning analyses
For Figs 4 and 6, trained synaptic connections were lesioned systematically to identify specific connections important for encoding and maintenance of top-down information. For each trained network, connections were initially classified into four subgroups: (inhibitory units inhibiting other inhibitory units),
(inhibitory units inhibiting excitatory units),
(excitatory units exciting inhibitory units), and
(excitatory units exciting other excitatory units). Within each subgroup, connections were further categorized based on their selectivity for the task and attention cue signal (see above). The selective connections were then lesioned by reducing their strength by 50%.
Experimental data
To further characterize the impact of top-down signaling over information processing, we utilized publicly available experimental data on the role of recurrent processing in the activity of mouse primary visual cortex [61]. In this task, mice had to determine the location of a salient grating visual pattern which contrasted from a visual background, distinguishing them based either on the orientation, the phase, or the texture of the grating pattern. For the purposes of our analyses, we averaged results across orientation, phase, and texture conditions. Target patterns could appear either in the right or left visual field, prompting mice to indicate their locations by licking a spout on the corresponding side to receive water or milk droplets as rewards. Crucially, neural recordings and stimuli were set up in such a way that image contents falling within the receptive fields of the recorded neurons were strictly identical in the figure and background conditions. Thus, any difference in neural activity measured across conditions was caused by the context indicated by image regions outside of the measured neural receptive field.
We first sought to determine whether V1 activity contained information distinguishing target grated stimuli from nontarget grated backgrounds even when the sensory input within receptive fields was identical. This would indicate that feedback from regions outside receptive fields projects into the measured V1 sites. For this analysis, neural activity consisted of multiunit activity from 198 recording sites in 13 different V1 electrode penetrations in 6 awake mice. To determine which information was contained in V1 multiunit activity, we performed a 4-fold cross-validated linear SVM decoding analysis (MATLAB function fitcsvm), for figure versus background labels from 0.5 s before to 1 s after stimulus presentation, utilizing activity in separate channels as features. To obtain a null distribution for decoding accuracy, we generated 1,000 random label permutations and performed the same decoding analysis.
Furthermore, we leveraged an additional analysis using the aforementioned dataset. Specifically, we determined whether activity of inhibitory neurons in V1 is necessary for figure versus background discrimination, as predicted by the central role of inhibitory neurons in our modeling approach. The analysis contained multiunit activities of different subgroups of inhibitory interneurons in V1: vasoactive intestinal peptide–expressing (VIP) and somatostatin-positive interneurons (SST). In addition, the dataset also contains recordings when VIP or SST neurons were optogenetically inhibited (see [61] for more details).
Using this dataset, we decoded figure versus background labels from V1 multiunit activity with a 5-fold cross-validated linear SVM, both with and without optogenetic VIP suppression (time window length: 250 ms, time steps: 25 ms). In the original data manuscript, authors tested 3 different figure-ground segregation schemes for visual gratings: orientation, phase, and texture. Behaviorally, authors found that orientation elicited the highest accuracy rates. Optogenetically, orientation was the one condition that elicited the highest differences in spike rates comparing laser on versus laser off conditions. Therefore, for our purposes of demonstrating the impact of optogenetic silencing on SVM decoding, we sub-selected only trials which had orientation as a top-down signaling cue and performed 5-fold cross-validation. Other than that, no further sub-selection was performed. We then compared decoding accuracy between the laser-on and laser-off conditions by performing a two-sample one-sided t test for whether accuracy was larger in trials without laser suppression. We performed this test within the main time window of interest, when the laser was delivered (250 ms to 500 ms).
In parallel to the neural data analysis, we applied an analogous linear SVM decoding framework to activity from the sensory module of the trained two-module RNNs. For each model, task cue identity (pro versus anti) was decoded from population activity using a time-resolved linear SVM. Neural activity was first averaged within sliding temporal windows (window size: 50 ms; step size: 25 ms) to generate feature vectors for each trial and time window. Classifiers were trained using a random 70% subset of trials and evaluated on the remaining 30% held-out trials, and this procedure was repeated across 100 random train-test splits. Decoding accuracy was computed independently for each neuron and time window, then averaged across neurons within the sensory module and across trained models.
Statistical analyses
All RNNs trained in the present study were randomly initialized (with random seeds) before training. Nonparametric statistical methods were employed throughout the study. In figures containing boxplots, two-sided Wilcoxon rank-sum or signed-rank tests were performed to ascertain statistically significant differences between two groups.
Following linear SVM decoding and functional centrality analyses, we aimed to determine whether decoding accuracy or functional centrality values were higher in excitatory or inhibitory neurons. To achieve this, we utilized the scipy.stats function ks2samp and performed two-sided Kolmogorov–Smirnov (KS) tests across all time bins and Bonferroni corrected for the number of tested time bins. Similarly, we performed a one-sided t test in the time bin of interest to specifically test the hypothesis that decoding accuracies in V1 would decrease following optogenetic silencing.
For cross-temporal discriminability analyses, we performed binomial tests and Bonferroni corrected for the number of tests to determine whether each decoding accuracy value was higher than chance. Assuming a false positive rate of 5%, and a Bernoulli process to generate significant decoding events at this rate, the number of instances S falsely classified as significant within a population of size N is given by a binomial distribution: . Accordingly, we derived a binomial p-value for the probability of obtaining an observed sensitive instance count K larger than expected by chance, using the cumulative binomial distribution:
.
Supporting information
S1 Fig. Example single-unit activity and output dynamics from a trained one-modality RNN.
a. Neuron-by-time heatmaps showing activity of all 1,000 units from a representative single-module, one-modality RNN during Pro-DMS (left) and Anti-DMS (right) trials. For visualization purposes only, each neuron’s activity was z-scored across time. White vertical lines demarcate key task epochs, including instruction, stimulus, delay, and response periods. b. Corresponding RNN output signals for Pro-DMS (teal) and Anti-DMS (purple) trials. Gray shaded regions indicate the presentation of stimulus 1 (Stim 1) and stimulus 2 (Stim 2). The output reflects the learned stimulus–response mapping conditioned on the task cue. Source data can be found in Zenodo at https://doi.org/10.5281/zenodo.20148306.
https://doi.org/10.1371/journal.pbio.3003831.s001
(EPS)
S2 Fig. Quantification of latent trajectory separation for attended versus unattended modalities.
Boxplots show the Euclidean distance between CEBRA latent trajectories corresponding to opposite first-stimulus identities, computed during the stimulus 1 epoch. Distances were calculated separately for attended and unattended modalities by randomly subsampling latent trajectories across task conditions and repeating the procedure 100 times. Boxplot: central lines, median; bottom and top edges, lower and upper quartiles; whiskers, 1.5 × interquartile range; outliers are not plotted. ***P < 0.001 by two-sided Wilcoxon rank-sum test. Source data can be found in Zenodo at https://doi.org/10.5281/zenodo.20148306.
https://doi.org/10.1371/journal.pbio.3003831.s002
(EPS)
S4 Fig. Temporal encoding of top-down signals during DMS tasks.
Cross-temporal decodability of task and attention signals is shown as the average generalization accuracy for each pair of training/testing time windows. Highlighted boundaries indicate significant decoding accuracy, , binomial test, Bonferroni corrected. White lines denote the reference event of each trial period (task and attention cues, stimulus onsets, response). a. One-modality DMS, task signal, early instruction. b. Same, for late instruction. c. Two-modality DMS, task signal, early instruction. d. Same, for late instruction. e. Two-modality DMS, attention signal, early instruction f. Same, for late instruction. g. Two-modality DMS, attended modality signal, early instruction, attended modality. h. Same, for unattended modality. i. One-modality DMS, stimulus identity. j. Two-modality DMS, stimulus identity, stimulus modality 1. k. Same, for stimulus modality 2. Source data can be found in Zenodo at https://doi.org/10.5281/zenodo.20148306.
https://doi.org/10.1371/journal.pbio.3003831.s004
(EPS)
S5 Fig. Inhibitory units robustly maintain top-down signals.
a. SVM decodability of the attention signal from inhibitory (blue) and excitatory (red) units from the two-modality RNNs. b. SVM decodability of the attended first stimulus identity from inhibitory (blue) and excitatory (red) units from the two-modality RNNs. c. SVM decodability of the unattended first stimulus identity from inhibitory (blue) and excitatory (red) units from the two-modality RNNs. Black bars indicate significant differences between excitatory and inhibitory units (P < 0.05, two-sided two-sample KS test). Source data can be found in Zenodo at https://doi.org/10.5281/zenodo.20148306.
https://doi.org/10.1371/journal.pbio.3003831.s005
(EPS)
S6 Fig. Structured connectivity across rule-selective populations.
Average synaptic weights (magnitude) between rule-selective populations across all trained one-module, one-modality RNNs (n = 20), separated into within-selectivity (“Same”; anti → anti and pro → pro) and across-selectivity (“Cross”; anti → pro and pro → anti) connections. Panels show results for inhibitory-to-inhibitory (I → I), excitatory-to-inhibitory (E → I), inhibitory-to-excitatory (I → E), and excitatory-to-excitatory (E → E) connections. Each gray line represents an individual network, and black markers denote mean ± s.e.m. ***P < 0.001 by paired Wilcoxon signed-rank test. Source data can be found in Zenodo at https://doi.org/10.5281/zenodo.20148306.
https://doi.org/10.1371/journal.pbio.3003831.s006
(EPS)
S7 Fig. Quantification of outgoing synaptic connections in single-module one-modality RNNs.
Boxplots show the total number of outgoing synaptic connections for task-selective excitatory and inhibitory neuron populations across 20 single-module RNNs trained to perform the one-modality DMS task. a. Outgoing connections from pro-DMS-selective populations; b. Outgoing connections from anti-DMS-selective populations. For each condition, outgoing connections are grouped by presynaptic and postsynaptic cell type (,
,
,
). Boxplot: central lines, median; bottom and top edges, lower and upper quartiles; whiskers, 1.5 × interquartile range; outliers are not plotted. Statistical comparisons were performed using a Kruskal–Wallis test followed by Dunn’s post hoc multiple-comparisons test. Significance levels are indicated as **P < 0.01 and ***P < 0.001. Source data can be found in Zenodo at https://doi.org/10.5281/zenodo.20148306.
https://doi.org/10.1371/journal.pbio.3003831.s007
(EPS)
S8 Fig. Inhibitory-to-inhibitory connections primarily encode task and attention cue signals.
a. Performance of all 20 RNNs trained to perform the two-modality DMS when task-specific (top) or
(bottom) synaptic weights were reduced by 50% when modality 2 was attended. b. Performance of all 20 RNNs trained to perform the two-modality DMS when modality-specific
(top) or
(bottom) synaptic weights were reduced by 50% when the task signal was fixed to anti-DMS.
, inhibitory units preferring anti-DMS cue;
, inhibitory units preferring pro-DMS cue;
, excitatory units preferring anti-DMS cue;
, excitatory units preferring pro-DMS cue;
, inhibitory units preferring modality-1 attention cue;
, inhibitory units preferring modality-2 attention cue;
, excitatory units preferring modality-1 attention cue;
, excitatory units preferring modality-2 attention cue. Boxplot: central lines, median; bottom and top edges, lower and upper quartiles; whiskers, 1.5 × interquartile range; outliers are not plotted. *P < 0.05, **P < 0.01, ***P < 0.001 by two-sided Wilcoxon rank-sum test. Source data can be found in Zenodo at https://doi.org/10.5281/zenodo.20148306.
https://doi.org/10.1371/journal.pbio.3003831.s008
(EPS)
S9 Fig. Perturbational analysis of excitatory and inhibitory synaptic connections in one-modality RNNs.
Task performance following selective lesioning of synaptic connections based on presynaptic/postsynaptic cell type and task selectivity. Rows correspond to different connection classes (,
,
,
), and columns show lesions applied to selectivity-matched or crossed populations (anti versus pro). Lesions were implemented by reducing the corresponding synaptic weights by 50%, and performance is shown separately for anti and pro trials. Boxplot: central lines, median; bottom and top edges, lower and upper quartiles; whiskers, 1.5 × interquartile range; outliers are not plotted. **P < 0.01 and ***P < 0.001 by two-sided Wilcoxon rank-sum test. Source data can be found in Zenodo at https://doi.org/10.5281/zenodo.20148306.
https://doi.org/10.1371/journal.pbio.3003831.s009
(EPS)
S10 Fig. Perturbational analysis of excitatory and inhibitory synaptic connections in two-modality RNNs when modality 1 is attended.
Task performance following selective lesioning of synaptic connections in two-modality RNNs, with the attention cue fixed to modality 1. Figure organization follows the same layout and conventions as S9 Fig. Boxplot: central lines, median; bottom and top edges, lower and upper quartiles; whiskers, 1.5 × interquartile range; outliers are not plotted. *P < 0.05, **P < 0.01, ***P < 0.001 by two-sided Wilcoxon rank-sum test. Source data can be found in Zenodo at https://doi.org/10.5281/zenodo.20148306.
https://doi.org/10.1371/journal.pbio.3003831.s010
(EPS)
S11 Fig. Perturbational analysis of excitatory and inhibitory synaptic connections in two-modality RNNs when modality 2 is attended.
Task performance following selective lesioning of synaptic connections in two-modality RNNs, with the attention cue fixed to modality 2. Figure organization follows the same layout and conventions as S9 Fig. Boxplot: central lines, median; bottom and top edges, lower and upper quartiles; whiskers, 1.5 × interquartile range; outliers are not plotted. *P < 0.05, **P < 0.01, ***P < 0.001 by two-sided Wilcoxon rank-sum test. Source data can be found in Zenodo at https://doi.org/10.5281/zenodo.20148306.
https://doi.org/10.1371/journal.pbio.3003831.s011
(EPS)
S12 Fig. Perturbational analysis of excitatory and inhibitory synaptic connections in two-modality RNNs with the task signal fixed to the pro-DMS condition.
Task performance following selective lesioning of synaptic connections in two-modality RNNs, with the task cue fixed to the pro-DMS condition and performance compared between trials in which modality 1 or modality 2 was attended. Figure organization follows the same layout and conventions as S9 Fig. Boxplot: central lines, median; bottom and top edges, lower and upper quartiles; whiskers, 1.5 × interquartile range; outliers are not plotted. *P < 0.05, **P < 0.01, ***P < 0.001 by two-sided Wilcoxon rank-sum test. Source data can be found in Zenodo at https://doi.org/10.5281/zenodo.20148306.
https://doi.org/10.1371/journal.pbio.3003831.s012
(EPS)
S13 Fig. Perturbational analysis of excitatory and inhibitory synaptic connections in two-modality RNNs with the task signal fixed to the anti-DMS condition.
Task performance following selective lesioning of synaptic connections in two-modality RNNs, with the task cue fixed to the anti-DMS condition and performance compared between trials in which modality 1 or modality 2 was attended. Figure organization follows the same layout and conventions as S9 Fig. Boxplot: central lines, median; bottom and top edges, lower and upper quartiles; whiskers, 1.5 × interquartile range; outliers are not plotted. *P < 0.05, **P < 0.01, ***P < 0.001 by two-sided Wilcoxon rank-sum test. Source data can be found in Zenodo at https://doi.org/10.5281/zenodo.20148306.
https://doi.org/10.1371/journal.pbio.3003831.s013
(EPS)
S14 Fig. Example single-unit activity and output dynamics from a trained two-module, one-modality RNN.
a. Neuron-by-time heatmaps showing activity of all 1,000 units from a representative two-module, one-modality RNN during Pro-DMS early (left) and late (right) instruction trials. For visualization purposes only, each neuron’s activity was z-scored across time. White vertical lines demarcate key task epochs, including instruction, stimulus, delay, and response periods. b. Corresponding RNN output signals for Pro-DMS early (red) and late (blue) instruction trials. Gray shaded regions indicate the presentation of stimulus 1 (Stim 1) and stimulus 2 (Stim 2). The output reflects the learned stimulus–response mapping conditioned on the task cue. Source data can be found in Zenodo at https://doi.org/10.5281/zenodo.20148306.
https://doi.org/10.1371/journal.pbio.3003831.s014
(EPS)
S15 Fig. Training two-module RNNs takes longer for the one-modality DMS compared to the one-module RNNs.
a. Comparison of the number of training trials required for training between the one-module and two-module RNNs, with 20 RNNs trained for each case. b. Both one-module and two-module RNNs achieved high accuracy in performing the one-modality DMS task (mean±stdev; 97.7±1.3% for the one-module model; 97.6±1.2% for the two-module model). Boxplot: central lines, median; bottom and top edges, lower and upper quartiles; whiskers, 1.5 × interquartile range; outliers are not plotted. ***P < 0.001 by two-sided Wilcoxon rank-sum test. Source data can be found in Zenodo at https://doi.org/10.5281/zenodo.20148306.
https://doi.org/10.1371/journal.pbio.3003831.s015
(EPS)
S16 Fig. Comparison between
and
connection strength in two-module RNNs.
(a) and
(b) connections were significantly stronger within the non-sensory module than the sensory module. Boxplot: central lines, median; bottom and top edges, lower and upper quartiles; whiskers, 1.5 × interquartile range; outliers are not plotted. ***P < 0.001 by two-sided Wilcoxon rank-sum test. Source data can be found in Zenodo at https://doi.org/10.5281/zenodo.20148306.
https://doi.org/10.1371/journal.pbio.3003831.s016
(EPS)
S17 Fig. Decoding task signals in nonsensory module of two-module RNNs.
a. Average decoding accuracy in early instruction trials for pro- versus anti-DMS signal in the nonsensory module of two-module networks, including either excitatory (red) or inhibitory (blue) units. Shaded areas indicate standard error of the mean. Black bars indicate significant differences between excitatory and inhibitory units (P < 0.05, two-sided KS test). b. Same, for late instruction trials. Source data can be found in Zenodo at https://doi.org/10.5281/zenodo.20148306.
https://doi.org/10.1371/journal.pbio.3003831.s017
(EPS)
S18 Fig. Perturbational analysis of excitatory and inhibitory synaptic connections within the sensory module in two-module RNNs.
Task performance following selective lesioning of synaptic connections within the sensory module of two-module RNNs trained to perform the one-modality task. Rows correspond to different connection classes (,
,
,
), and columns show lesions applied to selectivity-matched or crossed populations (anti versus pro). Lesions were implemented by reducing the corresponding synaptic weights by 50%, and performance is shown separately for anti and pro trials. Boxplot: central lines, median; bottom and top edges, lower and upper quartiles; whiskers, 1.5 × interquartile range; outliers are not plotted. *P < 0.05 and **P < 0.01 by two-sided Wilcoxon rank-sum test. Source data can be found in Zenodo at https://doi.org/10.5281/zenodo.20148306.
https://doi.org/10.1371/journal.pbio.3003831.s018
(EPS)
S19 Fig. Perturbational analysis of excitatory and inhibitory synaptic connections from non-sensory module to sensory module in two-module RNNs.
Task performance following selective lesioning of synaptic connections from the nonsensory module to the sensory module of two-module RNNs trained to perform the one-modality task. Rows correspond to different connection classes (,
,
,
), and columns show lesions applied to selectivity-matched or crossed populations (anti versus pro). Lesions were implemented by reducing the corresponding synaptic weights by 50%, and performance is shown separately for anti and pro trials. Boxplot: central lines, median; bottom and top edges, lower and upper quartiles; whiskers, 1.5 × interquartile range; outliers are not plotted. *P < 0.05 by two-sided Wilcoxon rank-sum test. Source data can be found in Zenodo at https://doi.org/10.5281/zenodo.20148306.
https://doi.org/10.1371/journal.pbio.3003831.s019
(EPS)
S20 Fig. Perturbational analysis of excitatory and inhibitory synaptic connections from sensory module to non-sensory module in two-module RNNs.
Task performance following selective lesioning of synaptic connections from the sensory module to the nonsensory module of two-module RNNs trained to perform the one-modality task. Rows correspond to different connection classes (,
,
,
), and columns show lesions applied to selectivity-matched or crossed populations (anti versus pro). Lesions were implemented by reducing the corresponding synaptic weights by 50%, and performance is shown separately for anti and pro trials. Boxplot: central lines, median; bottom and top edges, lower and upper quartiles; whiskers, 1.5 × interquartile range; outliers are not plotted. **P < 0.01 and ***P < 0.001 by two-sided Wilcoxon rank-sum test. Source data can be found in Zenodo at https://doi.org/10.5281/zenodo.20148306.
https://doi.org/10.1371/journal.pbio.3003831.s020
(EPS)
S21 Fig. Perturbational analysis of excitatory and inhibitory synaptic connections within non-sensory module in two-module RNNs.
Task performance following selective lesioning of synaptic connections within the nonsensory module of two-module RNNs trained to perform the one-modality task. Rows correspond to different connection classes (,
,
,
), and columns show lesions applied to selectivity-matched or crossed populations (anti versus pro). Lesions were implemented by reducing the corresponding synaptic weights by 50%, and performance is shown separately for anti and pro trials. Boxplot: central lines, median; bottom and top edges, lower and upper quartiles; whiskers, 1.5 × interquartile range; outliers are not plotted. ***P < 0.001 by two-sided Wilcoxon rank-sum test. Source data can be found in Zenodo at https://doi.org/10.5281/zenodo.20148306.
https://doi.org/10.1371/journal.pbio.3003831.s021
(EPS)
S22 Fig. Effects of synaptic lesioning on unattended stimulus representations in two-module RNNs.
Average cross-temporal discriminability of the first stimulus identity of the unattended modality in two-module RNNs trained to perform the two-modality DMS task. Discriminability matrices are shown for trials in which the attention cue was presented early (left), late (middle), or as a retro-cue (right). Rows correspond to selective lesioning (50% reduction in synaptic weights) of different connection types: a. Inhibitory-to-inhibitory (). b. Inhibitory-to-excitatory (
). c. Excitatory-to-inhibitory (
). d. Excitatory-to-excitatory (
) connections. White lines denote task epoch boundaries. Results are averaged across 20 trained RNNs. Source data can be found in Zenodo at https://doi.org/10.5281/zenodo.20148306.
https://doi.org/10.1371/journal.pbio.3003831.s022
(EPS)
Acknowledgments
We thank Jorge Aldana and the Computational Neurobiology Laboratory at the Salk Institute for assistance with computing resources. We thank Lisa Kirchberger of the Netherlands Institute for Neuroscience for the support in providing access to mouse data for this manuscript. We are also grateful to Mackenzie Mathis for her invaluable feedback on our work.
References
- 1.
Grossberg S, Grossberg S. How does a brain build a cognitive code? In: Studies of mind and brain: neural principles of learning, perception, development, cognition, and motor control; 1982:1–52.
- 2. Friston K. A theory of cortical responses. Philos Trans R Soc Lond B Biol Sci. 2005;360(1456):815–36. pmid:15937014
- 3. Mumford D. On the computational architecture of the neocortex. II. The role of cortico-cortical loops. Biol Cybern. 1992;66(3):241–51. pmid:1540675
- 4. Fenske MJ, Aminoff E, Gronau N, Bar M. Top-down facilitation of visual object recognition: object-based and context-based contributions. Prog Brain Res. 2006;155:3–21. pmid:17027376
- 5. Bar M, Kassam KS, Ghuman AS, Boshyan J, Schmid AM, Dale AM, et al. Top-down facilitation of visual recognition. Proc Natl Acad Sci U S A. 2006;103(2):449–54. pmid:16407167
- 6. Bullier J. Integrated model of visual processing. Brain Res Brain Res Rev. 2001;36(2–3):96–107. pmid:11690606
- 7. Rungratsameetaweemana N, Squire LR, Serences JT. Preserved capacity for learning statistical regularities and directing selective attention after hippocampal lesions. Proc Natl Acad Sci U S A. 2019;116(39):19705–10. pmid:31492814
- 8. Melloni L, van Leeuwen S, Alink A, Müller NG. Interaction between bottom-up saliency and top-down control: how saliency maps are created in the human brain. Cereb Cortex. 2012;22(12):2943–52. pmid:22250291
- 9. Mante V, Sussillo D, Shenoy KV, Newsome WT. Context-dependent computation by recurrent dynamics in prefrontal cortex. Nature. 2013;503(7474):78–84. pmid:24201281
- 10. Cole MW, Reynolds JR, Power JD, Repovs G, Anticevic A, Braver TS. Multi-task connectivity reveals flexible hubs for adaptive task control. Nat Neurosci. 2013;16(9):1348–55. pmid:23892552
- 11. Remington ED, Narain D, Hosseini EA, Jazayeri M. Flexible sensorimotor computations through rapid reconfiguration of cortical dynamics. Neuron. 2018;98(5):1005-1019.e5. pmid:29879384
- 12. Schapiro AC, Gregory E, Landau B, McCloskey M, Turk-Browne NB. The necessity of the medial temporal lobe for statistical learning. J Cogn Neurosci. 2014;26(8):1736–47. pmid:24456393
- 13. Uddin LQ. Cognitive and behavioural flexibility: neural mechanisms and clinical considerations. Nat Rev Neurosci. 2021;22(3):167–79. pmid:33536614
- 14. Douw L, Wakeman DG, Tanaka N, Liu H, Stufflebeam SM. State-dependent variability of dynamic functional connectivity between frontoparietal and default networks relates to cognitive flexibility. Neuroscience. 2016;339:12–21. pmid:27687802
- 15. Voss JL, Gonsalves BD, Federmeier KD, Tranel D, Cohen NJ. Hippocampal brain-network coordination during volitional exploratory behavior enhances learning. Nat Neurosci. 2011;14(1):115–20. pmid:21102449
- 16. Luck SJ, Chelazzi L, Hillyard SA, Desimone R. Neural mechanisms of spatial selective attention in areas V1, V2, and V4 of macaque visual cortex. J Neurophysiol. 1997;77(1):24–42. pmid:9120566
- 17. Zhang S, Xu M, Kamigaki T, Hoang Do JP, Chang W-C, Jenvay S, et al. Selective attention. Long-range and local circuits for top-down modulation of visual cortex processing. Science. 2014;345(6197):660–5. pmid:25104383
- 18. Morales-Gregorio A, Kurth AC, Ito J, Kleinjohann A, Barthélemy FV, Brochier T, et al. Neural manifolds in V1 change with top-down signals from V4 targeting the foveal region. Cell Rep. 2024;43(7):114371. pmid:38923458
- 19. McBride EG, Lee S-YJ, Callaway EM. Local and global influences of visual spatial selection and locomotion in mouse primary visual cortex. Curr Biol. 2019;29(10):1592-1605.e5. pmid:31056388
- 20. Saenz M, Buracas GT, Boynton GM. Global effects of feature-based attention in human visual cortex. Nat Neurosci. 2002;5(7):631–2. pmid:12068304
- 21. Serences JT, Boynton GM. Feature-based attentional modulations in the absence of direct visual stimulation. Neuron. 2007;55(2):301–12. pmid:17640530
- 22. Itthipuripat S, Garcia JO, Rungratsameetaweemana N, Sprague TC, Serences JT. Changing the spatial scope of attention alters patterns of neural gain in human cortex. J Neurosci. 2014;34(1):112–23. pmid:24381272
- 23. Debes SR, Dragoi V. Suppressing feedback signals to visual cortex abolishes attentional modulation. Science. 2023;379(6631):468–73. pmid:36730414
- 24. Liu T, Larsson J, Carrasco M. Feature-based attention modulates orientation-selective responses in human visual cortex. Neuron. 2007;55(2):313–23. pmid:17640531
- 25. Henderson MM, Serences JT, Rungratsameetaweemana N. Dynamic categorization rules alter representations in human visual cortex. Nat Commun. 2025;16(1):3459. pmid:40216798
- 26. Rademaker RL, Chunharas C, Serences JT. Coexisting representations of sensory and mnemonic information in human visual cortex. Nat Neurosci. 2019;22(8):1336–44. pmid:31263205
- 27. Musall S, Kaufman MT, Juavinett AL, Gluf S, Churchland AK. Single-trial neural dynamics are dominated by richly varied movements. Nat Neurosci. 2019;22(10):1677–86. pmid:31551604
- 28. International Brain Laboratory, Angelaki D, Benson B, Benson J, Birman D, Bonacchi N, et al. A brain-wide map of neural activity during complex behaviour. Nature. 2025;645(8079):177–91. pmid:40903598
- 29. Ebrahimi S, Lecoq J, Rumyantsev O, Tasci T, Zhang Y, Irimia C, et al. Emergent reliability in sensory cortical coding and inter-area communication. Nature. 2022;605(7911):713–21. pmid:35589841
- 30. Alreja A, Nemenman I, Rozell CJ. Constrained brain volume in an efficient coding model explains the fraction of excitatory and inhibitory neurons in sensory cortices. PLoS Comput Biol. 2022;18(1):e1009642. pmid:35061666
- 31. Melzer S, Monyer H. Diversity and function of corticopetal and corticofugal GABAergic projection neurons. Nat Rev Neurosci. 2020;21(9):499–515. pmid:32747763
- 32. Wu YK, Miehl C, Gjorgjieva J. Regulation of circuit organization and function through inhibitory synaptic plasticity. Trends Neurosci. 2022;45(12):884–98. pmid:36404455
- 33. Schroeder A, Pardi MB, Keijser J, Dalmay T, Groisman AI, Schuman EM, et al. Inhibitory top-down projections from zona incerta mediate neocortical memory. Neuron. 2023;111(5):727-738.e8. pmid:36610397
- 34. Pfeffer CK, Xue M, He M, Huang ZJ, Scanziani M. Inhibition of inhibition in visual cortex: the logic of connections between molecularly distinct interneurons. Nat Neurosci. 2013;16(8):1068–76. pmid:23817549
- 35. Pi H-J, Hangya B, Kvitsiani D, Sanders JI, Huang ZJ, Kepecs A. Cortical interneurons that specialize in disinhibitory control. Nature. 2013;503(7477):521–4. pmid:24097352
- 36. Herstel LJ, Wierenga CJ. Network control through coordinated inhibition. Curr Opin Neurobiol. 2021;67:34–41. pmid:32853970
- 37. Isaacson JS, Scanziani M. How inhibition shapes cortical activity. Neuron. 2011;72(2):231–43. pmid:22017986
- 38.
Garrett M, Groblewski P, Piet A, Ollerenshaw D, Najafi F, Yavorska I, et al. Stimulus novelty uncovers coding diversity in visual cortical circuits. bioRxiv. 2023:2023–02.
- 39. Piet A, Ponvert N, Ollerenshaw D, Garrett M, Groblewski PA, Olsen S, et al. Behavioral strategy shapes activation of the Vip-Sst disinhibitory circuit in visual cortex. Neuron. 2024;112(11):1876-1890.e4. pmid:38447579
- 40. Keller AJ, Dipoppa M, Roth MM, Caudill MS, Ingrosso A, Miller KD, et al. A Disinhibitory Circuit for contextual modulation in primary visual cortex. Neuron. 2020;108(6):1181-1193.e8. pmid:33301712
- 41. Wilmes KA, Clopath C. Inhibitory microcircuits for top-down plasticity of sensory representations. Nat Commun. 2019;10(1):5055. pmid:31699994
- 42. Lehr AB, Kumar A, Tetzlaff C. Spatially heterogeneous inhibition projects sequential activity onto unique neural subspaces. eLife. 2025;14.
- 43. Shen B, Louie K, Glimcher P. Flexible control of representational dynamics in a disinhibition-based model of decision-making. Elife. 2023;12:e82426. pmid:37261426
- 44. Schulz A, Miehl C, Berry MJ 2nd, Gjorgjieva J. The generation of cortical novelty responses through inhibitory plasticity. Elife. 2021;10:e65309. pmid:34647889
- 45. Ito S, Piet A, Bennett C, Durand S, Belski H, Garrett M, et al. Coordinated changes in a cortical circuit sculpt effects of novelty on neural dynamics. Cell Rep. 2024;43(9):114763. pmid:39288028
- 46. Panzeri S, Brunel N, Logothetis NK, Kayser C. Sensory neural codes using multiplexed temporal scales. Trends Neurosci. 2010;33(3):111–20. pmid:20045201
- 47. Gire DH, Whitesell JD, Doucette W, Restrepo D. Information for decision-making and stimulus identification is multiplexed in sensory cortex. Nat Neurosci. 2013;16(8):991–3. pmid:23792942
- 48. Song HF, Yang GR, Wang X-J. Training excitatory-inhibitory recurrent neural networks for cognitive tasks: a simple and flexible framework. PLoS Comput Biol. 2016;12(2):e1004792. pmid:26928718
- 49. Miconi T. Biologically plausible learning in recurrent neural networks reproduces neural dynamics observed during cognitive tasks. Elife. 2017;6:e20899. pmid:28230528
- 50. Yang GR, Joglekar MR, Song HF, Newsome WT, Wang X-J. Task representations in neural networks trained to perform many cognitive tasks. Nat Neurosci. 2019;22(2):297–306. pmid:30643294
- 51. Barbosa J, Proville R, Rodgers CC, DeWeese MR, Ostojic S, Boubenec Y. Early selection of task-relevant features through population gating. Nat Commun. 2023;14(1):6837. pmid:37884507
- 52.
Park I, Ságodi Á, Sokół PA. Persistent learning signals and working memory without continuous attractors. 2023. Available from: https://arxiv.org/abs/2308.12585
- 53. Roach JP, Churchland AK, Engel TA. Choice selective inhibition drives stability and competition in decision circuits. Nat Commun. 2023;14(1):147. pmid:36627310
- 54. Kim R, Li Y, Sejnowski TJ. Simple framework for constructing functional spiking recurrent neural networks. Proc Natl Acad Sci U S A. 2019;116(45):22811–20. pmid:31636215
- 55. Liu Y, Wang X-J. Flexible gating between subspaces in a neural network model of internally guided task switching. Nat Commun. 2024;15(1):6497. pmid:39090084
- 56. Vyas S, Golub MD, Sussillo D, Shenoy KV. Computation through neural population dynamics. Annu Rev Neurosci. 2020;43:249–75. pmid:32640928
- 57.
Kleinman M, Chandrasekaran C, Kao JC. Recurrent neural network models of multi-area computation underlying decision-making. bioRxiv. 2019:798553.
- 58. Michaels JA, Schaffelhofer S, Agudelo-Toro A, Scherberger H. A goal-driven modular neural network predicts parietofrontal neural dynamics during grasping. Proc Natl Acad Sci U S A. 2020;117(50):32124–35. pmid:33257539
- 59. Kim R, Sejnowski TJ. Strong inhibitory signaling underlies stable temporal dynamics and working memory in spiking neural networks. Nat Neurosci. 2021;24(1):129–39. pmid:33288909
- 60. Kim Y, Yang GR, Pradhan K, Venkataraju KU, Bota M, García Del Molino LC, et al. Brain-wide maps reveal stereotyped cell-type-based cortical architecture and subcortical sexual dimorphism. Cell. 2017;171(2):456-469.e22. pmid:28985566
- 61. Kirchberger L, Mukherjee S, Schnabel UH, van Beest EH, Barsegyan A, Levelt CN, et al. The essential role of recurrent processing for figure-ground perception in mice. Sci Adv. 2021;7(27):eabe1833. pmid:34193411
- 62. Blough DS. Delayed matching in the pigeon. J Exp Anal Behav. 1959;2(2):151–60. pmid:13801643
- 63. Hampson RE, Heyser CJ, Deadwyler SA. Hippocampal cell firing correlates of delayed-match-to-sample performance in the rat. Behav Neurosci. 1993;107(5):715–39. pmid:8280383
- 64. Mishkin M, Delacour J. An analysis of short-term visual memory in the monkey. J Exp Psychol Anim Behav Process. 1975;1(4):326–34. pmid:811754
- 65. Wu Z, Litwin-Kumar A, Shamash P, Taylor A, Axel R, Shadlen MN. Context-dependent decision making in a premotor circuit. Neuron. 2020;106(2):316-328.e6. pmid:32105611
- 66. Monk CS, Zhuang J, Curtis WJ, Ofenloch I-T, Tottenham N, Nelson CA, et al. Human hippocampal activation in the delayed matching- and nonmatching-to-sample memory tasks: an event-related functional MRI approach. Behav Neurosci. 2002;116(4):716–21. pmid:12148938
- 67. Sussillo D, Barak O. Opening the black box: low-dimensional dynamics in high-dimensional recurrent neural networks. Neural Comput. 2013;25(3):626–49. pmid:23272922
- 68. Golub M, Sussillo D. FixedPointFinder: a Tensorflow toolbox for identifying and characterizing fixed points in recurrent neural networks. JOSS. 2018;3(31):1003.
- 69. Driscoll LN, Shenoy K, Sussillo D. Flexible multitask computation in recurrent networks utilizes shared dynamical motifs. Nat Neurosci. 2024;27(7):1349–63. pmid:38982201
- 70. Schneider S, Lee JH, Mathis MW. Learnable latent embeddings for joint behavioural and neural analysis. Nature. 2023;617(7960):360–8. pmid:37138088
- 71. Bastos G, Holmes JT, Ross JM, Rader AM, Gallimore CG, Wargo JA, et al. Top-down input modulates visual context processing through an interneuron-specific circuit. Cell Rep. 2023;42(9):113133. pmid:37708021
- 72. Chini M, Pfeffer T, Hanganu-Opatz I. An increase of inhibition drives the developmental decorrelation of neural activity. Elife. 2022;11:e78811. pmid:35975980
- 73. Lamme VA, Roelfsema PR. The distinct modes of vision offered by feedforward and recurrent processing. Trends Neurosci. 2000;23(11):571–9. pmid:11074267
- 74. Klink PC, Dagnino B, Gariel-Mathis M-A, Roelfsema PR. Distinct feedforward and feedback effects of microstimulation in visual cortex reveal neural mechanisms of texture segregation. Neuron. 2017;95(1):209-220.e3. pmid:28625487
- 75. Lee S, Kruglikov I, Huang ZJ, Fishell G, Rudy B. A disinhibitory circuit mediates motor integration in the somatosensory cortex. Nat Neurosci. 2013;16(11):1662–70. pmid:24097044
- 76. Zhang S, Xu M, Chang W-C, Ma C, Hoang Do JP, Jeong D, et al. Organization of long-range inputs and outputs of frontal cortex for top-down control. Nat Neurosci. 2016;19(12):1733–42. pmid:27749828
- 77. Yavorska I, Wehr M. Somatostatin-expressing inhibitory interneurons in cortical circuits. Front Neural Circuits. 2016;10:76. pmid:27746722
- 78. Fino E, Yuste R. Dense inhibitory connectivity in neocortex. Neuron. 2011;69(6):1188–203. pmid:21435562
- 79. Fu Y, Tucciarone JM, Espinosa JS, Sheng N, Darcy DP, Nicoll RA, et al. A cortical circuit for gain control by behavioral state. Cell. 2014;156(6):1139–52. pmid:24630718
- 80. Karnani MM, Jackson J, Ayzenshtat I, Hamzehei Sichani A, Manoocheri K, Kim S, et al. Opening holes in the blanket of inhibition: localized lateral disinhibition by VIP Interneurons. J Neurosci. 2016;36(12):3471–80. pmid:27013676
- 81. Xu H, Liu L, Tian Y, Wang J, Li J, Zheng J, et al. A disinhibitory microcircuit mediates conditioned social fear in the prefrontal cortex. Neuron. 2019;102(3):668-682.e5. pmid:30898376
- 82. Letzkus JJ, Wolff SBE, Meyer EMM, Tovote P, Courtin J, Herry C, et al. A disinhibitory microcircuit for associative fear learning in the auditory cortex. Nature. 2011;480(7377):331–5. pmid:22158104
- 83. Wang X-J, Yang GR. A disinhibitory circuit motif and flexible information routing in the brain. Curr Opin Neurobiol. 2018;49:75–83. pmid:29414069
- 84. Kuchibhotla KV, Gill JV, Lindsay GW, Papadoyannis ES, Field RE, Sten TAH, et al. Parallel processing by cortical inhibition enables context-dependent behavior. Nat Neurosci. 2017;20(1):62–71. pmid:27798631
- 85. Kepecs A, Fishell G. Interneuron cell types are fit to function. Nature. 2014;505(7483):318–26. pmid:24429630
- 86. Makino H, Komiyama T. Learning enhances the relative impact of top-down processing in the visual cortex. Nat Neurosci. 2015;18(8):1116–22. pmid:26167904
- 87. Chen SX, Kim AN, Peters AJ, Komiyama T. Subtype-specific plasticity of inhibitory circuits in motor cortex during motor learning. Nat Neurosci. 2015;18(8):1109–15. pmid:26098758
- 88. Garrett M, Manavi S, Roll K, Ollerenshaw DR, Groblewski PA, Ponvert ND, et al. Experience shapes activity dynamics and stimulus coding of VIP inhibitory cells. Elife. 2020;9:e50340. pmid:32101169
- 89. Steriade M, Contreras D. Spike-wave complexes and fast components of cortically generated seizures. I. Role of neocortex and thalamus. J Neurophysiol. 1998;80(3):1439–55. pmid:9744951
- 90. Sanzeni A, Akitake B, Goldbach HC, Leedy CE, Brunel N, Histed MH. Inhibition stabilization is a widespread property of cortical networks. Elife. 2020;9:e54875. pmid:32598278
- 91. Ozeki H, Finn IM, Schaffer ES, Miller KD, Ferster D. Inhibitory stabilization of the cortical network underlies visual surround suppression. Neuron. 2009;62(4):578–92. pmid:19477158
- 92. Bertero A, Garcia C, Apicella AJ. Corticofugal VIP gabaergic projection neurons in the mouse auditory and motor cortex. Front Neural Circuits. 2021;15:714780. pmid:34366798
- 93.
Dellal S, Zurita H, Kruglikov I, Valero M, Abad-Perez P, Geron E, et al. Inhibitory and disinhibitory vip in-mediated circuits in neocortex. bioRxiv. 2025.
- 94. Werbos PJ. Backpropagation through time: what it does and how to do it. Proc IEEE. 1990;78(10):1550–60.
- 95. Rungratsameetaweemana N, Kim R, Chotibut T, Sejnowski TJ. Random noise promotes slow heterogeneous synaptic dynamics important for robust working memory computation. Proc Natl Acad Sci U S A. 2025;122(3):e2316745122. pmid:39819216
- 96. Hendry SH, Schwark HD, Jones EG, Yan J. Numbers and proportions of GABA-immunoreactive neurons in different areas of monkey cerebral cortex. J Neurosci. 1987;7(5):1503–19. pmid:3033170
- 97. Sherwood CC, Raghanti MA, Stimpson CD, Bonar CJ, de Sousa AA, Preuss TM, et al. Scaling of inhibitory interneurons in areas v1 and v2 of anthropoid primates as revealed by calcium-binding protein immunohistochemistry. Brain Behav Evol. 2007;69(3):176–95. pmid:17106195
- 98. Duarte R, Seeholzer A, Zilles K, Morrison A. Synaptic patterning and the timescales of cortical dynamics. Curr Opin Neurobiol. 2017;43:156–65. pmid:28407562
- 99. Zador AM, Dobrunz LE. Dynamic synapses in the cortex. Neuron. 1997;19(1):1–4. pmid:9247258
- 100. Wasmuht DF, Spaak E, Buschman TJ, Miller EK, Stokes MG. Intrinsic neuronal dynamics predict distinct functional roles during working memory. Nat Commun. 2018;9(1):3499. pmid:30158572
- 101. Spaak E, Watanabe K, Funahashi S, Stokes MG. Stable and dynamic coding for working memory in primate prefrontal cortex. J Neurosci. 2017;37(27):6503–16. pmid:28559375
- 102. Stokes MG, Kusunoki M, Sigala N, Nili H, Gaffan D, Duncan J. Dynamic coding for cognitive control in prefrontal cortex. Neuron. 2013;78(2):364–75. pmid:23562541
- 103.
Hagberg A, Swart P, S Chult D. Exploring network structure, dynamics, and function using networkx. Los Alamos, NM (United States): Los Alamos National Lab; 2008.




















English (US) ·
French (CA) ·