Abstract
Mastering two languages has been associated with enhancement in human executive control, but previous studies of this phenomenon have exclusively relied on comparisons between bilingual and monolingual individuals. In the present study, we tested a single group of Welsh–English bilinguals engaged in a nonverbal conflict resolution task and manipulated language context by intermittently presenting words in Welsh, English, or both languages. Surprisingly, participants showed enhanced executive capacity to resolve interference when exposed to a mixed compared with a single language context, even though they ignored the irrelevant contextual words. This result was supported by greater response accuracy and reduced amplitude of the P300, an electrophysiological correlate of cognitive interference. Our findings introduce a new level of plasticity in bilingual executive control dependent on fast changing language context rather than long-term language experience.
Introduction
Bilinguals outperform monolinguals in a number of nonlinguistic tests of executive function (Bialystok et al., 2004; Costa et al., 2008). One explanation for this bilingual advantage is that managing two languages requires constantly selecting words in the intended language and inhibiting words from the unintended language, processes that are thought to engender a highly efficient control mechanism (Green, 1998; Bialystok, 2001, 2007). Furthermore, the experience of multiple languages has been shown to facilitate the development of executive function in children and protect elderly people against executive control decline (Bialystok et al., 2006; Kovács and Mehler, 2009; Gold at al., 2013).
However, bilinguals and monolinguals differ not only based on language ability but also other variables, such as socioeconomic status and ethnic origin. Indeed, the latter two factors have been hypothesized to account for the bilingual advantage (Sabbagh et al., 2006; Morton and Harper, 2007). The methodological dilemma is that preexisting differences between bilingual and monolingual individuals make it difficult, if not impossible, to match participants across groups on language-dependent variables, because language plays an important role in almost every aspect of life. Therefore, it remains to be established whether bilingualism directly enhances cognitive abilities or whether the bilingual advantage is an artifact prompted by language-associated confounding variables.
Even if we consider the existence of the bilingual advantage well established, it is unknown whether such advantage is permanent or modulated by the immediate cognitive context. Unlike monolinguals, bilinguals can either use one language (monolingual context) or both their languages (bilingual context) in a given interaction. If bilingualism bestows a generic and permanent executive control advantage to bilinguals, enhanced cognitive control should be independent of language context. If, on the contrary, the bilingual advantage is context dependent, enhancement should be most salient when bilinguals are exposed to their two languages.
The present study examined this hypothesis by testing effects of the immediate language context within a group of early Welsh–English bilinguals. Executive functioning was tested in a simplified version of the flanker task (Fan et al., 2002). Participants were instructed to press a button to indicate the direction of a central arrow (i.e., the target) surrounded by congruent or incongruent flanker arrows. Occasionally, a word was presented instead of arrows, which participants were instructed to ignore. This paradigm, therefore, differs radically from that used in previous studies of cognitive control looking into sequential effects of congruent and incongruent trials on performance (Botvinick et al., 1999; Kerns et al., 2004). The experiment contained three blocks in which the contextual words to be ignored were in Welsh, in English, or in either of the two languages (Fig. 1). We expected trials in which the direction of flanker arrows was incongruent with that of the central arrow to require more inhibitory control than congruent trials, thereby increasing the amplitude of the P300 wave of ERPs. A modulation of the P300 congruency effect by the type of words presented incidentally would indicate that bilinguals experience fast modulation of their executive system by language context.
Materials and Methods
Participants.
Eighteen right-handed (Oldfield, 1971) Welsh–English bilinguals (nine females and nine males; 20.4 ± 2.1 years) with normal or corrected-to-normal vision took part in the experiment. No participants reported neurological or other mental issues. Bilingual participants learned both languages at an early age (mean age of acquisition of Welsh, 2.1 ± 0.7 years; mean age of acquisition of English, 2.8 ± 0.9 years) and did not speak any other language. They used both languages on a daily basis, at home and at university. Bilingual participants rated their global proficiency in Welsh at 9.8 ± 0.5 and their proficiency in English at 8.9 ± 1.1 on a scale from 1 (very poor) to 10 (perfectly fluent). The rating was based on reading, writing, listening, and speaking skills.
Stimuli.
Stimuli were arrow displays featuring five horizontal arrows with the arrow in the center either matching the direction of the flanking arrows or not, thus making up four stimulus types: (1) all arrows directed to the left, (2) all arrows directed to the right, (3) all arrows but the center one directed to the left, and (4) all arrows but the center one directed to the right. Word stimuli were 270 high-frequency nouns in English and Welsh, with neutral affective valence (Bradley and Lang, 1999), distributed across three blocks of 90 flanker trials (arrow displays). The blocks differed in terms of the language(s) of the intervening words (i.e., English, Welsh, or mixed). In the monolingual blocks, all words were in English or Welsh. In the mixed-language block, half of the words were in English and the other half were in Welsh. Repeated-measures ANOVAs conducted on lexical frequency, word concreteness, and number of letters (Coltheart, 1981) revealed no significant differences between the three experimental blocks (all ps > 0.1). Word order was randomized within each block. We prepared 18 experimental stimulus sequences to randomize presentation between participants. Words susceptible of causing semantic interference with the flanker task (e.g., left, right, middle) were avoided.
Procedure.
Participants sat on a chair 1 m away from the screen in a quiet room. They signed a written consent form before taking part in the study that was approved by the ethics committee of Bangor University (Wales, UK). The flanker task was a simplified version of the test originally described by Fan et al. (2002): (1) flanker trials were intermixed with pseudorandom presentations of single words; and (2) all stimuli were presented at the center of the screen to avoid vertical eye movements. Each trial had the following sequence: (1) a fixation cross (+) appeared at the center of the screen for 500 ms; (2) a blank screen appeared for 200 ms; (3) the stimulus was presented until the participants responded or for a maximum duration of 1500 ms; and (4) a 2000 ms intertrial interval occurred. The relatively long intertrial interval reduced the eventuality of carryover effects. Participants were instructed to indicate whether the central arrow in flanker stimuli pointed to the left (left button press) or the right (right button press) with the corresponding index finger and to ignore intervening word presentations (i.e., not to make any response to them). Block order was fully counterbalanced between participants. Participants took a short break between experimental blocks. The language in which the filler words were presented was mentioned to the participants, who were also reminded that these words were irrelevant and should be ignored. After the experiment, participants were given a surprise questionnaire in which they had to identify words previously presented in the experiment intermixed with 270 new words matched in terms of lexical frequency (Coltheart, 1981). Participants were able to recognize >70% of the words (false alarms were marked negatively), and the error rate did not differ significantly between experimental blocks (ps > 0.1 in all pairwise comparisons).
ERP recording.
Electrophysiological data were recorded in reference to Cz at a rate of 1 kHz from 64 Ag/AgCl electrodes placed according to the extended 10–20 convention. Impedances were kept <5 kΩ. Electroencephalogram activity was filtered on-line bandpass between 0.1 and 200 Hz and refiltered off-line with a 25 Hz, low-pass, zero-phase shift digital filter. Eye blinks were mathematically corrected, and remaining artifacts were manually dismissed (Gratton et al., 1983). Continuous recordings were cut into epochs ranging from −100 to 1000 ms after the onset of each flanker trial. Signals exceeding ±75 μV in any given epoch were automatically discarded. There was a minimum of 60 valid epochs per condition in every subject. Baseline correction was performed in reference to prestimulus activity, and individual averages were digitally re-referenced to the global average reference. ERP data were collected simultaneously to behavioral data.
ERP data analysis.
The temporal window of the P300 effect was determined by means of pairwise millisecond-by-millisecond comparisons between the congruent and incongruent conditions considered significant when differences were above threshold (p < 0.05) for >30 ms over a minimum of nine clustered electrodes (Rugg et al., 1993; Thierry et al., 2003). As a result, the mean ERP amplitudes between 500 and 750 ms, in which the two conditions were significantly different in all three blocks, were subjected to a repeated-measures ANOVA with congruency (congruent/incongruent) and language context (English, Welsh, and mixed) as within-subject factors. Topographical analyses were based on mean amplitudes measured over 63 electrodes distributed over the entire scalp. Interactions involving the electrode factor were controlled using within-condition vector normalization (McCarthy and Wood, 1985). A Greenhouse-Geisser correction was performed where applicable. The P300 effect was maximal over the central parietal scalp (i.e.,C1, Cz, C2, P1, Pz, P2, CP1, CPz, and CP2).
Results
As expected, reaction times (F(1,17) = 23.22; p < 0.001) and error rates (F(1,17) = 32.46; p < 0.001) were reduced in congruent compared with incongruent trials (Fig. 2), showing a classic flanker interference effect (Eriksen and Eriksen, 1974). However, whereas language processing context did not significantly affect reaction times (F(2,34) = 0.34; p > 0.1) or error rates (F(2,34) = 2.47; p > 0.1), a significant interaction between experimental block (i.e., English, Welsh, and mixed) and congruency (i.e., congruent and incongruent) was found on error rates (F(2,34) = 5.20; p < 0.01). Post hoc analysis showed that error rates for incongruent trials were reduced in the mixed block compared with each of the two monolingual blocks (ps < 0.05), suggesting that conflict resolution was easier in the mixed-language context.
Consistent with the behavioral results, a main effect of congruency was found on ERP mean amplitude measured between 500 and 750 ms after stimulus onset (F(1,17) = 18.73; p < 0.001). Compared with congruent trials, incongruent trials elicited a greater positive-going waveform maximal over centroparietal regions, consistent with a P300 effect (Fig. 3), which previous studies have established as an index of inhibition control in the flanker paradigm (Polich, 2007; Neuhaus et al., 2010). Note that the P300 was expected in a relatively late time window since it was elicited by low probability events requiring explicit responses. Such a paradigm typically elicits a P3b with a distribution slightly more posterior and a latency slightly greater than that observed for the P3a.
Critically, a significant interaction between congruency and experimental block was found in this time window (F(2,34) = 3.71; p < 0.05; Fig. 4): ERP mean amplitude for incongruent trials was reduced in the mixed compared with the monolingual blocks (ps < 0.05), showing that participants suffered less interference from incongruent flankers in a mixed-language context. Moreover, millisecond-by-millisecond analyses of averaged ERP differences performed independently across groups showed that the congruency effect was of shorter duration in the mixed (500–757 ms) than in the monolingual (500–800+ ms) blocks.
Discussion
The behavioral and electrophysiological findings converge in showing that the mixed-language context, compared with the monolingual context, facilitates nonlinguistic conflict resolution. Since the same participants were tested in all conditions, the present findings cannot be accounted for by spurious differences between groups. To our knowledge, this is the first evidence of on-line interaction between two cognitive factors generally considered independent (i.e., language processing context and nonlinguistic executive interference), lending strong support to theories that relate enhancement in executive control to language abilities. However, the direction of this cross-domain interaction remains counterintuitive since the traditional bilingual advantage hypothesis assumes that controlling two languages in parallel is a mental burden, thus requiring more, not less, cognitive resources. The most parsimonious explanation for the present findings is that, since it is task irrelevant, the incidental processing of words from two languages did not directly compete for cognitive resources with nonverbal conflict resolution. Instead, the mixed-language context shifted the executive system to an enhanced functional level, thus improving the effectiveness of nonverbal conflict resolution. This explanation is in line with theories of reactive adjustment in human cognitive control. For example, studies have shown that the processing of conflict is enhanced when the brain is primed to a state of higher cognitive control by previous tasks engaging the same cognitive operations (Gratton et al., 1992; Botvinick et al., 1999; Kerns et al., 2004; Kerns, 2006). The present findings shed new light in this domain by demonstrating that executive function is modulated not only by explicit nonverbal conflict operations but also by implicit and irrelevant contextual language cues (i.e., a cross-domain interaction).
This account is also fully consistent with recent neuroimaging studies revealing patterns of brain activations that are considered specific to bilingual participants performing nonlinguistic conflict tasks (Bialystok et al., 2005; Garbin et al., 2010). Some of the between-group contrasts were found in brain regions such as the left caudate and dorsal anterior cingulate cortex, which have also been shown to be involved in bilingual language control (Crinion et al., 2006; Abutalebi et al., 2008, 2012). However, despite evidence of convergence in neuroanatomical representations, previous studies have failed to establish a functional link between language use and cognitive control. Here, we show that bilingual processing context activates a domain-general neural network underpinning both language processing and nonverbal executive function, thereby demonstrating an on-line interaction between bilingualism and enhancement in executive control.
One critical aspect of our findings is that performance was highly comparable in the Welsh (i.e., the native language) and the English (i.e., the second language) contexts, in the congruent and the incongruent conditions considered separately. This ensures that including incidental prime words did not have any confounding effect on flanker task execution per se but only established the desired language contexts, as verified by a postexperimental procedure (see Materials and Methods). Moreover, the bilingual context effect on enhancement in executive control was observed only in the case of incongruent stimuli, since error rates in incongruent trials and corresponding ERP amplitudes were reduced in the mixed block relative to the English and Welsh blocks, whereas congruent trials elicited no differences across the three blocks. This pattern of results suggests that a mixed-language context enhances the specific ability to inhibit irrelevant information when handling conflict but does not affect general cognitive functioning when conflict resolution is not required. In contrast with the current findings, between-subject comparisons have previously shown a relative advantage in bilinguals for processing both congruent and incongruent stimuli (Bialystok et al., 2004; Costa et al., 2008; Emmorey et al., 2008). In addition, the bilingual advantage is generally more pronounced for incongruent than congruent trials, leading to the usual observation of a reduced Simon effect. However, it is insufficient to conclude that bilinguals have greater executive control capacity than monolinguals because (1) the possibility remains that the reduced Simon effect results from a ceiling effect for congruent stimuli and (2) a generic advantage in executive control, as evidenced by facilitation in both congruent and incongruent trials, may be the consequence of various between-group differences. Therefore, results of the present within-subject study are more compatible with bilingualism bestowing cognitive benefits through enhancing executive mechanisms that are shared by bilingual lexical selection and nonverbal conflict resolution. However, it is important to note that the choice of tasks used to investigate executive control (i.e., the Simon and flanker tasks) may also contribute to the different results observed in the present and previous studies.
In a study comparing the performance of English monolinguals, early bilinguals, and bimodal bilinguals (who use both English and American Sign Language) in a flanker task, Emmorey et al. (2008) showed that bilinguals have enhanced executive function compared with monolinguals and, surprisingly, also bimodal bilinguals, who did not differ from the monolinguals. The critical difference between unimodal and bimodal bilinguals is that the latter individuals switch between languages less frequently than the former (i.e., they tend to either sign or speak in a given interaction). Moreover, spoken and sign languages involve distinct perceptual and motor coordinators and therefore require minimal inhibitory control even when bimodal bilinguals sign and speak at the same time. Therefore, a bilingual advantage is only observed when the two languages are often used in a mixed-language context rather than two independent single-language contexts. The effect of language context in the same bilingual individuals described in the present study is consistent with this interpretation.
The finding that manipulating language-processing context spontaneously changes nonverbal cognitive performance has implications beyond the bilingual advantage, in the domain of functional plasticity. Functional plasticity, the process by which new skills and life experiences modify brain structure and functions (Rakic, 2002; Pascual-Leone et al., 2005), is known to be prompted by juggling (Draganski et al., 2004), taxi driving (Maguire et al., 2000), playing video games (Green and Bavelier, 2008), musical training (Elbert et al., 1995), and using multiple languages (Mechelli et al., 2004; Luk et al., 2011; Krizman et al., 2012). However, such neuroplastic changes reflect the “end product” of what is usually a long-term experience or training. Therefore, between-group contrasts do not capture the specific immediate effect of a given manipulated variable but rather the combined effect of many correlated factors over time. Here, we show that manipulating the current processing context within the same individuals modulates a brain mechanism shaped by long-term experience even though the context is irrelevant regarding the ongoing nonverbal task.
Future studies will need to investigate how the phenomenon established here interacts with (1) language proficiency and (2) language nonselective lexical access. A recent study shows that even relatively balanced bilinguals inhibit words more efficiently in one language than the other, suggesting that inhibition capacity may be determined by proficiency or usage (Ng and Wicha, 2013). Therefore, language proficiency may not only affect the overall magnitude of the effect but also the effectiveness of executive control in a dual-language context. Bilinguals have been shown to activate both their languages when only one is presented (Thierry and Wu, 2007; Wu and Thierry, 2010). Observing greater executive control benefits in the mixed-language compared with the single-language context suggests that only physically presented mixed languages generate sufficient conflict to modulate executive function. This hypothesis may be tested by obtaining measurements of language control and executive control simultaneously and investigating potential correlations between the two domains.
Footnotes
G.T. was supported by the Economic and Social Research Council (Grant RES-000-23-0095), and Y.J.W. and G.T. were supported by the European Research Council (Grant ERC-StG-209704). We thank Ellen Bialystok, Angela Friederici, Judy Kroll, Marilyn Vihman, Janet Van Hell, and four anonymous reviewers for advice and comments on this manuscript.
The authors declare no competing financial interests.
- Correspondence should be addressed to Prof. Guillaume Thierry, School of Psychology, Bangor University, Bangor LL57 2AS, UK. g.thierry{at}bangor.ac.uk
This article is freely available online through the J Neurosci Author Open Choice option.