Constituent structure has long been established as a central feature of human language. Analogous to how syntax organizes words in sentences, a narrative grammar organizes sequential images into hierarchic constituents. Here we show that the brain draws upon this constituent structure to comprehend wordless visual narratives. We recorded neural responses as participants viewed sequences of visual images (comic strips) in which blank images either disrupted individual narrative constituents or fell at natural constituent boundaries. A disruption of either the first or the second narrative constituent produced a left-lateralized anterior negativity effect between 500 and 700 ms. Disruption of the second constituent also elicited a posteriorly distributed positivity (P600) effect. These neural responses are similar to those associated with structural violations in language and music. These findings provide evidence that comprehenders use a narrative structure to comprehend visual sequences and that the brain engages similar neurocognitive mechanisms to build structure across multiple domains.
Papers
2014
We used event-related potentials (ERPs) to investigate the neurocognitive mechanisms associated with processing light verb constructions such as “give a kiss”. These constructions consist of a semantically underspecified light verb (“give”) and an event nominal that contributes most of the meaning and also activates an argument structure of its own (“kiss”). This creates a mismatch between the syntactic constituents and the semantic roles of a sentence. Native speakers read German verb-final sentences that contained light verb constructions (e.g., “Julius gave Anne a kiss”), non-light constructions (e.g., “Julius gave Anne a rose”), and semantically anomalous constructions (e.g., *“Julius gave Anne a conversation”). ERPs were measured at the critical verb, which appeared after all its arguments. Compared to non-light constructions, the light verb constructions evoked a widely distributed, frontally focused, sustained negative-going effect between 500 and 900 ms after verb onset. We interpret this effect as reflecting working memory costs associated with complex semantic processes that establish a shared argument structure in the light verb constructions.
The verb “pounce” describes a single, near-instantaneous event. Yet, we easily understand that “For several minutes the cat pounced…” describes a situation in which multiple pounces occurred, although this interpretation is not overtly specified by the sentence's syntactic structure or by any of its individual words, a phenomenon known as “aspectual coercion.” Previous psycholinguistic studies have reported processing costs in association with aspectual coercion, but the neurocognitive mechanisms giving rise to these costs remain contentious. Additionally, there is some controversy about whether readers commit to a full interpretation of the event when the aspectual information becomes available, or whether they leave it temporarily underspecified until later in the sentence. Using ERPs, we addressed these questions in a design that fully crossed context type (punctive, durative, frequentative) with verb type (punctive, durative). We found a late, sustained negativity to punctive verbs in durative contexts, but not in frequentative (e.g., explicitly iterative) contexts. This effect was distinct from the N400 in both its time course and scalp distribution, suggesting that it reflected a different underlying neurocognitive mechanism. We also found that ERPs to durative verbs were unaffected by context type. Together, our results provide strong evidence that neural activity associated with aspectual coercion is driven by the engagement of a morphosyntactically unrealized semantic operator rather than by violations of real-world knowledge, more general shifts in event representation, or event iterativity itself. More generally, our results add to a growing body of evidence that a set of late-onset sustained negativities reflect elaborative semantic processing that goes beyond simply combining the meaning of individual words with syntactic structure to arrive at a final representation of meaning.
2013
A core property of human semantic processing is the rapid, facilitatory influence of prior input on extracting the meaning of what comes next, even under conditions of minimal awareness. Previous work has shown a number of neurophysiological indices of this facilitation, but the mapping between time course and localization (critical for separating automatic semantic facilitation from other mechanisms) has thus far been unclear. In the current study, we used a multimodal imaging approach to isolate early, bottom-up effects of context on semantic memory, acquiring a combination of electroencephalography (EEG), magnetoencephalography (MEG), and functional magnetic resonance imaging (fMRI) measurements in the same individuals with a masked semantic priming paradigm. Across techniques, the results provide a strikingly convergent picture of early automatic semantic facilitation. Event-related potentials demonstrated early sensitivity to semantic association between 300 and 500 ms; MEG localized the differential neural response within this time window to the left anterior temporal cortex, and fMRI localized the effect more precisely to the left anterior superior temporal gyrus, a region previously implicated in semantic associative processing. However, fMRI diverged from early EEG/MEG measures in revealing semantic enhancement effects within frontal and parietal regions, perhaps reflecting downstream attempts to consciously access the semantic features of the masked prime. Together, these results provide strong evidence that automatic associative semantic facilitation is realized as reduced activity within the left anterior superior temporal cortex between 300 and 500 ms after a word is presented, and emphasize the importance of multimodal neuroimaging approaches in distinguishing the contributions of multiple regions to semantic processing.
When a word is preceded by a supportive context such as a semantically associated word or a strongly constraining sentence frame, the N400 component of the ERP is reduced in amplitude. An ongoing debate is the degree to which this reduction reflects a passive spread of activation across long-term semantic memory representations as opposed to specific predictions about upcoming input. We addressed this question by embedding semantically associated prime-target pairs within an experimental context that encouraged prediction to a greater or lesser degree. The proportion of related items was used to manipulate the predictive validity of the prime for the target while holding semantic association constant. A semantic category probe detection task was used to encourage semantic processing and to preclude the need for a motor response on the trials of interest. A larger N400 reduction to associated targets was observed in the high than the low relatedness proportion condition, consistent with the hypothesis that predictions about upcoming stimuli make a substantial contribution to the N400 effect. We also observed an earlier priming effect (205-240 msec) in the high-proportion condition, which may reflect facilitation because of form-based prediction. In summary, the results suggest that predictability modulates N400 amplitude to a greater degree than the semantic content of the context.
Words that are semantically congruous with their preceding discourse context are easier to process than words that are semantically incongruous with their context. This facilitation of semantic processing is reflected by an attenuation of the N400 event-related potential (ERP). We asked whether this was true of emotional words in emotional contexts where discourse congruity was conferred through emotional valence. ERPs were measured as 24 participants read two-sentence scenarios with critical words that varied by emotion (pleasant, unpleasant, or neutral) and congruity (congruous or incongruous). Semantic predictability, constraint, and plausibility were comparable across the neutral and emotional scenarios. As expected, the N400 was smaller to neutral words that were semantically congruous (vs. incongruous) with their neutral discourse context. No such N400 congruity effect was observed on emotional words following emotional discourse contexts. Rather, the amplitude of the N400 was small to all emotional words (pleasant and unpleasant), regardless of whether their emotional valence was congruous with the valence of their emotional discourse context. However, consistent with previous studies, the emotional words produced a larger late positivity than did the neutral words. These data suggest that comprehenders bypassed deep semantic processing of valence-incongruous emotional words within the N400 time window, moving rapidly on to evaluate the words’ motivational significance.
2012
Just as syntax differentiates coherent sentences from scrambled word strings, the comprehension of sequential images must also use a cognitive system to distinguish coherent narrative sequences from random strings of images. We conducted experiments analogous to two classic studies of language processing to examine the contributions of narrative structure and semantic relatedness to processing sequential images. We compared four types of comic strips: (1) Normal sequences with both structure and meaning, (2) Semantic Only sequences (in which the panels were related to a common semantic theme, but had no narrative structure), (3) Structural Only sequences (narrative structure but no semantic relatedness), and (4) Scrambled sequences of randomly ordered panels. In Experiment 1, participants monitored for target panels in sequences presented panel-by-panel. Reaction times were slowest to panels in Scrambled sequences, intermediate in both Structural Only and Semantic Only sequences, and fastest in Normal sequences. This suggests that both semantic relatedness and narrative structure offer advantages to processing. Experiment 2 measured ERPs to all panels across the whole sequence. The N300/N400 was largest to panels in both the Scrambled and Structural Only sequences, intermediate in Semantic Only sequences and smallest in the Normal sequences. This implies that a combination of narrative structure and semantic relatedness can facilitate semantic processing of upcoming panels (as reflected by the N300/N400). Also, panels in the Scrambled sequences evoked a larger left-lateralized anterior negativity than panels in the Structural Only sequences. This localized effect was distinct from the N300/N400, and appeared despite the fact that these two sequence types were matched on local semantic relatedness between individual panels. These findings suggest that sequential image comprehension uses a narrative structure that may be independent of semantic relatedness. Altogether, we argue that the comprehension of visual narrative is guided by an interaction between structure and meaning.
We aimed to determine whether semantic relatedness between an incoming word and its preceding context can override expectations based on two types of stored knowledge: real-world knowledge about the specific events and states conveyed by a verb, and the verb’s broader selection restrictions on the animacy of its argument. We recorded event-related potentials on post-verbal Agent arguments as participants read and made plausibility judgments about passive English sentences. The N400 evoked by incoming animate Agent arguments that violated expectations based on real-world event/state knowledge was strongly attenuated when they were semantically related to the context. In contrast, semantic relatedness did not modulate the N400 evoked by inanimate Agent arguments that violated the preceding verb’s animacy selection restrictions. These findings suggest that, under these task and experimental conditions, semantic relatedness can facilitate processing of post-verbal animate arguments that violate specific expectations based on real-world event/state knowledge, but only when the semantic features of these arguments match the coarser-grained animacy restrictions of the verb. Animacy selection restriction violations also evoked a P600 effect, which was not modulated by semantic relatedness, suggesting that it was triggered by propositional impossibility. Together, these data indicate that the brain distinguishes between real-world event/state knowledge and animacy-based selection restrictions during online processing.
Active reading requires coordination between frequent eye movements (saccades) and short fixations in text. Yet, the impact of saccades on word processing remains unknown, as neuroimaging studies typically employ constant eye fixation. Here we investigate eye-movement effects on word recognition processes in healthy human subjects using anatomically constrained magnetoencephalography, psychophysical measurements, and saccade detection in real time. Word recognition was slower and brain responses were reduced to words presented early versus late after saccades, suggesting an overall transient impairment of word processing after eye movements. Response reductions occurred early in visual cortices and later in language regions, where they colocalized with repetition priming effects. Qualitatively similar effects occurred when words appeared early versus late after background movement that mimicked saccades, suggesting that retinal motion contributes to postsaccadic inhibition. Further, differences in postsaccadic and background-movement effects suggest that central mechanisms also contribute to postsaccadic modulation. Together, these results suggest a complex interplay between visual and central saccadic mechanisms during reading.
