Top-Down Predictions of Familiarity and Congruency in Audio-Visual Speech Perception at Neural Level
Abstract
During speech perception, listeners rely on multimodal input and make use of both auditory and visual information. When listeners are presented with speech, for example syllables, the differences in brain responses to distinct stimuli are not caused merely by the acoustic or visual features of the stimuli: the congruency of the auditory and visual information, and the familiarity of a syllable, that is, whether it appears in the listener's native language or not, also modulate brain responses. We investigated how the congruency and familiarity of the presented stimuli affect brain responses to audio-visual (AV) speech in 12 adult native Finnish speakers and 12 adult native Chinese speakers. During a magnetoencephalography (MEG) measurement, participants watched videos of a Chinese speaker pronouncing syllables (/pa/, /pha/, /ta/, /tha/, /fa/); of these, only /pa/ and /ta/ are part of Finnish phonology, while all five are part of Chinese phonology. The stimuli were presented in audio-visual (congruent or incongruent), audio-only, or visual-only conditions. The brain responses were examined in five time-windows: 75-125, 150-200, 200-300, 300-400, and 400-600 ms. We found significant differences for the congruency comparison in the fourth time-window (300-400 ms) in both sensor- and source-level analyses, with larger responses for incongruent than for congruent stimuli. No significant differences were found for the familiarity comparisons. The results are in line with earlier studies reporting modulation of brain responses by audio-visual congruency around 250-500 ms. This suggests a much stronger process for the general detection of a mismatch between predictions based on lip movements and the auditory signal than for the top-down modulation of brain responses based on phonological information.
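No analysis code accompanies this record, but a minimal sketch of the time-window averaging described in the abstract could look as follows, assuming the open-source MNE-Python library; the file name, event codes, and condition labels are hypothetical and not taken from the study.

```python
# Minimal sketch (not the authors' pipeline): mean MEG evoked response
# within the five time-windows reported in the abstract, per condition.
import mne

# The five analysis windows from the abstract, in seconds.
TIME_WINDOWS = [(0.075, 0.125), (0.150, 0.200), (0.200, 0.300),
                (0.300, 0.400), (0.400, 0.600)]

# Hypothetical recording file and trigger codes.
raw = mne.io.read_raw_fif("subject01_raw.fif", preload=True)
events = mne.find_events(raw)
event_id = {"AV/congruent": 1, "AV/incongruent": 2}

# Epoch around stimulus onset and baseline-correct.
epochs = mne.Epochs(raw, events, event_id, tmin=-0.1, tmax=0.7,
                    baseline=(None, 0), preload=True)

for condition in event_id:
    evoked = epochs[condition].average()
    for tmin, tmax in TIME_WINDOWS:
        # Mean amplitude across gradiometers within the window.
        window = evoked.copy().crop(tmin, tmax).pick("grad")
        print(f"{condition} {int(tmin * 1000)}-{int(tmax * 1000)} ms: "
              f"{window.data.mean():.3e} T/m")
```

In practice, window means such as these would then be compared between conditions (e.g., congruent vs. incongruent) with an appropriate statistical test; the sketch only shows the windowing step.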
Main Authors
Kolozsvári, O. B.; Xu, W.; Leppänen, P. H. T.; Hämäläinen, J. A.
Format
Articles
Research article
Published
2019
Publisher
Frontiers Media
The permanent address of the publication
https://urn.fi/URN:NBN:fi:jyu-201907293693 (use this for linking)
Review status
Peer reviewed
ISSN
1662-5161
DOI
https://doi.org/10.3389/fnhum.2019.00243
Language
English
Published in
Frontiers in Human Neuroscience
Citation
- Kolozsvári, O. B., Xu, W., Leppänen, P. H. T., & Hämäläinen, J. A. (2019). Top-Down Predictions of Familiarity and Congruency in Audio-Visual Speech Perception at Neural Level. Frontiers in Human Neuroscience, 13, Article 243. https://doi.org/10.3389/fnhum.2019.00243
Funder(s)
European Commission
Research Council of Finland
Funding program(s)
MSCA Marie Skłodowska-Curie Actions, H2020
Research profiles, AoF
Additional information about funding
This work was supported by the European Union projects ChildBrain (Marie Curie Innovative Training Network, #641652) and Predictable (Marie Curie Innovative Training Network, #641858), and by the Academy of Finland (MultiLeTe, #292466).
Copyright © The Authors, 2019.