Giovanni M. Di Liberto, James A. O’Sullivan, Edmund C. Lalor 

Slides:



Advertisements
Similar presentations
Thomas Andrillon, Sid Kouider, Trevor Agus, Daniel Pressnitzer 
Advertisements

Attention Narrows Position Tuning of Population Responses in V1
Joshua J. Foster, Emma M. Bsales, Russell J. Jaffe, Edward Awh 
Volume 26, Issue 16, Pages (August 2016)
Volume 77, Issue 5, Pages (March 2013)
Individual Differences Reveal the Basis of Consonance
Elise A. Piazza, Marius Cătălin Iordan, Casey Lew-Williams 
Decision Making during the Psychological Refractory Period
Volume 58, Issue 3, Pages (May 2008)
Rhythmic Working Memory Activation in the Human Hippocampus
Perceptual Echoes at 10 Hz in the Human Brain
Somatosensory Precision in Speech Production
Walking Modulates Speed Sensitivity in Drosophila Motion Vision
Thomas Andrillon, Sid Kouider, Trevor Agus, Daniel Pressnitzer 
Volume 25, Issue 15, Pages (August 2015)
Huan Luo, Xing Tian, Kun Song, Ke Zhou, David Poeppel  Current Biology 
Nori Jacoby, Josh H. McDermott  Current Biology 
Volume 66, Issue 6, Pages (June 2010)
Volume 27, Issue 20, Pages e3 (October 2017)
Volume 76, Issue 5, Pages (December 2012)
Volume 25, Issue 5, Pages (March 2015)
Jason Samaha, Bradley R. Postle  Current Biology 
Differential Impact of Behavioral Relevance on Quantity Coding in Primate Frontal and Parietal Neurons  Pooja Viswanathan, Andreas Nieder  Current Biology 
Roman F. Loonis, Scott L. Brincat, Evan G. Antzoulatos, Earl K. Miller 
Face Pareidolia in the Rhesus Monkey
Volume 27, Issue 24, Pages e3 (December 2017)
Aryeh Hai Taub, Rita Perets, Eilat Kahana, Rony Paz  Neuron 
Tobias Staudigl, Simon Hanslmayr  Current Biology 
Volume 80, Issue 2, Pages (October 2013)
Disruption of Perceptual Learning by a Brief Practice Break
Volume 28, Issue 2, Pages e5 (January 2018)
Volume 28, Issue 5, Pages e3 (March 2018)
Volume 54, Issue 6, Pages (June 2007)
Single-Unit Responses Selective for Whole Faces in the Human Amygdala
Mathilde Bonnefond, Ole Jensen  Current Biology 
Marian Stewart Bartlett, Gwen C. Littlewort, Mark G. Frank, Kang Lee 
Volume 22, Issue 18, Pages (September 2012)
Walking Modulates Speed Sensitivity in Drosophila Motion Vision
BOLD fMRI Correlation Reflects Frequency-Specific Neuronal Correlation
Volume 19, Issue 6, Pages (March 2009)
Volume 28, Issue 9, Pages e4 (May 2018)
Near-Real-Time Feature-Selective Modulations in Human Cortex
Volume 80, Issue 4, Pages (November 2013)
Amy E. Skerry, Rebecca Saxe  Current Biology 
Volume 26, Issue 12, Pages (June 2016)
Attentive Tracking of Sound Sources
Volume 25, Issue 5, Pages (March 2015)
Direct Two-Dimensional Access to the Spatial Location of Covert Attention in Macaque Prefrontal Cortex  Elaine Astrand, Claire Wardak, Pierre Baraduc,
Robust Selectivity to Two-Object Images in Human Visual Cortex
Volume 64, Issue 3, Pages (November 2009)
Towards Large-Scale, Human-Based, Mesoscopic Neurotechnologies
Attention Reorients Periodically
Volume 18, Issue 5, Pages (March 2008)
Event Boundaries Trigger Rapid Memory Reinstatement of the Prior Events to Promote Their Representation in Long-Term Memory  Ignasi Sols, Sarah DuBrow,
Volume 23, Issue 11, Pages (June 2013)
Encoding of Stimulus Probability in Macaque Inferior Temporal Cortex
PER3 Polymorphism Predicts Sleep Structure and Waking Performance
Volume 58, Issue 1, Pages (April 2008)
When Correlation Implies Causation in Multisensory Integration
Volume 17, Issue 15, Pages (August 2007)
Hippocampal-Prefrontal Theta Oscillations Support Memory Integration
Søren K. Andersen, Steven A. Hillyard, Matthias M. Müller 
Supratim Ray, John H.R. Maunsell  Neuron 
Simon Hanslmayr, Jonas Matuschek, Marie-Christin Fellner 
Volume 26, Issue 16, Pages (August 2016)
Christoph Daube, Robin A.A. Ince, Joachim Gross  Current Biology 
Jacqueline R. Hembrook-Short, Vanessa L. Mock, Farran Briggs 
Nori Jacoby, Josh H. McDermott  Current Biology 
A Temporal Channel for Information in Sparse Sensory Coding
Towards Large-Scale, Human-Based, Mesoscopic Neurotechnologies
Presentation transcript:

Low-Frequency Cortical Entrainment to Speech Reflects Phoneme-Level Processing  Giovanni M. Di Liberto, James A. O’Sullivan, Edmund C. Lalor  Current Biology  Volume 25, Issue 19, Pages 2457-2465 (October 2015) DOI: 10.1016/j.cub.2015.08.030 Copyright © 2015 Elsevier Ltd Terms and Conditions

Figure 1 Assessing the Encoding of Speech Features in EEG 128-channel EEG data were recorded while subjects listened to continuous, natural speech consisting of a male speaker reading from a novel or its time-reversed complement. Linear regression was used to fit multivariate temporal response functions (mTRFs) between the low-frequency (1–15 Hz) EEG data and five different representations of the speech stimulus. Each mTRF model was then tested for its ability to predict EEG using leave-one-out cross-validation. Current Biology 2015 25, 2457-2465DOI: (10.1016/j.cub.2015.08.030) Copyright © 2015 Elsevier Ltd Terms and Conditions

Figure 2 EEG Responses to Forward Speech, but Not Time-Reversed Speech, Are Best Predicted When Speech Is Represented as a Combination of Spectrotemporal Features and Phonetic-Feature Labels (A) Grand-average EEG prediction correlations (Pearson’s r) for each speech model (mean ± SEM). While there is no statistical difference in prediction performance between the spectrogram (Sgram), phonetic-features (Fea), and phonemic (Ph) models (p > 0.05), all of these models are better predictors of the EEG than that based on the envelope (Env; ▲p < 0.01). Importantly, the model based on the combination of phonetic features and spectrogram (FS) outperforms all other models (∗p < 0.01). (B) Correlation values between recorded EEG and that predicted by each mTRF model for individual subjects. The subjects are sorted according to the prediction correlations of the FS model. Although the results show variability across subjects, the FS model outperforms all the other models for every subject. The model based on the speech envelope (Env) performs worse than every other model for every subject. (C) Grand-average EEG prediction correlations for the time-reversed speech condition (mean ± SEM). Prediction correlations using the model based on the envelope (Env) are lower than those for all other models (▲p < 0.05). As with normal speech, there is no statistical difference in prediction performance between the spectrogram (Sgram), phonetic-features (Fea), and phonemic (Ph) models (p > 0.05). Importantly in this case, there is also no difference between the performance of those models and that based on the combination of phonetic features and spectrogram (FS; p > 0.05). (D) Correlation values between recorded EEG and that predicted by each mTRF model for individual subjects for time-reversed speech. The subjects are sorted according to the prediction correlations of the FS model. The model based on the speech envelope (Env) performs worse than every other model for every subject. Current Biology 2015 25, 2457-2465DOI: (10.1016/j.cub.2015.08.030) Copyright © 2015 Elsevier Ltd Terms and Conditions

Figure 3 EEG Response Prediction for Different EEG Frequency Bands Grand-average EEG prediction correlations (Pearson’s r) for each speech model (mean ± SEM) for delta (A), theta (B), alpha (C), beta (D), and low-gamma (E) EEG frequencies. ∗ indicates prediction differences at the level of p < 0.05 and ∗∗ at the level of p < 0.01, both using planned paired t tests. Current Biology 2015 25, 2457-2465DOI: (10.1016/j.cub.2015.08.030) Copyright © 2015 Elsevier Ltd Terms and Conditions

Figure 4 mTRF Models for Natural Speech Reflect Sensitivity to Different Speech Features mTRFs plotted for envelope (Env, A), spectrogram (Sgram, B), phonemic (Ph, C), and phonetic-features (Fea, D) models at peri-stimulus time lags from −100 to 400 ms for natural speech, averaged over 12 frontotemporal electrodes (see Figure S1). The phonemes were sorted based on a hierarchical clustering analysis on the average mTRF after grouping them into vowels, diphthongs, semi-vowels, and consonants. Horizontal dashed lines separate distinct categories of phonemes and phonetic features. Current Biology 2015 25, 2457-2465DOI: (10.1016/j.cub.2015.08.030) Copyright © 2015 Elsevier Ltd Terms and Conditions

Figure 5 Sensitivity of EEG to Speech Features Increases with Response Latency Multidimensional scaling (MDS) on the phonetic-features and phonemic mTRFs as a function of peri-stimulus time lag. By carrying out repeated k-means classification, we derive F-score measures that represent the discriminability of our mTRFs in each of the three time intervals 50–100 ms, 100–150 ms, and 150–200 ms, which correspond approximately to the three main peaks and troughs of the phonemic mTRF (Figure 3C). (A) MDS on the phonetic-features mTRF. The F-scores indicate the differential sensitivity of responses to manner of articulation, voicing, backness of a vowel, and place of articulation features. These F-scores show significant increase with response latency. (B) MDS on the phoneme mTRF. The F-scores are a measure of the binary classification of responses to consonants and non-consonants (vowels, diphthongs, and semi-vowels), which, again, significantly increase with latency. Although the classification performed was binary, the distinction between the four main classes of phonemes is evident. (C) MDS on the phoneme mTRF. Here the F-scores are computed for the four classes: plosive, liquid & glide, nasal, and fricative & affricate. The four categories are progressively more separable across the three time intervals (jackknife method, p < 0.0005). Current Biology 2015 25, 2457-2465DOI: (10.1016/j.cub.2015.08.030) Copyright © 2015 Elsevier Ltd Terms and Conditions