bioRxiv (Cold Spring Harbor Laboratory), Year: 2023, Issue: unknown, Published: Dec. 23, 2023
Abstract
The comprehension of phonemes is a fundamental component of speech processing which relies on both temporal fine structure (TFS) and temporal envelope (TE) coding. EEG amplitude in response to phonemes has been identified as an indicator of speech performance in hearing aid users. Presbyacusis may also alter neuro-electric responses to phonemes, even with minimally or unaffected hearing thresholds. Elevated speech reception thresholds (SRT) in the absence of pure-tone threshold (PTT) elevation suggest central processing deficits. We therefore collected audiometric data (PTT, SRT) and EEG during passive listening in 80 subjects, ranging in age from 18 to 76 years. We confirm phoneme-evoked EEG amplitude (PEA) as an indicator of speech comprehension. Specifically, PEA decreased with elevated SRT, PTT and increased age. As a novel observation, we report the phoneme-evoked EEG delay (PED) to increase with age and PTT. The absolute duration of PED, its age-correlation, and the lack of PEA-lateralization, combined with the frequency of the phoneme stimuli used here, suggest a predominantly thalamic generator of phoneme-evoked EEG responses. Hearing loss in the extended high-frequencies affects PED more than PEA. In our sample, neural compensation for [...] came at the cost of [...] speed. Most importantly, PED correlates with SRT and explains SRT-variance in quiet and in ipsilateral noise that PTT cannot. [...] was a better predictor of TFS coding [...] TE [...] noise. [...] reflects both TFS and TE coding, [...] thalamic activity may provide integrated information at the gate of neocortex.
Significance Statement
Intact speech comprehension is essential for social participation, which protects against depression and dementia. Age-related hearing loss is a growing problem in aging societies, as hearing deficits constitute the third most important modifiable risk factor for cognitive decline. This work uses electrical brain responses to phonemes in a cohort covering [...]. [...] showed significant correlations with high-frequency thresholds, [...] demonstrated [...] speed [...] seems [...] observed signals likely originate from thalamus, which receives feedback from neocortex and is embedded in speech processing. Developing objective markers for speech processing is key for ensuring cognitive fitness in aging.
PLoS ONE, Year: 2024, Issue: 19(2), p. e0297826, Published: Feb. 8, 2024
Perception of sounds and speech involves structures in the auditory brainstem that rapidly process ongoing auditory stimuli. The role of these structures in speech processing can be investigated by measuring their electrical activity using scalp-mounted electrodes. However, typical analysis methods involve averaging neural responses to many short repetitive stimuli that bear little relevance to daily listening environments. Recently, subcortical responses to more ecologically relevant continuous speech were detected using linear encoding models. These methods estimate the temporal response function (TRF), which is a regression model that minimises the error between the measured neural signal and a predictor derived from the stimulus. Using predictors that model the highly non-linear peripheral auditory system may improve linear TRF estimation accuracy and peak detection. Here, we compare predictors from both simple and complex peripheral auditory models for estimating brainstem TRFs on electroencephalography (EEG) data from 24 participants listening to continuous speech. We also investigate the data length required for estimating subcortical TRFs, and find that around 12 minutes of data is sufficient for clear wave V peaks (>3 dB SNR) to be seen in nearly all participants. Interestingly, predictors derived from simple filterbank-based models of the peripheral auditory system yield TRF wave V peak SNRs that are not significantly different from those estimated using a complex model of the auditory nerve, provided that the nonlinear effects of adaptation in the auditory system are appropriately modelled. Crucially, computing predictors from these simpler models is more than 50 times faster compared to the complex model. This work paves the way for efficient modelling and detection of subcortical processing of continuous speech, which may lead to improved diagnosis metrics for hearing impairment and assistive technology.
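The abstract above describes the TRF as a regression model that minimises the error between the measured signal and a stimulus-derived predictor. A minimal sketch of that idea follows, assuming a single-channel signal, a one-dimensional predictor, and ridge-regularised least squares over time-lagged copies of the predictor. The function names (`lagged_design`, `estimate_trf`), the ridge value, and the synthetic data are illustrative assumptions; this is not the peripheral-model pipeline compared in the paper.

```python
import numpy as np

def lagged_design(predictor, n_lags):
    """Build an (n_samples, n_lags) matrix whose column k is the predictor delayed by k samples."""
    n = len(predictor)
    X = np.zeros((n, n_lags))
    for lag in range(n_lags):
        X[lag:, lag] = predictor[:n - lag]
    return X

def estimate_trf(predictor, signal, n_lags, ridge=1.0):
    """Ridge regression: w = (X'X + ridge * I)^-1 X'y, one weight per lag."""
    X = lagged_design(predictor, n_lags)
    gram = X.T @ X + ridge * np.eye(n_lags)
    return np.linalg.solve(gram, X.T @ signal)

# Synthetic check (hypothetical data): the "response" is the predictor
# delayed by 7 samples plus noise, so the estimated TRF should peak at lag 7.
rng = np.random.default_rng(0)
n_samples = 20000
predictor = rng.standard_normal(n_samples)
true_trf = np.zeros(30)
true_trf[7] = 1.0
signal = np.convolve(predictor, true_trf)[:n_samples]
signal = signal + 0.5 * rng.standard_normal(n_samples)

trf = estimate_trf(predictor, signal, n_lags=30)
peak_lag = int(np.argmax(trf))
print("estimated peak lag (samples):", peak_lag)
```

In practice the predictor would come from a model of the auditory periphery and the lag window would be chosen to cover the latency range of interest; both choices are outside the scope of this sketch.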
Journal of Cognitive Neuroscience, Year: 2024, Issue: 36(3), pp. 475-491, Published: Jan. 1, 2024
Abstract
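Most parts of speech are voiced, exhibiting a degree of periodicity with a fundamental frequency and many higher harmonics. Some neural populations respond to this temporal fine structure, in particular at the fundamental frequency. This frequency-following response to speech consists of both subcortical and cortical contributions and can be measured through EEG as well as through magnetoencephalography (MEG), although both differ in the aspects of neural activity that they capture: EEG is sensitive to both radial and tangential sources as well as to deep sources, whereas MEG is more restrained to the measurement of tangential and superficial neural activity. EEG responses to continuous speech have shown an early subcortical contribution, at a latency of around 9 msec, in agreement with MEG measurements in response to short speech tokens, whereas MEG responses to continuous speech have not yet revealed such an early component. Here, we analyze MEG responses to long segments of continuous speech. We find an early subcortical response at latencies of 4–11 msec, followed by later right-lateralized cortical activities at delays of 20–58 msec as well as potential subcortical activities. Our results show that the early subcortical component of the FFR to continuous speech can be measured from MEG in populations of participants and that its latency agrees with that measured with EEG. They furthermore show that the early subcortical component is temporally well separated from later cortical contributions, enabling an independent assessment of both components toward further aspects of speech processing.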
Journal of Neuroscience, Year: 2023, Issue: 43(44), pp. 7429-7440, Published: Oct. 4, 2023
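Selective attention to one of several competing speakers is required for comprehending a target speaker among other voices and for successful communication with them. It moreover has been found to involve the neural tracking of low-frequency speech rhythms in the auditory cortex. Effects of selective attention have also been found in subcortical neural activities, in particular regarding the frequency-following response related to the fundamental frequency of speech (speech-FFR). Recent investigations have, however, shown that the speech-FFR contains cortical contributions as well. It remains unclear whether these are also modulated by selective attention. Here we used magnetoencephalography to assess the attentional modulation of the cortical contributions to the speech-FFR. We presented both male and female participants with two competing speech signals and analyzed the cortical responses during attentional switching between the two speakers. Our findings revealed a robust attentional modulation of the cortical contribution to the speech-FFR: the neural responses were higher when the speaker was attended than when they were ignored. We also found that, regardless of attention, a voice with a lower fundamental frequency elicited a larger cortical contribution to the speech-FFR than a voice with a higher fundamental frequency. Our results show that the attentional modulation of the speech-FFR does not only occur subcortically but extends to the auditory cortex as well.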
IEEE Open Journal of Signal Processing, Year: 2024, Issue: 5, pp. 700-716, Published: Jan. 1, 2024
The electroencephalogram (EEG) offers a non-invasive means by which a listener's auditory system may be monitored during continuous speech perception. Reliable auditory-EEG decoders could facilitate the objective diagnosis of hearing disorders, or find applications in cognitively-steered hearing aids. Previously, we developed decoders for the ICASSP Auditory EEG Signal Processing Grand Challenge (SPGC). These decoders aimed to solve the match-mismatch task: given a short temporal segment of EEG recordings, and two candidate speech segments, the task is to identify which of the two speech segments is temporally aligned, or matched, with the EEG segment. The decoders made use of cortical responses to the speech envelope, as well as speech-related frequency-following responses, to relate the EEG recordings to the speech stimuli. Here we comprehensively document the methods by which the decoders were developed. We extend our previous analysis by exploring the association between speaker characteristics (pitch and sex) and classification accuracy, and provide a full statistical analysis of the final performance of the decoders as evaluated on a heldout portion of the dataset. Finally, the generalisation capabilities of the decoders are characterised, by evaluating them using an entirely different dataset which contains EEG recorded under a variety of speech-listening conditions. The results show that the decoders achieve accurate and robust classification accuracies, and they can even serve as auditory attention decoders without additional training.
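The match-mismatch task described above can be illustrated with a much simpler baseline than the decoders the abstract refers to. The sketch below assumes that some speech-related feature has already been derived from the EEG segment (for example a reconstructed envelope) and simply picks the candidate speech segment that correlates more strongly with it. The function names and the synthetic data are hypothetical, and plain Pearson correlation is swapped in here for whatever decision rule the original decoders use.

```python
import math
import random

def pearson(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)

def match_mismatch(eeg_feature, candidate_a, candidate_b):
    """Return 0 if candidate_a is judged to be the matched segment, else 1."""
    score_a = pearson(eeg_feature, candidate_a)
    score_b = pearson(eeg_feature, candidate_b)
    return 0 if score_a >= score_b else 1

# Synthetic example (hypothetical data): the EEG-derived feature is a
# noisy copy of the matched segment and is unrelated to the mismatched one.
random.seed(0)
n_samples = 500
matched = [random.gauss(0.0, 1.0) for _ in range(n_samples)]
mismatched = [random.gauss(0.0, 1.0) for _ in range(n_samples)]
eeg_feature = [m + random.gauss(0.0, 1.0) for m in matched]

choice = match_mismatch(eeg_feature, mismatched, matched)
print("index of segment judged as matched:", choice)
```

Because the decision depends only on which candidate scores higher, swapping the order of the two candidates should flip the returned index, which is a quick sanity check for any such classifier.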
Frontiers in Neuroscience, Year: 2023, Issue: 17, Published: Dec. 14, 2023
Auditory cortical responses to speech obtained by magnetoencephalography (MEG) show robust speech tracking to the speaker's fundamental frequency in the high-gamma band (70-200 Hz), but little is currently known about whether such responses depend on the focus of selective attention. In this study 22 human subjects listened to concurrent, fixed-rate, speech from male and female speakers, and were asked to selectively attend to one speaker at a time, while their neural responses were recorded with MEG. The male speaker's pitch range coincided with the lower range of the high-gamma band, whereas the female speaker's higher pitch range had much less overlap, and only at the upper end of the high-gamma band. Neural responses were analyzed using the temporal response function (TRF) framework. As expected, the responses demonstrate robust speech tracking of the fundamental frequency in the high-gamma band, but only to the male's speech, with a peak latency of ~40 ms. Critically, the response magnitude depends on selective attention: the response to the male speech is significantly greater when male speech is attended than when it is not attended, under acoustically identical conditions. This is a clear demonstration that even very early cortical auditory responses are influenced by top-down, cognitive, neural processing mechanisms.
Frontiers in Human Neuroscience, Year: 2023, Issue: 16, Published: Jan. 20, 2023
In many experiments that investigate auditory and speech processing in the brain using electroencephalography (EEG), the experimental paradigm is often lengthy and tedious. Typically, the experimenter errs on the side of including more data, more trials, and therefore conducting a longer task to ensure that the data are robust and effects are measurable. Recent studies used naturalistic stimuli to investigate the brain's response to individual or a combination of multiple speech features using system identification techniques, such as multivariate temporal receptive field (mTRF) analyses. The neural data collected from such experiments must be divided into a training set and a test set to fit and validate the mTRF weights. While a good strategy is clearly to collect as much data as is feasible, it is unclear how much data is needed to achieve stable results. Furthermore, it is unclear whether the specific stimulus used for mTRF fitting and the choice of feature representation affects how much data would be required for robust and generalizable results. Here, we used previously collected EEG data from our lab using sentence stimuli and movie stimuli as well as an open-source dataset using audiobook stimuli to better understand how much data needs to be collected for naturalistic speech experiments measuring acoustic and phonetic tuning. We found that the EEG receptive field structure tested here stabilizes after collecting a training dataset of approximately 200 s of TIMIT sentences, around 600 s of movie trailers, and approximately 460 s of audiobook data. Thus, we provide suggestions on the minimum amount of data that would be necessary for fitting mTRFs from naturalistic listening data. Our findings are motivated by highly practical concerns when working with children, patient populations, or others who may not tolerate long study sessions. These findings will aid future researchers who wish to study naturalistic speech processing in healthy and clinical populations while minimizing participant fatigue and retaining signal quality.
bioRxiv (Cold Spring Harbor Laboratory), Year: 2023, Issue: unknown, Published: June 26, 2023
Abstract
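Most parts of speech are voiced, exhibiting a degree of periodicity with a fundamental frequency and many higher harmonics. Some neural populations respond to this temporal fine structure, in particular at the fundamental frequency. This frequency-following response to speech (speech-FFR) consists of both subcortical and cortical contributions and can be measured through electroencephalography (EEG) as well as through magnetoencephalography (MEG), although both differ in the aspects of neural activity that they capture: EEG is sensitive to both radial and tangential sources as well as to deep sources, while MEG is more restrained to the measurement of tangential and superficial neural activity. EEG responses to continuous speech have shown an early subcortical contribution, at a latency of around 9 ms, in agreement with MEG measurements in response to short speech tokens, whereas MEG responses to continuous speech have not yet revealed such an early component. Here we analyze MEG responses to long segments of continuous speech. We find an early subcortical response at a latency of 9 ms, followed by later right-lateralized cortical activities at delays of 20 - 57 ms as well as potential subcortical activities. Our results show that the early subcortical component of the FFR to continuous speech can be measured from MEG, and that its latency agrees with that measured with EEG. They furthermore show that the early subcortical component is temporally well separated from later cortical contributions, enabling an independent assessment of both components towards further aspects of speech processing.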
bioRxiv (Cold Spring Harbor Laboratory), Year: 2023, Issue: unknown, Published: Mar. 1, 2023
Abstract
Perception of sounds and speech involves structures in the auditory brainstem that rapidly process ongoing auditory stimuli. The role of these structures in speech processing can be investigated by measuring their electrical activity using scalp-mounted electrodes. However, typical analysis methods involve averaging neural responses to many short repetitive stimuli that bear little relevance to daily listening environments. Recently, subcortical responses to more ecologically relevant continuous speech were detected using linear encoding models. These methods estimate the temporal response function (TRF), which is a regression model that minimises the error between the measured neural signal and a predictor derived from the stimulus. Using predictors that model the highly non-linear peripheral auditory system may improve linear TRF estimation accuracy and peak detection. Here, we compare predictors from both simple and complex peripheral auditory models for estimating brainstem TRFs on electroencephalography (EEG) data from 24 participants listening to continuous speech. We also discuss the data length required for estimating subcortical TRFs with clear peaks. Interestingly, predictors from simple models resulted in TRFs that were similar to those estimated using complex models, and were much faster to compute. This work paves the way for efficient modelling and detection of subcortical processing of continuous speech, which may lead to improved diagnosis metrics for hearing impairment and assistive technology.
bioRxiv (Cold Spring Harbor Laboratory), Year: 2023, Issue: unknown, Published: July 4, 2023
Abstract
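Selective attention to one of several competing speakers is required for comprehending a target speaker amongst other voices and for successful communication with them. Selective attention has been found to involve the neural tracking of low-frequency speech rhythms in the auditory cortex. Effects of selective attention have also been found in subcortical neural activities, in particular regarding the high-frequency neural response at the fundamental frequency of speech, the speech-FFR. Recent investigations have, however, shown that the speech-FFR contains cortical contributions as well. It remains unclear whether these are also modulated by selective attention. Here we employed magnetoencephalography (MEG) to assess the attentional modulation of the cortical contributions to the speech-FFR. We presented participants with two competing speech signals and analyzed the cortical responses during attentional switching between the two speakers. Our findings revealed a robust attentional modulation of the cortical contribution to the speech-FFR: the neural responses were higher when the speaker was attended than when they were ignored. We also found that, regardless of attention, a voice with a lower fundamental frequency elicited a larger cortical contribution to the speech-FFR than a voice with a higher fundamental frequency. Our results show that the attentional modulation of the speech-FFR does not only occur subcortically but extends to the auditory cortex as well.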