bioRxiv (Cold Spring Harbor Laboratory),
Journal year: 2022,
Issue: unknown
Published: June 9, 2022
Abstract
In recent years, temporal response function (TRF) analyses of non-invasive recordings of neural activity evoked by continuous naturalistic stimuli have become increasingly popular for characterizing response properties within the auditory hierarchy. However, despite this rise in TRF usage, relatively few educational resources for these tools exist. Here we use a dual-talker continuous speech paradigm to demonstrate how a key parameter of experimental design, the quantity of acquired data, influences TRF analyses fit to either individual data (subject-specific analyses), or group data (generic analyses). We show that although model performance monotonically increases with data quantity, the amount of data required to achieve significant prediction accuracies can vary substantially based on whether the fitted model contains densely (e.g., acoustic envelope) or sparsely (e.g., lexical surprisal) spaced features, especially when the goal of the analyses is to capture the aspect of neural responses uniquely explained by specific features. Moreover, we demonstrate that generic models can exhibit high performance on small amounts of test data (2-8 min), if they are trained on a sufficiently large data set. As such, they may be particularly useful for clinical and multi-task study designs with limited recording time. Finally, we show that the regularization procedure used in fitting TRF models can interact with the quantity of data used to fit the models, with larger training quantities resulting in systematically larger TRF amplitudes. Together, demonstrations in this work should aid the learning process of new users of TRF analyses, and in combination with other tools, such as piloting and power analyses, may serve as a detailed reference for choosing acquisition duration for future studies.
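As a rough illustration of the kind of model this abstract discusses, the sketch below fits a single-feature TRF by ridge regression on a time-lagged design matrix and scores it by prediction accuracy on held-out data. It is a minimal sketch on synthetic data, not the authors' pipeline: the function names (`lagged_design`, `fit_trf`, `predict`), the toy response kernel, the noise level, and the regularization values are all assumptions chosen for illustration.

```python
import numpy as np

def lagged_design(stimulus, n_lags):
    """Stack delayed copies of a 1-D stimulus feature as columns (lags 0..n_lags-1)."""
    n = len(stimulus)
    X = np.zeros((n, n_lags))
    for lag in range(n_lags):
        X[lag:, lag] = stimulus[:n - lag]
    return X

def fit_trf(stimulus, response, n_lags, ridge_lambda):
    """Ridge-regularized least squares: w = (X'X + lambda * I)^-1 X'y."""
    X = lagged_design(stimulus, n_lags)
    return np.linalg.solve(X.T @ X + ridge_lambda * np.eye(n_lags), X.T @ response)

def predict(stimulus, trf):
    """Predict the response by applying the TRF to the lagged stimulus."""
    return lagged_design(stimulus, len(trf)) @ trf

# Synthetic example (hypothetical kernel and noise level).
rng = np.random.default_rng(0)
true_trf = np.array([0.0, 0.5, 1.0, 0.3, -0.2])
stim = rng.standard_normal(5000)
eeg = predict(stim, true_trf) + 0.5 * rng.standard_normal(5000)

# Fit on the first 4000 samples, evaluate on the remaining 1000.
est = fit_trf(stim[:4000], eeg[:4000], n_lags=5, ridge_lambda=1.0)
r = np.corrcoef(predict(stim[4000:], est), eeg[4000:])[0, 1]

# A much stronger penalty shrinks the estimated TRF toward zero.
est_strong = fit_trf(stim[:4000], eeg[:4000], n_lags=5, ridge_lambda=1e4)
```

In this toy setting the penalty strength directly scales the estimated amplitudes, which is one way a regularization choice can interact with the amount of training data; the abstract's finding concerns the authors' own procedure, which is not reproduced here.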
Journal of Neuroscience,
Journal year: 2022,
Issue: 42(39), pp. 7442-7453
Published: Aug. 30, 2022
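When listening to continuous speech, the human brain can track features of the presented speech signal. It has been shown that neural tracking of acoustic features is a prerequisite for speech understanding and can predict speech understanding in controlled circumstances. However, the brain also tracks linguistic features of speech, which may be more directly related to speech understanding. We investigated acoustic and linguistic speech processing as a function of varying speech understanding by manipulating the speech rate. In this paradigm, acoustic and linguistic speech processing is affected simultaneously but in opposite directions: when the speech rate increases, more acoustic information per second is present. In contrast, the tracking of linguistic information becomes more challenging when speech is less intelligible at higher speech rates. We measured the EEG of 18 participants (4 male) who listened to speech at various speech rates. As expected and confirmed by the behavioral results, speech understanding decreased with increasing speech rate. Accordingly, linguistic neural tracking decreased with increasing speech rate, but acoustic neural tracking increased. This indicates that neural tracking of linguistic representations can capture the gradual effect of decreasing speech understanding. In addition, increased acoustic neural tracking does not necessarily imply better speech understanding. This suggests that, although more challenging to measure because of the low signal-to-noise ratio, linguistic neural tracking may be a more direct predictor of speech understanding.
SIGNIFICANCE STATEMENT
An increasingly popular method to investigate neural speech processing is to measure neural tracking. Although much research has been done on how the brain tracks acoustic speech features, linguistic speech features have received less attention. In this study, we disentangled acoustic and linguistic characteristics of neural speech tracking via manipulating the speech rate. A proper way of objectively measuring auditory and language processing paves the way toward clinical applications: an objective measure of speech understanding would allow for behavioral-free evaluation of speech understanding, which in turn would allow clinicians to evaluate hearing loss and adjust hearing aids based on brain responses. This objective measure would benefit populations from whom obtaining behavioral measures may be complex, such as young children or people with cognitive impairments.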
Scientific Reports,
Journal year: 2023,
Issue: 13(1)
Published: Jan. 16, 2023
Abstract
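To investigate the processing of speech in the brain, simple linear models are commonly used to establish a relationship between brain signals and speech features. However, these linear models are ill-equipped to model a highly-dynamic, complex non-linear system like the brain, and they often require a substantial amount of subject-specific training data. This work introduces a novel speech decoder architecture: the Very Large Augmented Auditory Inference (VLAAI) network. The VLAAI network outperformed state-of-the-art subject-independent models (median Pearson correlation of 0.19, p < 0.001), yielding an increase over the well-established linear model by 52%. Using ablation techniques, we identified the relative importance of each part of the VLAAI network and found that the non-linear components and output context module influenced model performance the most (10% relative performance increase). Subsequently, the VLAAI network was evaluated on a holdout dataset of 26 subjects and a publicly available unseen dataset to test generalization for unseen subjects and stimuli. No significant difference was found between the default test set and the holdout subjects, and between the default test set and the public dataset. The VLAAI network also significantly outperformed all baseline models on the public dataset. We evaluated the effect of training set size by training the VLAAI network on data from 1 up to 80 subjects and evaluating on the 26 holdout subjects, revealing a relationship following a hyperbolic tangent function between the number of subjects in the training set and the performance on unseen subjects. Finally, the subject-independent VLAAI network was finetuned for the 26 holdout subjects to obtain subject-specific VLAAI models. With 5 minutes of data or more, a significant performance improvement was found, up to 34% (from 0.18 to 0.25 median Pearson correlation) with regards to the subject-independent VLAAI network.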
Cortex,
Journal year: 2022,
Issue: 151, pp. 133-146
Published: March 19, 2022
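The alignment between low-frequency activity in the brain and slow acoustic modulations in the speech signal depicts a core principle in present theories of speech perception, a process referred to as 'neural speech tracking'. While most older adults, particularly those with highly prevalent age-related hearing loss, have difficulties with speech perception and comprehension, the impact of hearing loss on neural speech tracking is still unclear. In this study we investigated the effects of pure-tone hearing loss and different types of background noise on the neural speech tracking response in a large sample of older adults (N = 101). Furthermore, we examined whether neural speech tracking was predictive for speech comprehension. For this purpose, we obtained scalp EEG from our participants, who had varying degrees of hearing loss (7.5–59.6 dB HL, 0.5–8 kHz pure tones), while they listened to sentences in quiet, pink noise and multi-talker babble noise. Speech tracking was quantified by computing the cross-correlation between the EEG signal and the amplitude envelope of the speech signal heard. A higher degree of hearing loss was associated with greater neural speech tracking (i.e., higher cross-correlation). Additionally, greater neural speech tracking showed a positive association with speech comprehension. This relationship was modulated by hearing loss, with hearing-impaired participants benefitting more from greater neural tracking. Our results highlight the potential of neural speech tracking as an objective measure of speech comprehension and as a possible target mechanism for clinical interventions such as neurofeedback. The interaction between neural tracking and hearing loss suggests a compensatory mechanism in which hearing-impaired participants rely more on neural tracking of the speech signal.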
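The cross-correlation measure described in this abstract can be sketched as follows. This is a minimal single-channel illustration on synthetic data, assuming the EEG lags the envelope by a fixed number of samples; the function name `normalized_xcorr`, the synthetic envelope, the 12-sample delay, and the noise level are hypothetical and do not reflect the study's preprocessing or lag range.

```python
import numpy as np

def normalized_xcorr(envelope, eeg, max_lag):
    """Cross-correlate z-scored signals for EEG delays of 0..max_lag samples.

    Each value approximates the Pearson correlation between the envelope
    and the EEG shifted back by that lag.
    """
    env = (envelope - envelope.mean()) / envelope.std()
    sig = (eeg - eeg.mean()) / eeg.std()
    n = len(env)
    lags = np.arange(max_lag + 1)
    values = np.array([np.mean(env[:n - lag] * sig[lag:]) for lag in lags])
    return lags, values

# Synthetic envelope: rectified, smoothed noise (stand-in for a speech envelope).
rng = np.random.default_rng(1)
envelope = np.abs(np.convolve(rng.standard_normal(6000), np.ones(8) / 8, mode="same"))

# Synthetic "EEG": the envelope delayed by 12 samples plus sensor noise.
delay = 12
eeg = np.zeros_like(envelope)
eeg[delay:] = envelope[:-delay]
eeg += 0.2 * rng.standard_normal(len(eeg))

lags, xc = normalized_xcorr(envelope, eeg, max_lag=40)
peak_lag = int(lags[np.argmax(xc)])  # lag at which tracking is strongest
```

The peak value of `xc` plays the role of a tracking score in this sketch, and `peak_lag` the response delay; how the study summarized the cross-correlation function is not specified in the abstract.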
NeuroImage,
Journal year: 2022,
Issue: 267, pp. 119841-119841
Published: Dec. 28, 2022
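Background: Older adults process speech differently, but it is not yet clear how aging affects different levels of processing natural, continuous speech, both in terms of bottom-up acoustic analysis and top-down generation of linguistic-based predictions. We studied natural speech processing across the adult lifespan via electroencephalography (EEG) measurements of neural tracking.
Goals: Our goals are to analyze the unique contribution of linguistic speech processing across the adult lifespan using natural speech, while controlling for the influence of acoustic processing. Moreover, we also studied acoustic processing across age. In particular, we focus on changes in spatial and temporal activation patterns in response to natural speech across the lifespan.
Methods: 52 normal-hearing adults between 17 and 82 years of age listened to a naturally spoken story while the EEG signal was recorded. We investigated the effect of age on acoustic and linguistic processing of speech. Because age correlated with hearing capacity and measures of cognition, we investigated whether the observed age effect is mediated by these factors. Furthermore, we investigated whether there is an effect of age on hemisphere lateralization and on spatiotemporal patterns of the neural responses.
Results: Our EEG results showed that linguistic speech processing declines with advancing age. Moreover, as age increased, the neural response latency to certain aspects of linguistic speech processing increased. Acoustic neural tracking (NT) also decreased with increasing age, which is at odds with the literature. In contrast to linguistic processing, older subjects showed shorter latencies for early acoustic responses to speech. No evidence was found for hemispheric lateralization in either younger or older adults during linguistic speech processing. Most of the observed aging effects on acoustic and linguistic processing were not explained by age-related decline in hearing capacity or cognition. However, our results suggest that the effect of decreasing linguistic neural tracking with age at word-level is partially due to an age-related decline in cognition rather than a robust effect of age.
Conclusion: Spatial and temporal characteristics of the neural responses to continuous speech change across the adult lifespan for both acoustic and linguistic speech processing. These changes may be traces of structural and/or functional change that occurs with advancing age.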
Frontiers in Neuroscience,
Journal year: 2023,
Issue: 16
Published: Jan. 12, 2023
In recent years, temporal response function (TRF) analyses of neural activity recordings evoked by continuous naturalistic stimuli have become increasingly popular for characterizing response properties within the auditory hierarchy. However, despite this rise in TRF usage, relatively few educational resources for these tools exist. Here we use a dual-talker continuous speech paradigm to demonstrate how a key parameter of experimental design, the quantity of acquired data, influences TRF analyses fit to either individual data (subject-specific analyses), or group data (generic analyses). We show that although model prediction accuracy increases monotonically with data quantity, the amount of data required to achieve significant prediction accuracies can vary substantially based on whether the fitted model contains densely (e.g., acoustic envelope) or sparsely (e.g., lexical surprisal) spaced features, especially when the goal of the analyses is to capture the aspect of neural responses uniquely explained by specific features. Moreover, we demonstrate that generic models can exhibit high performance on small amounts of test data (2–8 min), if they are trained on a sufficiently large data set. As such, they may be particularly useful for clinical and multi-task study designs with limited recording time. Finally, we show that the regularization procedure used in fitting TRF models can interact with the quantity of data used to fit the models, with larger training quantities resulting in systematically larger TRF amplitudes. Together, demonstrations in this work should aid new users of TRF analyses, and in combination with other tools, such as piloting and power analyses, may serve as a detailed reference for choosing acquisition duration for future studies.
NeuroImage,
Journal year: 2024,
Issue: 297, pp. 120675-120675
Published: June 15, 2024
The synchronization between the speech envelope and neural activity in auditory regions, referred to as cortical tracking of speech (CTS), plays a key role in speech processing. The method selected for extracting the envelope is a crucial step in CTS measurement, and the absence of a consensus on best practices among the various methods can influence analysis outcomes and interpretation. Here, we systematically compare five standard envelope extraction methods (the absolute value of the Hilbert transform (absHilbert), gammatone filterbanks, a heuristic approach, the Bark scale, and vocalic energy), analyzing their impact on the CTS. We present performance metrics for each method based on the recording of brain activity from participants listening to speech in clear and noisy conditions, utilizing intracranial EEG, MEG and EEG data. As expected, we observed significant CTS in temporal brain regions below 10 Hz across all datasets, regardless of the extraction methods. In general, the gammatone filterbanks approach consistently demonstrated superior performance compared to the other methods. Results from our study can guide scientists in the field to make informed decisions about the optimal analysis to extract the CTS, contributing to advancing the understanding of the neuronal mechanisms implicated in CTS.
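Of the five extraction methods the abstract lists, the absolute value of the Hilbert transform is the simplest to write down. The sketch below builds the analytic signal with an FFT and takes its magnitude; it uses the same construction as `scipy.signal.hilbert` but only needs NumPy. The helper names and the amplitude-modulated test tone are assumptions for illustration, and any band-limiting or downsampling the authors applied afterwards is not shown.

```python
import numpy as np

def analytic_signal(x):
    """Analytic signal via FFT: keep DC, double positive frequencies, zero negative ones."""
    n = len(x)
    spectrum = np.fft.fft(x)
    weights = np.zeros(n)
    weights[0] = 1.0
    if n % 2 == 0:
        weights[n // 2] = 1.0          # Nyquist bin is kept once for even lengths
        weights[1:n // 2] = 2.0
    else:
        weights[1:(n + 1) // 2] = 2.0
    return np.fft.ifft(spectrum * weights)

def abs_hilbert_envelope(x):
    """Broadband amplitude envelope: magnitude of the analytic signal."""
    return np.abs(analytic_signal(x))

# Test tone: 100 Hz carrier with a slow 4 Hz amplitude modulation (2 s at 1 kHz).
fs = 1000
t = np.arange(2 * fs) / fs
modulator = 1.0 + 0.5 * np.sin(2 * np.pi * 4 * t)
signal = modulator * np.sin(2 * np.pi * 100 * t)

envelope = abs_hilbert_envelope(signal)  # recovers the 4 Hz modulator
```

For real speech the broadband envelope obtained this way is usually smoothed before being related to neural data; the filterbank-based methods named in the abstract instead compute envelopes per frequency band and combine them.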
Deleted Journal,
Journal year: 2024,
Issue: 6(2), pp. 667-684
Published: June 13, 2024
Using naturalistic spoken narratives to investigate speech processes and comprehension is becoming increasingly popular in experimental hearing research. Yet, little is known about how individuals engage with spoken story materials and how listening experiences change with age. We investigated absorption in the context of listening to spoken stories, explored predictive factors for engagement, and examined the utility of a scale developed for written narratives to assess engagement with auditory materials. Adults aged 20–78 years (N = 216) participated in an online listening study. Participants listened to one of ten stories intended to be engaging to different degrees and rated the story in terms of absorption and enjoyment. Participants of different ages rated the stories as similarly absorbing and enjoyable. Further, higher mood scores predicted higher absorption and enjoyment ratings. Factor analysis showed that scale items approximately grouped according to the original scale dimensions, suggesting that absorption and enjoyment experiences may be similar for written and spoken stories, although certain items discriminated less effectively between stories intended to be more or less engaging. The present study provides novel insights into how adults of different ages engage in listening and supports using naturalistic speech stimuli in hearing research.