arXiv (Cornell University),
Journal Year:
2023,
Volume and Issue:
unknown
Published: Jan. 1, 2023
In
our
digital
universe
nowadays,
enormous
amount
of
data
are
produced
in
a
streaming
manner
variety
application
areas.
These
often
unlabelled.
this
case,
identifying
infrequent
events,
such
as
anomalies,
poses
great
challenge.
This
problem
becomes
even
more
difficult
non-stationary
environments,
which
can
cause
deterioration
the
predictive
performance
model.
To
address
above
challenges,
paper
proposes
an
autoencoder-based
incremental
learning
method
with
drift
detection
(strAEm++DD).
Our
proposed
strAEm++DD
leverages
on
advantages
both
and
detection.
We
conduct
experimental
study
using
real-world
synthetic
datasets
severe
or
extreme
class
imbalance,
provide
empirical
analysis
strAEm++DD.
further
comparative
study,
showing
that
significantly
outperforms
existing
baseline
advanced
methods.
Natural Product Reports,
Journal Year:
2023,
Volume and Issue:
40(11), P. 1735 - 1753
Published: Jan. 1, 2023
This
review
presents
a
summary
of
the
recent
advancements
in
machine
learning-assisted
structure
elucidation
(MLASE)
to
establish
structures
natural
products
(NPs).
2022 International Joint Conference on Neural Networks (IJCNN),
Journal Year:
2023,
Volume and Issue:
unknown, P. 1 - 8
Published: June 18, 2023
In
our
digital
universe
nowadays,
enormous
amount
of
data
are
produced
in
a
streaming
manner
variety
application
areas.
These
often
unlabelled.
this
case,
identifying
infrequent
events,
such
as
anomalies,
poses
great
challenge.
This
problem
becomes
even
more
difficult
non-stationary
environments,
which
can
cause
deterioration
the
predictive
performance
model.
To
address
above
challenges,
paper
proposes
an
autoencoder-based
incremen-tal
learning
method
with
drift
detection
(strAEm++DD).
Our
proposed
strAEm++DD
leverages
on
advantages
both
incremental
and
detection.
We
conduct
experimental
study
using
real-world
synthetic
datasets
severe
or
extreme
class
imbalance,
provide
empirical
analysis
strAEm++DD.
further
comparative
study,
showing
that
significantly
outper-forms
existing
baseline
advanced
methods.
2021 IEEE Symposium Series on Computational Intelligence (SSCI),
Journal Year:
2022,
Volume and Issue:
unknown
Published: Dec. 4, 2022
There
is
an
emerging
need
for
predictive
models
to
be
trained
on-the-fly,
since
in
numerous
machine
learning
applications
data
are
arriving
online
fashion.
A
critical
challenge
encountered
that
of
limited
availability
ground
truth
information
(e.g.,
labels
classification
tasks)
as
new
observed
one-by-one
online,
while
another
significant
class
imbalance.
This
work
introduces
the
novel
Augmented
Queues
method,
which
addresses
dual-problem
by
combining
a
synergistic
manner
active
learning,
augmentation,
and
multi-queue
memory
maintain
separate
balanced
queues
each
class.
We
perform
extensive
experimental
study
using
image
time-series
augmentations,
we
examine
roles
budget,
size,
imbalance
level,
neural
network
type.
demonstrate
two
major
advantages
Queues.
First,
it
does
not
reserve
additional
space
generation
synthetic
occurs
only
at
training
times.
Second,
have
access
more
labelled
without
increase
budget
/
or
original
size.
Learning
on-the-fly
poses
challenges
which,
typically,
hinder
deployment
models.
significantly
improves
performance
terms
quality
speed.
Our
code
made
publicly
available.
IEEE Access,
Journal Year:
2023,
Volume and Issue:
11, P. 140428 - 140442
Published: Jan. 1, 2023
The
class
imbalance
problem
negatively
impacts
learning
algorithms'
performance
in
minority
classes
which
may
constitute
more
severe
attacks
than
the
majority
ones.
This
study
investigates
benefits
of
balancing
strategies
and
imbalanced
approaches
on
intrusion
data
from
Software
Defined
Networking
(SDN).
Although
research
community
has
covered
machine
learning-based
detection,
addressing
this
SDN
is
novel
powerful.
Addressing
over
InSDN
(the
only
publicly
available
detection
dataset
as
recent)
significant
impact
future
area
SDN.
We
address
through
data-level
classifier-level
techniques.
Our
objective
to
determine
suitable
methods
propose
custom
deep
architectures
based
GANs
Siamese
Neural
Networks
for
generative
modeling
similarity-based
detection.
paper
provides
benchmarking
results
classification
with
Random
Oversampling
(ROS),
SMOTE,
GANs,
weighted
Forest,
Siamese-based
one-shot
learning.
have
found
that
Forest
(RF)
outperforms
models
instances.
supports
notion
RF
can
handle
well.
also
observe
widely-used
techniques,
ROS
drastically
decrease
False
Positive
Rate
(FPR)
but
increase
Negative
(FNR)
classes.
Conclusively,
while
improve
models,
they,
fact,
degrade
RF's
performance,
i.e.
cause
higher
numbers
false
predictions.
Therefore,
does
not
need
additional
get
performance.
work
addresses
data,
it
a
well-designed
benchmark
be
exemplary
any
network
data.
Thus,
studies
respective
domain.
2021 IEEE Symposium Series on Computational Intelligence (SSCI),
Journal Year:
2022,
Volume and Issue:
unknown
Published: Dec. 4, 2022
In
real-world
applications,
the
process
generating
data
might
suffer
from
nonstationary
effects
(e.g.,
due
to
seasonality,
faults
affecting
sensors
or
actuators,
and
changes
in
users'
behaviour).
These
changes,
often
called
concept
drift,
induce
severe
(potentially
catastrophic)
impacts
on
trained
learning
models
that
become
obsolete
over
time,
inadequate
solve
task
at
hand.
Learning
presence
of
drift
aims
designing
machine
deep
are
able
track
adapt
drift.
Typically,
techniques
handle
either
active
passive,
traditionally,
these
have
been
considered
be
mutually
exclusive.
Active
use
an
explicit
detection
mechanism,
re-train
algorithm
when
is
detected.
Passive
implicit
method
deal
with
continually
update
model
using
incremental
learning.
Differently
what
present
literature,
we
propose
a
hybrid
alternative
which
merges
two
approaches,
hence,
leveraging
their
advantages.
The
proposed
Hybrid-Adaptive
REBAlancing
(HAREBA)
significantly
outperforms
strong
baselines
state-of-the-art
methods
terms
quality
speed;
experiment
how
it
effective
under
class
imbalance
levels
too.