2022 IEEE/ACM 44th International Conference on Software Engineering: Companion Proceedings (ICSE-Companion),
Journal Year:
2023,
Volume and Issue:
unknown, P. 88 - 92
Published: May 1, 2023
Log
recommendation
plays
a
vital
role
in
analyzing
run-time
issues
including
anomaly
detection,
performance
monitoring,
and
security
evaluation.
However,
existing
deeplearning-based
approaches
for
log
suffer
from
insufficient
features
low
F
1
.
To
this
end,
paper
proposes
prototype
called
DeepLog
to
recommend
location
based
on
deep
learning
model.
parses
the
source
code
into
an
abstract
syntax
tree
then
converts
each
method
block
hierarchical
which
extracts
both
semantic
syntactic
features.
By
doing
this,
we
construct
dataset
with
more
than
110K
samples.
employs
double-branched
neural
network
model
locations.
We
evaluate
effectiveness
of
by
answering
four
research
questions.
The
experimental
results
demonstrate
that
it
can
8,725
logs
23
projects
is
28.17%
higher
approaches,
improves
state-of-the-art.
Artificial Intelligence Review,
Journal Year:
2024,
Volume and Issue:
57(10)
Published: Aug. 18, 2024
Artificial
intelligence
(AI)
has
significantly
impacted
various
fields.
Large
language
models
(LLMs)
like
GPT-4,
BARD,
PaLM,
Megatron-Turing
NLG,
Jurassic-1
Jumbo
etc.,
have
contributed
to
our
understanding
and
application
of
AI
in
these
domains,
along
with
natural
processing
(NLP)
techniques.
This
work
provides
a
comprehensive
overview
LLMs
the
context
modeling,
word
embeddings,
deep
learning.
It
examines
diverse
fields
including
text
generation,
vision-language
models,
personalized
learning,
biomedicine,
code
generation.
The
paper
offers
detailed
introduction
background
on
LLMs,
facilitating
clear
their
fundamental
ideas
concepts.
Key
modeling
architectures
are
also
discussed,
alongside
survey
recent
works
employing
LLM
methods
for
downstream
tasks
across
different
domains.
Additionally,
it
assesses
limitations
current
approaches
highlights
need
new
methodologies
potential
directions
significant
advancements
this
field.
Scientific Reports,
Journal Year:
2023,
Volume and Issue:
13(1)
Published: Sept. 27, 2023
Abstract
Detecting
code
smells
may
be
highly
helpful
for
reducing
maintenance
costs
and
raising
source
quality.
Code
facilitate
developers
or
researchers
to
understand
several
types
of
design
flaws.
with
high
severity
can
cause
significant
problems
the
software
challenges
system's
maintainability.
It
is
quite
essential
assess
detected
in
software,
as
it
prioritizes
refactoring
efforts.
The
class
imbalance
problem
also
further
enhances
difficulties
smell
detection.
In
this
study,
four
datasets
(Data
class,
God
Feature
envy,
Long
method)
are
selected
detect
severity.
work,
an
effort
made
address
issue
imbalance,
which,
Synthetic
Minority
Oversampling
Technique
(SMOTE)
balancing
technique
applied.
Each
dataset's
relevant
features
chosen
using
a
feature
selection
based
on
principal
component
analysis.
determined
five
machine
learning
techniques:
K-nearest
neighbor,
Random
forest,
Decision
tree,
Multi-layer
Perceptron,
Logistic
Regression.
This
study
obtained
0.99
accuracy
score
forest
tree
approach
method
smell.
model's
performance
compared
its
three
other
measurements
(Precision,
Recall,
F-measure)
estimate
classification
models.
impact
presented
without
applying
SMOTE.
results
promising
beneficial
paving
way
studies
area.
Scientific Reports,
Journal Year:
2023,
Volume and Issue:
13(1)
Published: Nov. 16, 2023
This
paper
aims
to
explore
the
application
of
deep
learning
in
smart
contract
vulnerabilities
detection.
Smart
contracts
are
an
essential
part
blockchain
technology
and
crucial
for
developing
decentralized
applications.
However,
can
cause
financial
losses
system
crashes.
Static
analysis
tools
frequently
used
detect
contracts,
but
they
often
result
false
positives
negatives
because
their
high
reliance
on
predefined
rules
lack
semantic
capabilities.
Furthermore,
these
quickly
become
obsolete
fail
adapt
or
generalize
new
data.
In
contrast,
methods
do
not
require
detection
learn
features
during
training
process.
this
paper,
we
introduce
a
solution
called
Lightning
Cat
which
is
based
techniques.
We
train
three
models
detecting
contract:
Optimized-CodeBERT,
Optimized-LSTM,
Optimized-CNN.
Experimental
results
show
that,
propose,
Optimized-CodeBERT
model
surpasses
other
methods,
achieving
f1-score
93.53%.
To
precisely
extract
vulnerability
features,
acquire
segments
vulnerable
code
functions
retain
critical
features.
Using
CodeBERT
pre-training
data
preprocessing,
could
capture
syntax
semantics
more
accurately.
demonstrate
feasibility
our
proposed
solution,
evaluate
its
performance
using
SolidiFI-benchmark
dataset,
consists
9369
injected
with
from
seven
different
types.
ACM Computing Surveys,
Journal Year:
2023,
Volume and Issue:
55(13s), P. 1 - 48
Published: May 13, 2023
The
accuracy
reported
for
code
smell-detecting
tools
varies
depending
on
the
dataset
used
to
evaluate
tools.
Our
survey
of
45
existing
datasets
reveals
that
adequacy
a
detecting
smells
highly
depends
relevant
properties
such
as
size,
severity
level,
project
types,
number
each
type
smell,
smells,
and
ratio
smelly
non-smelly
samples
in
dataset.
Most
support
God
Class,
Long
Method,
Feature
Envy
while
six
Fowler
Beck's
catalog
are
not
supported
by
any
datasets.
We
conclude
suffer
from
imbalanced
samples,
lack
supporting
restriction
Java
language.
Software Practice and Experience,
Journal Year:
2023,
Volume and Issue:
53(10), P. 1902 - 1927
Published: June 26, 2023
Summary
Machine
learning‐based
code
smell
detection
(CSD)
has
been
demonstrated
to
be
a
valuable
approach
for
improving
software
quality
and
enabling
developers
identify
problematic
patterns
in
code.
However,
previous
researches
have
shown
that
the
datasets
commonly
used
train
these
models
are
heavily
imbalanced.
While
some
recent
studies
explored
use
of
imbalanced
learning
techniques
CSD,
they
only
evaluated
limited
number
thus
their
conclusions
about
most
effective
methods
may
biased
inconclusive.
To
thoroughly
evaluate
effect
machine
we
examine
31
with
seven
classifiers
build
CSD
on
four
data
sets.
We
employ
evaluation
metrics
assess
performance
Wilcoxon
signed‐rank
test
Cliff's
.
The
results
show
(1)
Not
all
significantly
improve
performance,
but
deep
forest
outperforms
other
(2)
SMOTE
(Synthetic
Minority
Over‐sampling
TEchnique)
is
not
technique
resampling
(3)
best‐performing
top‐3
little
time
cost
detection.
Therefore,
provide
practical
guidelines.
First,
researchers
practitioners
should
select
appropriate
(e.g.,
forest)
ameliorate
class
imbalance
problem.
In
contrast,
blind
application
could
harmful.
Then,
better
than
selected
preprocess