BMC Bioinformatics,
Journal Year:
2023,
Volume and Issue:
24(1)
Published: Dec. 14, 2023
Abstract
Background
The
standardization
of
biological
data
using
unique
identifiers
is
vital
for
seamless
integration,
comprehensive
interpretation,
and
reproducibility
research
findings,
contributing
to
advancements
in
bioinformatics
systems
biology.
Despite
being
widely
accepted
as
a
universal
identifier,
scientific
names
species
have
inherent
limitations,
including
lack
stability,
uniqueness,
convertibility,
hindering
their
effective
use
databases,
particularly
natural
product
(NP)
occurrence
posing
substantial
obstacle
utilizing
this
valuable
large-scale
applications.
Result
To
address
these
challenges
facilitate
high-throughput
analysis
involving
names,
we
developed
PhyloSophos,
Python
package
that
considers
the
properties
taxonomic
accurately
map
name
inputs
entries
within
chosen
reference
database.
We
illustrate
importance
assessing
multiple
databases
considering
syntax-based
pre-processing
NP
an
example,
with
ultimate
goal
integrating
heterogeneous
information
into
single,
unified
dataset.
Conclusions
anticipate
PhyloSophos
significantly
aid
systematic
processing
poorly
digitized
curated
data,
such
biodiversity
ethnopharmacological
resources,
enabling
full-scale
resources.
Journal of Biogeography,
Journal Year:
2024,
Volume and Issue:
51(8), P. 1365 - 1373
Published: March 8, 2024
Abstract
The
gap
between
the
number
of
described
species
and
that
actually
exist
is
known
as
Linnean
shortfall
fundamental
importance
for
biogeography
conservation.
Unsurprisingly,
there
have
been
many
attempts
to
quantify
its
extent
different
taxa
regions.
In
this
Perspective
,
we
argue
such
forecasts
remain
highly
problematic
because
does
depend
not
only
on
rates
exploration
(sampling
undescribed
taxa)
which
estimates
commonly
based
but
also
taxonomic
change
(lumping
splitting).
These
changes
concepts
adopted
information
methods
used
delimit
species.
Commonly
estimating
unknown
(e.g.
discovery
curves,
taxon
ratios)
can
underestimate
or
overestimate
if
they
do
effectively
account
trends
change.
A
further
complication
history
well
documented
most
typically
available
in
biodiversity
databases.
Moreover,
wide
geographic
variation
adoption
delimitation
mean
comparison
even
same
regions
may
be
unreliable.
Given
high
likelihood
future
major
taxa,
propose
two
main
strategies
consider
influence
species:
(i)
a
conservative
approach
shortfall,
restricting
analysis
groups
where
taxonomies
are
relatively
stable
(ii)
explicitly
incorporating
metrics
into
models
estimates.
short,
relevant
will
achieved
by
accounting
dynamic
nature
process
itself.
Biodiversity Data Journal,
Journal Year:
2024,
Volume and Issue:
12
Published: Feb. 1, 2024
Incomplete
species
inventories
for
Antarctica
represent
a
key
challenge
comprehensive
ecological
research
and
conservation
in
the
region.
Additionally,
data
required
to
understand
population
dynamics,
rates
of
evolution,
spatial
ranges,
functional
traits,
physiological
tolerances
interactions,
all
which
are
fundamental
disentangle
different
elements
Antarctic
biodiversity,
mostly
missing.
However,
much
fauna,
flora
microbiota
emerged
ice-free
land
continent
have
an
uncertain
presence
and/or
unresolved
status,
with
entire
biodiversity
compendia
prokaryotic
groups
(e.g.
bacteria)
being
All
available
information
requires
consolidation,
cross-validation,
re-assessment
steady
systematic
inclusion
order
create
robust
catalogue
continent.
We
compiled,
completed
revised
eukaryotic
present
terrestrial
freshwater
ecosystems
new
living
database:
terrANTALife
(version
1.0).
The
database
includes
first
integration
compendium
many
microorganisms.
also
introduce
amplicon
sequence
variants
(ASVs)
biodiversity.
Available
literature
date
were
searched
species,
integrated,
taxonomically
harmonised
curated
by
experts
checklists
organisms.
final
comprises
470
animal
(including
vertebrates,
free-living
invertebrates
parasites),
306
plants
Viridiplantae:
embryophytes
green
algae),
997
fungal
434
protists
(sensu
lato).
provide
account
microorganisms,
including
non-lichenised
fungi
multiple
unicellular
(Stramenophila,
Alveolata
Rhizaria
(SAR),
Chromists
Amoeba),
jointly
referred
as
"protists".
In
addition,
we
identify
1753
bacterial
(obtained
from
348117
ASVs)
34
archaeal
genera
(from
1848
ASVs),
well
as,
at
least,
14
virus
families.
formulate
basic
tree
life
main
lineages
listed
region
their
“known-accepted-species”
numbers.
Biodiversity Data Journal,
Journal Year:
2025,
Volume and Issue:
13
Published: Jan. 8, 2025
The
standardisation
and
correction
of
taxonomic
names
in
large
biodiversity
databases
remain
persistent
challenges
for
researchers,
as
errors
species
can
compromise
ecological
analyses,
land-use
planning
conservation
efforts,
particularly
when
inaccurate
data
are
shared
on
global
portals.
We
present
pytaxon,
a
Python
software
designed
to
resolve
correct
by
leveraging
the
Global
Names
Verifier
(GNV)
API
employing
fuzzy
matching
techniques
suggest
corrections
discrepancies
nomenclatural
inconsistencies.
pytaxon
offers
both
Command
Line
Interface
(CLI)
Graphical
User
(GUI),
ensuring
accessibility
users
with
different
levels
computing
expertise.
Tests
spreadsheets
derived
from
datasets
published
Biodiversity
Information
Facility
(GBIF)
demonstrated
its
effectiveness
identifying
resolving
errors.
By
mitigating
propagation
inaccuracies
researchers'
databases,
supports
more
reliable
decisions
robust
scientific
investigations.
Its
contributions
enhance
integrity
promote
informed
management
rapidly
evolving
environment.
Scientific Data,
Journal Year:
2022,
Volume and Issue:
9(1)
Published: Sept. 3, 2022
Sediment
diatoms
are
widely
used
to
track
environmental
histories
of
lakes
and
their
watersheds,
but
merging
datasets
generated
by
different
researchers
for
further
large-scale
studies
is
challenging
because
taxonomic
discrepancies
caused
rapidly
evolving
diatom
nomenclature
concepts.
We
collated
five
lake
sediment
from
the
Northeastern
USA
using
a
harmonization
process
which
included
updating
synonyms,
tracking
identity
inconsistently
identified
taxa,
grouping
those
that
could
not
be
resolved
taxonomically.
Each
step
led
an
increase
in
variation
explained
variables
parallel
reduction
attributable
inconsistency.
To
maximize
future
use
data
underlying
specimens
we
provide
original
harmonized
counts
1327
core
samples
607
lakes,
name
translation
schemes,
sample
metadata,
specimen
museum
locations,
Northeast
Lakes
Voucher
Flora,
set
light
microscope
images
grouped
into
1154
morphological
operational
units.
Post-hoc
enables
quality
control
when
other
approaches
(e.g.,
upfront
management
consistency)
possible.
Organisms Diversity & Evolution,
Journal Year:
2021,
Volume and Issue:
21(4), P. 657 - 675
Published: Oct. 22, 2021
A
consensus
among
biologists
has
been
growing
in
recent
years
for
the
development
of
a
global
list
accepted
species
(and
other
taxa).
To
date,
much
discussion
focused
on
visions
how
such
would
benefit
many
scientific
and
societal
disciplines.
Less
emphasis
placed
understanding
technical
challenges
compiling
maintaining
list.
Challenges
include
details
implementation
as
defining
what
each
entry
represents,
scope
(taxonomic
breadth),
granularity
(only
species,
all
taxonomic
ranks,
unnamed
operational
units),
level
confidence
status
individual
entries.
The
specific
properties
minimum
information
requirements
items
need
to
be
defined,
process
ensuring
accuracy,
consistency,
noting
uncertainties,
needs
adopted.
Perhaps
greatest
challenge
is
developing
procedures
by
which
created,
updated,
maintained.
Considerations
incorporate
obscure
newly
described
taxa,
extent
review
process,
mechanisms
arbitrating
disputes
or
alternative
viewpoints
will
addressed
through
an
open
transparent
with
broad
engagement
from
multiple
communities.
Details
concerning
can
accessed,
it
maintained,
way
its
contents
are
properly
cited
determined.
Many
these
issues
have
considered
sometimes
solved
Catalogue
Life,
should
serve
core
foundation
actual
any
species.
Ornithology Research,
Journal Year:
2023,
Volume and Issue:
31(1), P. 62 - 70
Published: Jan. 17, 2023
Abstract
In
a
recent
commentary,
Lima
(Ornithol
Res
30:225–228,
2022a)
claims
that
the
Brazilian
Ornithological
Records
Committee
(CBRO)
bases
its
species
ranking
decisions
on
misinterpretation
of
general
lineage
concept
(GLSC),
with
two
major
detrimental
consequences:
(1)
misinform
ornithological
community
about
nature
interspecific
limits
and
(2)
mislead
users
CBRO
checklist
“…about
taxa
it
contains.”
Here,
I
review
arguments
behind
Lima’s
demonstrate
they
are
unfounded
stem
from
an
inaccurate
interpretation
GLSC
coupled
lack
understanding
differences
between
inherent
theoretical
concepts
their
actual
implementation
in
real-life
situations
by
taxonomists
taxonomy
committees
particular.
Accordingly,
provide
several
examples
taxonomic
treatments
implemented
CBRO,
confirming
as
guiding
committee.
If
not
addressed
properly,
unsubstantiated
commentaries
such
put
forward
—
which
cast
serious
doubts
standards
upon
checklists
produced
may
have
unintended
effect
amplifying
current
representation
bias
against
Latin
American
ornithologists
global
initiatives
publication
single
worldwide
avian
checklist.
Megataxa,
Journal Year:
2023,
Volume and Issue:
10(1)
Published: April 26, 2023
There
are
notoriously
many
different
definitions
of
species
and
methods
delimitation,
forcing
taxonomists
to
make
a
long
range
methodological
decisions
in
delimitation.
Because
this,
there
sometimes
multiple
viable
competing
paths,
which
could
lead
ranking
(or
even
grouping)
decisions.
As
result,
it
is
often
unclear
what
means
for
group
be
recognized
as
species,
the
groups
not
always
comparable,
some
have
called
‘subjective’.
To
mitigate
problems
this
causes
users
taxonomy
taxonomists,
we
propose
that
across
tree
life
should
start
preregistering
their
research
design
criteria
delimitation
advance
research.
We
argue
if
were
require
additional
effort,
taxonomic
would
strongly
benefit
term,
by
increasing
transparency
usability
outcomes
reducing
need
ad
hoc
Authorea (Authorea),
Journal Year:
2023,
Volume and Issue:
unknown
Published: Nov. 8, 2023
The
IUCN
Red
List
of
Threatened
Species
is
an
extinction
risk
assessment
tool
that
has
guided
species
conservation
over
the
last
five
decades.
However,
as
wildlife
scientists
and
conservationists,
we
argue
its
influence
on
global
agenda
can
hinder
effective
efforts.
Here,
review
limitations
misuse
in
priority
setting,
which
overlook
local
regional
contexts.
This
result
improper
allocation
resources,
especially
Global
South,
where
financial
resources
are
limited.
In
particular,
funds
directed
towards
red-listed
may
fail
to
address
a
broader
range
priorities.
We
also
contend
insufficient
for
guiding
efforts
recommend
broadening
planning
decision-making
beyond
reliance
List.
Thus,
more
inclusive
decentralized
approach,
summarize
guidelines
at
appropriate
ecological,
spatial,
taxonomic
scales.
Finally,
encourage
collaborative
stakeholder
engagement
setting
priorities
efficient
funding
allocation.