A Long‐Term Ecological Research Data Set From the Marine Genetic Monitoring Program ARMS‐MBON 2018–2020 DOI Creative Commons
Nauras Daraghmeh, Katrina Exter, Justine Pagnier

et al.

Molecular Ecology Resources, Journal Year: 2025, Volume and Issue: unknown

Published: Jan. 31, 2025

Molecular methods such as DNA/eDNA metabarcoding have emerged as useful tools to document the biodiversity of complex communities over large spatio-temporal scales. We established an international Marine Biodiversity Observation Network (ARMS-MBON) combining standardised sampling using autonomous reef monitoring structures (ARMS) with metabarcoding for the genetic monitoring of marine hard-bottom benthic communities. Here, we present data from our first campaign, comprising 56 ARMS units deployed in 2018-2019 and retrieved in 2018-2020 across 15 observatories along the coasts of Europe and adjacent regions. We describe the open-access data set (image, metadata) and explore it to show its potential for ecological research. Our analysis shows that ARMS recovered more than 60 eukaryotic phyla, capturing a diversity of up to ~5500 amplicon sequence variants and ~1800 operational taxonomic units, and up to ~250 and ~50 species per observatory, using the cytochrome c oxidase subunit I (COI) and 18S rRNA marker genes, respectively. Further, ARMS detected threatened, vulnerable and non-indigenous species often targeted in biological monitoring. We show that while deployment duration does not drive diversity estimates, sampling effort and sequencing depth do. We recommend that ARMS should be deployed for at least 3-6 months during the main growth season to use resources as efficiently as possible, and that post-sequencing curation is applied to enable statistical comparison of entities. We suggest that ARMS be used in monitoring programs and long-term research, and encourage the adoption of ARMS-MBON protocols.

Language: English

The Galaxy platform for accessible, reproducible, and collaborative data analyses: 2024 update DOI Creative Commons

Linelle Ann L Abueg, Enis Afgan, Olivier Allart

et al.

Nucleic Acids Research, Journal Year: 2024, Volume and Issue: 52(W1), P. W83 - W94

Published: May 20, 2024

Galaxy (https://galaxyproject.org) is deployed globally, predominantly through free-to-use services, supporting user-driven research that broadens in scope each year. Users are attracted to public Galaxy services by platform stability, tool and reference dataset diversity, training, support and integration, which enables complex, reproducible, shareable data analysis. Applying the principles of user experience design (UXD) has driven improvements in accessibility, tool discoverability through Galaxy Labs/subdomains, and a redesigned Galaxy ToolShed. Galaxy tool capabilities are progressing in two strategic directions: integrating general purpose graphical processing units (GPGPU) access for cutting-edge methods, and licensed tool support. Engagement with global research consortia is being increased by developing more workflows in Galaxy and by resourcing the public services to run them. The Galaxy Training Network (GTN) portfolio has grown in both size and accessibility, through learning paths and direct integration with Galaxy tools that feature in training courses. Code development continues in line with the Galaxy Project roadmap, with improvements to job scheduling and the user interface. Environmental impact assessment is also helping to engage users and developers, reminding them of their role in sustainability, by displaying the estimated CO2 emissions generated by each Galaxy job.

Language: English

Citations

159

Challenges and opportunities in sharing microbiome data and analyses DOI Open Access
Curtis Huttenhower, Robert Finn, Alice C. McHardy

et al.

Nature Microbiology, Journal Year: 2023, Volume and Issue: 8(11), P. 1960 - 1970

Published: Oct. 2, 2023

Language: English

Citations

25

Applying the FAIR Principles to computational workflows DOI Creative Commons
Sean Wilkinson, Meznah Aloqalaa, Khalid Belhajjame

et al.

Scientific Data, Journal Year: 2025, Volume and Issue: 12(1)

Published: Feb. 24, 2025

Recent trends within computational and data sciences show an increasing recognition and adoption of computational workflows as tools for productivity and reproducibility that also democratize access to platforms and processing know-how. As digital objects to be shared, discovered, and reused, computational workflows benefit from the FAIR principles, which stand for Findable, Accessible, Interoperable, and Reusable. The Workflows Community Initiative's FAIR Workflows Working Group (WCI-FW), a global and open community of researchers and developers working with computational workflows across disciplines and domains, has systematically addressed the application of both FAIR data and software principles to computational workflows. We present recommendations with commentary that reflects our discussions and justifies our choices and adaptations. These are offered to workflow users and authors, workflow management system developers, and providers of workflow services as guidelines for adoption and fodder for discussion. The FAIR recommendations for workflows that we propose in this paper will maximize their value as research assets and facilitate their adoption by the wider community.

Language: English

Citations

1

Playbook workflow builder: Interactive construction of bioinformatics workflows DOI Creative Commons
Daniel Clarke, John Erol Evangelista, Zhuorui Xie

et al.

PLoS Computational Biology, Journal Year: 2025, Volume and Issue: 21(4), P. e1012901 - e1012901

Published: April 3, 2025

The Playbook Workflow Builder (PWB) is a web-based platform to dynamically construct and execute bioinformatics workflows by utilizing a growing network of input datasets, semantically annotated API endpoints, and data visualization tools contributed by an ecosystem of collaborators. Via a user-friendly user interface, workflows can be constructed from building-blocks without technical expertise. The output of each step of the workflow is added into reports containing textual descriptions, figures, tables, and references. To construct workflows, users can click on cards that represent each step in the workflow, or construct workflows via a chat interface assisted by a large language model (LLM). Completed workflows are compatible with the Common Workflow Language (CWL) and can be published as research publications, slideshows, and posters. To demonstrate how the PWB generates meaningful hypotheses that draw knowledge from across multiple resources, we present several use cases. For example, one of these use cases prioritizes drug targets for individual cancer patients using data from the NIH Common Fund programs GTEx, LINCS, Metabolomics, GlyGen, and ExRNA. The workflows created with the PWB can be repurposed to tackle similar use cases using different inputs. The PWB platform is available from: https://playbook-workflow-builder.cloud/ .

Language: English

Citations

1

Combining hypothesis- and data-driven neuroscience modeling in FAIR workflows DOI Creative Commons
Olivia Eriksson, Upinder S. Bhalla, Kim T. Blackwell

et al.

eLife, Journal Year: 2022, Volume and Issue: 11

Published: July 6, 2022

Modeling in neuroscience occurs at the intersection of different points of view and approaches. Typically, hypothesis-driven modeling brings a question into focus so that a model is constructed to investigate a specific hypothesis about how the system works or why certain phenomena are observed. Data-driven modeling, on the other hand, follows a more unbiased approach, with model construction informed by the computationally intensive use of data. At the same time, researchers employ models at different biological scales and at different levels of abstraction. Combining these models while validating them against experimental data increases understanding of the multiscale brain. However, a lack of interoperability, transparency, and reusability of both models and the workflows used to construct them creates barriers for the integration of models representing different biological scales and built using different modeling philosophies. We argue that the same imperatives that drive resources and policy for data, such as the FAIR (Findable, Accessible, Interoperable, Reusable) principles, also support the integration of different modeling approaches. The FAIR principles require that data be shared in formats that are Findable, Accessible, Interoperable, and Reusable. Applying these principles to models and modeling workflows, as well as the data used to constrain and validate them, would allow researchers to find, reuse, question, validate, and extend published models, regardless of whether they are implemented phenomenologically or mechanistically, as a few equations or as a multiscale, hierarchical system. To illustrate these ideas, we use a classical synaptic plasticity model, the Bienenstock-Cooper-Munro rule, as an example due to its long history, different levels of abstraction, and implementation at many scales.

Language: English

Citations

30

Ten quick tips for building FAIR workflows DOI Creative Commons
Casper de Visser, Lennart Johansson, Purva Kulkarni

et al.

PLoS Computational Biology, Journal Year: 2023, Volume and Issue: 19(9), P. e1011369 - e1011369

Published: Sept. 28, 2023

Research data is accumulating rapidly and with it the challenge of fully reproducible science. As a consequence, implementation of high-quality management of scientific data has become a global priority. The FAIR (Findable, Accessible, Interoperable and Reusable) principles provide practical guidelines for maximizing the value of research data; however, processing data using workflows (systematic executions of a series of computational tools) is equally important for good data management. The FAIR principles have recently been adapted to Research Software (FAIR4RS Principles) to promote the reproducibility and reusability of any type of research software. Here, we propose a set of 10 quick tips, drafted by experienced workflow developers, that will help researchers to apply FAIR4RS principles to workflows. The tips have been arranged according to the FAIR acronym, clarifying the purpose of each tip with respect to the FAIR4RS principles. Altogether, these tips can be seen as practical guidelines for workflow developers who aim to contribute to more sustainable computational science, aiming to positively impact the open science community.

Language: English

Citations

19

Croissant: A Metadata Format for ML-Ready Datasets DOI
Mubashara Akhtar, Omar Benjelloun, Costanza Conforti

et al.

Published: May 29, 2024

Data is a critical resource for Machine Learning (ML), yet working with data remains a key friction point. This paper introduces Croissant, a metadata format for datasets that simplifies how data is used by ML tools and frameworks. Croissant makes datasets more discoverable, portable and interoperable, thereby addressing significant challenges in ML data management and responsible AI. Croissant is already supported by several popular dataset repositories, spanning hundreds of thousands of datasets, ready to be loaded into the most...

Language: English

Citations

7

Recording provenance of workflow runs with RO-Crate DOI Creative Commons
Simone Leo, Michael R. Crusoe, Laura Rodríguez‐Navas

et al.

PLoS ONE, Journal Year: 2024, Volume and Issue: 19(9), P. e0309210 - e0309210

Published: Sept. 10, 2024

Recording the provenance of scientific computation results is key to the support of traceability, reproducibility and quality assessment of data products. Several data models have been explored to address this need, providing representations of workflow plans and their executions as well as means of packaging the resulting information for archiving and sharing. However, existing approaches tend to lack interoperable adoption across workflow management systems. In this work we present Workflow Run RO-Crate, an extension of RO-Crate (Research Object Crate) and Schema.org to capture the provenance of the execution of computational workflows at different levels of granularity and bundle together all their associated objects (inputs, outputs, code, etc.). The model is supported by a diverse, open community that runs regular meetings, discussing development, maintenance and adoption aspects. Workflow Run RO-Crate is already implemented by several workflow management systems, allowing interoperable comparisons between workflow runs from heterogeneous systems. We describe the model, its alignment to standards such as W3C PROV, and its implementation in six workflow management systems. Finally, we illustrate the application of Workflow Run RO-Crate in two use cases of machine learning in the digital image analysis domain.

Language: English

Citations

7

Prototype Biodiversity Digital Twin: Real-time bird monitoring with citizen-science data DOI Creative Commons
Otso Ovaskainen, Patrik Lauha, Julian Lopez Gordillo

et al.

Research Ideas and Outcomes, Journal Year: 2024, Volume and Issue: 10

Published: June 20, 2024

Bird populations respond rapidly to environmental change, making them excellent ecological indicators. Climate shifts advance migration, causing mismatches in breeding and resources. Understanding these changes is crucial to monitor the state of the environment. Citizen science offers vast potential to collect biodiversity data. We outline a project that combines citizen science with AI-based bird sound classification. The mobile app records bird vocalisations that are classified by AI and stored for re-analysis. Additionally, it shows a shared observation board that visualises collective classifications. By merging long-term monitoring and modern citizen science, this project harnesses the strength of both approaches for comprehensive bird population monitoring.

Language: English

Citations

6

Sharing Begins at Home: How Continuous and Ubiquitous FAIRness Can Enhance Research Productivity and Data Reuse DOI Creative Commons
William P. Dempsey, Ian Foster, Scott E. Fraser

et al.

Harvard Data Science Review, Journal Year: 2022, Volume and Issue: unknown

Published: July 27, 2022

The broad sharing of research data is widely viewed as critical for the speed, quality, accessibility, and integrity of science. Despite increasing efforts to encourage data sharing, both the quality of shared data and the frequency of data reuse remain stubbornly low. We argue here that a significant reason for this unfortunate state of affairs is that the organization of research results in the findable, accessible, interoperable, and reusable (FAIR) form required for reuse is too often deferred to the end of a research project when preparing publications, by which time essential details are no longer accessible. Thus, we propose an approach to research informatics in which FAIR principles are applied...

Language: English

Citations

23