Semisynthetic simulation for microbiome data analysis DOI Creative Commons
Kris Sankaran, Saritha Kodikara, Jingyi Jessica Li

et al.

Briefings in Bioinformatics, Journal Year: 2024, Volume and Issue: 26(1)

Published: Nov. 22, 2024

Abstract High-throughput sequencing data lie at the heart of modern microbiome research. Effective analysis these requires careful preprocessing, modeling, and interpretation to detect subtle signals avoid spurious associations. In this review, we discuss how simulation can serve as a sandbox test candidate approaches, creating setting that mimics real while providing ground truth. This is particularly valuable for power analysis, methods benchmarking, reliability analysis. We explain probability, multivariate regression concepts behind simulators different implementations make trade-offs between generality, faithfulness, controllability. Recognizing all only approximate reality, review evaluate accurately they reflect key properties. also present case studies demonstrating value in differential abundance testing, dimensionality reduction, network integration. Code examples available an online tutorial (https://go.wisc.edu/8994yz) be easily adapted new problem settings.

Language: Английский

Evaluation of cell-cell interaction methods by integrating single-cell RNA sequencing data with spatial information DOI Creative Commons
Zhaoyang Liu, Dongqing Sun, Chenfei Wang

et al.

Genome biology, Journal Year: 2022, Volume and Issue: 23(1)

Published: Oct. 17, 2022

Abstract Background Cell-cell interactions are important for information exchange between different cells, which the fundamental basis of many biological processes. Recent advances in single-cell RNA sequencing (scRNA-seq) enable characterization cell-cell using computational methods. However, it is hard to evaluate these methods since no ground truth provided. Spatial transcriptomics (ST) data profiles relative position cells. We propose that spatial distance suggests interaction tendency cell types, thus could be used evaluating tools. Results benchmark 16 by integrating scRNA-seq with ST data. characterize into short-range and long-range distributions ligands receptors. Based on this classification, we define enrichment score apply an evaluation workflow tools 15 simulated 5 real datasets. also compare consistency results from single commonly identified interactions. Our suggest predicted highly dynamic, statistical-based show overall better performance than network-based ST-based Conclusions study presents a comprehensive scRNA-seq. CellChat, CellPhoneDB, NicheNet, ICELLNET other terms software scalability. recommend at least two ensure accuracy have packaged detailed documentation GitHub ( https://github.com/wanglabtongji/CCI ).

Language: Английский

Citations

94

Synthetic data generation methods in healthcare: A review on open-source tools and methods DOI Creative Commons
Vasileios C. Pezoulas, Dimitrios I. Zaridis, Eugenia Mylona

et al.

Computational and Structural Biotechnology Journal, Journal Year: 2024, Volume and Issue: 23, P. 2892 - 2910

Published: July 9, 2024

Synthetic data generation has emerged as a promising solution to overcome the challenges which are posed by scarcity and privacy concerns, well as, address need for training artificial intelligence (AI) algorithms on unbiased with sufficient sample size statistical power. Our review explores application efficacy of synthetic methods in healthcare considering diversity medical data. To this end, we systematically searched PubMed Scopus databases great focus tabular, imaging, radiomics, time-series, omics Studies involving multi-modal were also explored. The type method used process was identified each study categorized into statistical, probabilistic, machine learning, deep learning. Emphasis given programming languages implementation method. evaluation revealed that majority studies utilize generators to: (i) reduce cost time required clinical trials rare diseases conditions, (ii) enhance predictive power AI models personalized medicine, (iii) ensure delivery fair treatment recommendations across diverse patient populations, (iv) enable researchers access high-quality, representative multimodal datasets without exposing sensitive information, among others. We underline wide use learning based 72.6 % included studies, 75.3 being implemented Python. A thorough documentation open-source repositories is finally provided accelerate research field.Graphical abstract

Language: Английский

Citations

26

Recent advances in mass spectrometry-based computational metabolomics DOI Creative Commons
Timothy M. D. Ebbels, Justin J. J. van der Hooft, Haley Chatelaine

et al.

Current Opinion in Chemical Biology, Journal Year: 2023, Volume and Issue: 74, P. 102288 - 102288

Published: March 24, 2023

The computational metabolomics field brings together computer scientists, bioinformaticians, chemists, clinicians, and biologists to maximize the impact of across a wide array scientific medical disciplines. continues expand as modern instrumentation produces datasets with increasing complexity, resolution, sensitivity. These must be processed, annotated, modeled, interpreted enable biological insight. Techniques for visualization, integration (within or between omics), interpretation data have evolved along innovation in databases knowledge resources required aid understanding. In this review, we highlight recent advances reflect on opportunities innovations response most pressing challenges. This review was compiled from discussions 2022 Dagstuhl seminar entitled "Computational Metabolomics: From Spectra Knowledge".

Language: Английский

Citations

34

PathIntegrate: Multivariate modelling approaches for pathway-based multi-omics data integration DOI Creative Commons
Cecilia Wieder, Juliette Cooke, Clément Frainay

et al.

PLoS Computational Biology, Journal Year: 2024, Volume and Issue: 20(3), P. e1011814 - e1011814

Published: March 25, 2024

As terabytes of multi-omics data are being generated, there is an ever-increasing need for methods facilitating the integration and interpretation such data. Current typically output lists, clusters, or subnetworks molecules related to outcome. Even with expert domain knowledge, discerning biological processes involved a time-consuming activity. Here we propose PathIntegrate, method integrating datasets based on pathways, designed exploit knowledge systems thus provide interpretable models studies. PathIntegrate employs single-sample pathway analysis transform from molecular pathway-level, applies predictive single-view multi-view model integrate Model outputs include pathways ranked by their contribution outcome prediction, each omics layer, importance molecule in pathway. Using semi-synthetic demonstrate benefit grouping into detect signals low signal-to-noise scenarios, as well ability precisely identify important at effect sizes. Finally, using COPD COVID-19 showcase how enables convenient complex high-dimensional datasets. available open-source Python package.

Language: Английский

Citations

9

Current approaches and outstanding challenges of functional annotation of metabolites: a comprehensive review DOI Creative Commons
Quang‐Huy Nguyen, Ha Nam Nguyen, Edwin C. Oh

et al.

Briefings in Bioinformatics, Journal Year: 2024, Volume and Issue: 25(6)

Published: Sept. 23, 2024

Metabolite profiling is a powerful approach for the clinical diagnosis of complex diseases, ranging from cardiometabolic cancer, and cognitive disorders to respiratory pathologies conditions that involve dysregulated metabolism. Because importance systems-level interpretation, many methods have been developed identify biologically significant pathways using metabolomics data. In this review, we first describe complete workflow (sample preparation, data acquisition, pre-processing, downstream analysis, etc.). We then comprehensively review 24 approaches capable performing functional including those combine with other types investigate disease-relevant changes at multiple omics layers. discuss their availability, implementation, capability pre-processing quality control, supported types, embedded databases, pathway analysis methodologies, integration techniques. also provide rating evaluation each software, focusing on key technique, software accessibility, documentation, user-friendliness. Following our guideline, life scientists can easily choose suitable method depending rating, available data, input format, category. More importantly, highlight outstanding challenges potential solutions need be addressed by future research. To further assist users in executing reviewed methods, wrappers packages https://github.com/tinnlab/metabolite-pathway-review-docker.

Language: Английский

Citations

4

Recent advances in high-throughput biofluid metabotyping by direct infusion and ambient ionization mass spectrometry DOI Creative Commons
Vera Plekhova, Kimberly De Windt, Margot De Spiegeleer

et al.

TrAC Trends in Analytical Chemistry, Journal Year: 2023, Volume and Issue: 168, P. 117287 - 117287

Published: Sept. 17, 2023

Biofluid metabolomics is a popular tool for biomarker discovery to decipher disease-, genetics-, and exposure-related metabolic alterations an essential component understanding integrated metabolite-level responses. The conventional workflow in mass spectrometry (MS) involves hyphenation with chromatographic separation represents valuable analytical both research clinical settings. However, complexity, relatively low throughput, high costs often hinder implementation when routine, large-scale analysis sample turnover desired, such as point-of-care applications. In this context, direct infusion (DI) ambient ionization (AI) MS, where samples can be analysed directly, rapidly, minimal handling, offer attractive alternatives hyphenated methods. Recent technological advances have addressed the typical issues of AIMS DIMS methods regarding metabolome coverage, reproducibility, repeatability encountered during their early development. systematic review, we discussed recent (2017–2023) original publications on DIMS- AIMS-based biofluid considering reported biomedical implementations, assets workflow, data handling coherence platforms.

Language: Английский

Citations

10

A computational framework for detecting inter-tissue gene-expression coordination changes with aging DOI Creative Commons

Shaked Briller,

Gil Ben David,

Yam Amir

et al.

Scientific Reports, Journal Year: 2025, Volume and Issue: 15(1)

Published: March 31, 2025

Aging is a complex and systematic biological process that involves multiple genes pathways across different tissues. While existing studies focus on tissue-specific aging factors, the inter-tissue interplay between molecular during remains insufficiently explored. To bridge this gap, we propose novel computational framework to identify effect of coordinated patterns gene-expression Our includes (1) an adjusted multi-tissue weighted gene co-expression network analysis, (2) differential connectivity analysis age groups (3) machine learning models, XGBoost Random Forest (RF) fed by expression levels lower-dimensional pathway score space, unique key for classifying aging. We applied our approach three representative tissues: Adipose-Subcutaneous, Muscle-Skeletal Brain-Cortex. The RF model demonstrated best performance in predicting group (AUC < 88%) highlighting involved coordination processes also identified involvement lipid metabolism, immune system, cell communication detected distinct manifested proposed highlights importance underlying provides valuable insights into mechanisms which can further assist development therapeutic strategies promoting healthy

Language: Английский

Citations

0

Phoenics: a novel statistical approach for longitudinal metabolomic pathway analysis DOI Creative Commons

Camille Guilmineau,

Marie Tremblay‐Franco, Nathalie Villa‐Vialaneix

et al.

BMC Bioinformatics, Journal Year: 2025, Volume and Issue: 26(1)

Published: April 16, 2025

Language: Английский

Citations

0

The application of multi-omics in the respiratory microbiome: Progresses, challenges and promises DOI Creative Commons

Jingyuan Gao,

Xinzhu Yi, Zhang Wang

et al.

Computational and Structural Biotechnology Journal, Journal Year: 2023, Volume and Issue: 21, P. 4933 - 4943

Published: Jan. 1, 2023

The study of the respiratory microbiome has entered a multi-omic era. Through integrating different omic data types such as metagenome, metatranscriptome, metaproteome, metabolome, culturome and radiome surveyed from specimens, holistic insights can be gained on lung its interaction with host immunity inflammation in diseases. power multi-omics have moved field forward associative assessment alterations to causative understanding pathogenesis chronic, acute other However, application remains unique challenges sample processing, integration, downstream validation. In this review, we first introduce applicable studying microbiome. We next describe approaches for focusing dimensionality reduction, association prediction. then summarize progresses finally discuss current share our thoughts future promises field.

Language: Английский

Citations

7

The metabolic role of vitamin D in children’s neurodevelopment: a network study DOI Creative Commons
Margherita De Marzio, Jessica Lasky‐Su, Su H. Chu

et al.

Scientific Reports, Journal Year: 2024, Volume and Issue: 14(1)

Published: July 23, 2024

Neurodevelopmental disorders are rapidly increasing in prevalence and have been linked to various environmental risk factors. Mounting evidence suggests a potential role of vitamin D child neurodevelopment, though the causal mechanisms remain largely unknown. Here, we investigate how deficiency affects children's communication development, particularly relation Autism Spectrum Disorder (ASD). We do so by developing an integrative network approach that combines metabolomic profiles, clinical traits, neurodevelopmental data from pediatric cohort. Our results show low levels associated with changes metabolic networks tryptophan, linoleic, fatty acid metabolism. These correlate distinct ASD-related phenotypes, including delayed skills respiratory dysfunctions. Additionally, our analysis kynurenine serotonin sub-pathways may mediate effect on early life development. Altogether, findings provide metabolome-wide insights into as therapeutic option for ASD other disorders.

Language: Английский

Citations

2