Integer programming framework for pangenome-based genome inference DOI Creative Commons
Ghanshyam Chandra,

Md Helal Hossen,

Stephan Scholz

и другие.

Опубликована: Окт. 29, 2024

Abstract Affordable genotyping methods are essential in genomics. Commonly used primarily support single nucleotide variants and short indels but neglect structural variants. Additionally, accuracy of read alignments to a reference genome is unreliable highly polymorphic repetitive regions, further impacting performance. Recent works highlight the advantage haplotype-resolved pangenome graphs addressing these challenges. Building on developments, we propose rigorous alignment-free framework. Our formulation seeks path through graph that maximizes matches between substrings sequencing reads (e.g., k -mers) while minimizing recombination events (haplotype switches) along path. We prove this problem NP-Hard develop efficient integer-programming solutions. benchmarked algorithm using downsampled short-read datasets from homozygous human cell lines with coverage ranging 0.1× 10×. accurately estimates complete major histocompatibility complex (MHC) haplotype sequences small edit distances ground-truth sequences, providing significant over existing low-coverage inputs. Although our designed for haploid samples, discuss future extensions diploid samples. Implementation https://github.com/at-cg/PHI

Язык: Английский

The NHGRI-EBI GWAS Catalog: standards for reusability, sustainability and diversity DOI Creative Commons
María Cerezo, Elliot Sollis, Yue Ji

и другие.

Nucleic Acids Research, Год журнала: 2024, Номер 53(D1), С. D998 - D1005

Опубликована: Ноя. 12, 2024

The NHGRI-EBI GWAS Catalog serves as a vital resource for the genetic research community, providing access to most comprehensive database of human results. Currently, it contains close 7 000 publications >15 traits, from which more than 625 lead associations have been curated. Additionally, 85 full genome-wide summary statistics datasets-containing association data all variants in analysis-are available downstream analyses such meta-analysis, fine-mapping, Mendelian randomisation or development polygenic risk scores. As centralised repository results, sets and implements standards submission harmonisation, encourages use consistent descriptors samples methodologies. We share processes vocabulary with PGS Catalog, improving interoperability growing user group. Here, we describe latest changes content, improvements our interface, implementation GWAS-SSF standard format statistics. address challenges handling rapid increase large-scale molecular quantitative trait need sensitivity population cohort while maintaining reusability.

Язык: Английский

Процитировано

13

Genetics and Population Analysis DOI

Prachi Balyan,

Nismabi A Nisamudheen,

Jan Zainab

и другие.

Elsevier eBooks, Год журнала: 2025, Номер unknown

Опубликована: Янв. 1, 2025

Язык: Английский

Процитировано

0

Genetic Susceptibility of Type 2 Diabetes and Metabolic Syndrome DOI Creative Commons
Vladimir Ercegović, Monika Džimbeg, Andrea Gelemanović

и другие.

Diabetology, Год журнала: 2025, Номер 6(2), С. 11 - 11

Опубликована: Фев. 6, 2025

Type 2 diabetes (T2D) and metabolic syndrome (MetS) represent complex, multifactorial conditions that pose significant challenges to public health healthcare costs worldwide. These two share common risk factors such as obesity, dyslipidemia, hypertension and, a consequence, are frequently jointly diagnosed in an individual. More specifically, it is estimated around 85% of T2D patients also have MetS, while with MetS five times likely develop T2D. While lifestyle environmental factors, poor diet physical inactivity, play crucial role, genetic susceptibility has substantial influence on the overall risk. Recent advancements genome-wide association studies (GWAS) had major impact identifying numerous loci associated these conditions. This narrative review summarizes key findings from studies, highlighting pathways their clinical implications. The objective this provide comprehensive understanding known underpinnings inform future research open potential therapeutic preventive strategies.

Язык: Английский

Процитировано

0

The Genomics Revolution in Nonmodel Species: Predictions vs. Reality for Salmonids DOI Creative Commons
Samuel A. May, Samuel W. Rosenbaum, Devon E. Pearse

и другие.

Molecular Ecology, Год журнала: 2025, Номер unknown

Опубликована: Апрель 18, 2025

ABSTRACT The increasing feasibility of whole‐genome sequencing has been highly anticipated, promising to transform our understanding the biology nonmodel species. Notably, dramatic cost reductions beginning around 2007 with advent high‐throughput inspired publications heralding ‘genomics revolution’, predictions about its future impacts. Although such served as useful guideposts, value is added when statements are evaluated benefit hindsight. Here, we review 10 key made early in genomics revolution, highlighting those realised while identifying challenges limiting others. We focus on concerning applied aspects and examples involving salmonid species which, due their socioeconomic ecological significance, have frontrunners applications Predicted outcomes included enhanced analytical power, deeper insights into genetic basis phenotype fitness variation, disease management breeding program advancements. many materialised, several expectations remain unmet technological, knowledge barriers. Additionally, largely unforeseen advancements, including identification applicability large‐effect loci, close‐kin mark–recapture, environmental DNA gene editing under‐anticipated value. Finally, emerging innovations artificial intelligence bioinformatics offer new directions. This retrospective evaluation impacts genomic revolution offers for

Язык: Английский

Процитировано

0

The NHGRI-EBI GWAS Catalog: standards for reusability, sustainability and diversity DOI Creative Commons
María Cerezo, Elliot Sollis, Yue Ji

и другие.

bioRxiv (Cold Spring Harbor Laboratory), Год журнала: 2024, Номер unknown

Опубликована: Окт. 23, 2024

Abstract The NHGRI-EBI GWAS Catalog serves as a vital resource for the genetic research community, providing access to most comprehensive database of human results. Currently, it contains close 7,000 publications more than 15,000 traits, from which 625,000 lead associations have been curated. Additionally, 85,000 full genome-wide summary statistics datasets - containing association data all variants in analysis are available downstream analyses such meta-analysis, fine-mapping, Mendelian randomisation or development polygenic risk scores. As centralised repository results, sets and implements standards submission harmonisation, encourages use consistent descriptors samples methodologies. We share processes vocabulary with PGS Catalog, improving interoperability growing user group. Here, we describe latest changes content, improvements our interface, implementation GWAS-SSF standard format statistics. address challenges handling rapid increase large-scale molecular quantitative trait need sensitivity population cohort while maintaining reusability.

Язык: Английский

Процитировано

1

Integer programming framework for pangenome-based genome inference DOI Creative Commons
Ghanshyam Chandra,

Md Helal Hossen,

Stephan Scholz

и другие.

Опубликована: Окт. 29, 2024

Abstract Affordable genotyping methods are essential in genomics. Commonly used primarily support single nucleotide variants and short indels but neglect structural variants. Additionally, accuracy of read alignments to a reference genome is unreliable highly polymorphic repetitive regions, further impacting performance. Recent works highlight the advantage haplotype-resolved pangenome graphs addressing these challenges. Building on developments, we propose rigorous alignment-free framework. Our formulation seeks path through graph that maximizes matches between substrings sequencing reads (e.g., k -mers) while minimizing recombination events (haplotype switches) along path. We prove this problem NP-Hard develop efficient integer-programming solutions. benchmarked algorithm using downsampled short-read datasets from homozygous human cell lines with coverage ranging 0.1× 10×. accurately estimates complete major histocompatibility complex (MHC) haplotype sequences small edit distances ground-truth sequences, providing significant over existing low-coverage inputs. Although our designed for haploid samples, discuss future extensions diploid samples. Implementation https://github.com/at-cg/PHI

Язык: Английский

Процитировано

0