Computers in Biology and Medicine, Год журнала: 2023, Номер 164, С. 107288 - 107288
Опубликована: Авг. 1, 2023
Язык: Английский
Computers in Biology and Medicine, Год журнала: 2023, Номер 164, С. 107288 - 107288
Опубликована: Авг. 1, 2023
Язык: Английский
Genomics Proteomics & Bioinformatics, Год журнала: 2022, Номер 21(4), С. 678 - 694
Опубликована: Сен. 9, 2022
As the most pervasive epigenetic marker present on mRNAs and long non-coding RNAs (lncRNAs), N
Язык: Английский
Процитировано
29Briefings in Bioinformatics, Год журнала: 2023, Номер 24(3)
Опубликована: Май 1, 2023
A-to-I editing is the most prevalent RNA event, which refers to change of adenosine (A) bases inosine (I) in double-stranded RNAs. Several studies have revealed that can regulate cellular processes and associated with various human diseases. Therefore, accurate identification sites crucial for understanding RNA-level (i.e. transcriptional) modifications their potential roles molecular functions. To date, computational approaches site been developed; however, performance still unsatisfactory needs further improvement. In this study, we developed a novel stacked-ensemble learning model, ATTIC (A-To-I ediTing predICtor), accurately identify across three species, including Homo sapiens, Mus musculus Drosophila melanogaster. We first comprehensively evaluated 37 sequence-derived features combined 14 popular machine algorithms. Then, selected optimal base models build series stacked ensemble models. The final framework was based on improved by feature selection strategy specific species. Extensive cross-validation independent tests illustrate outperforms state-of-the-art tools predicting sites. also web server ATTIC, publicly available at http://web.unimelb-bioinfortools.cloud.edu.au/ATTIC/. anticipate be utilized as useful tool accelerate events help characterize post-transcriptional regulation.
Язык: Английский
Процитировано
17Epigenetics, Год журнала: 2023, Номер 18(1)
Опубликована: Июнь 18, 2023
Epitranscriptomic modifications have recently emerged into the spotlight of researchers due to their vast regulatory effects on gene expression and thereby cellular physiology pathophysiology. N6,2'-O-dimethyladenosine (m6Am) is one most prevalent chemical marks RNA dynamically regulated by writers (PCIF1, METTL4) erasers (FTO). The presence or absence m6Am in affects mRNA stability, regulates transcription, modulates pre-mRNA splicing. Nevertheless, its functions heart are poorly known. This review summarizes current knowledge gaps about modification regulators cardiac biology. It also points out technical challenges lists currently available techniques measure m6Am. A better understanding epitranscriptomic needed improve our molecular regulations which may lead novel cardioprotective strategies.
Язык: Английский
Процитировано
16BMC Bioinformatics, Год журнала: 2024, Номер 25(1)
Опубликована: Янв. 17, 2024
Abstract Background Epi-transcriptome regulation through post-transcriptional RNA modifications is essential for all types. Precise recognition of critical understanding their functions and regulatory mechanisms. However, wet experimental methods are often costly time-consuming, limiting wide range applications. Therefore, recent research has focused on developing computational methods, particularly deep learning (DL). Bidirectional long short-term memory (BiLSTM), convolutional neural network (CNN), the transformer have demonstrated achievements in modification site prediction. BiLSTM cannot achieve parallel computation, leading to a training time, CNN learn dependencies distance sequence, Transformer lacks information interaction with sequences at different scales. This insight underscores necessity continued development natural language processing (NLP) DL devise an enhanced prediction framework that can effectively address challenges presented. Results study presents multi-scale self- cross-attention (MSCAN) identify methylation using NLP way. Experiment results twelve sites (m 6 A, m 1 5 C, U, Am, 7 G, Ψ, I, Cm, Gm, Um) reveal area under receiver operating characteristic MSCAN obtains respectively 98.34%, 85.41%, 97.29%, 96.74%, 99.04%, 79.94%, 76.22%, 65.69%, 92.92%, 92.03%, 95.77%, 89.66%, which better than state-of-the-art model. indicates model strong generalization capabilities. Furthermore, reveals association among types from perspective. A user-friendly web server predicting widely occurring human available http://47.242.23.141/MSCAN/index.php . Conclusions predictor been developed binary classification predict sites.
Язык: Английский
Процитировано
7Nucleic Acids Research, Год журнала: 2022, Номер 50(18), С. 10290 - 10310
Опубликована: Сен. 26, 2022
Abstract As the most pervasive epigenetic mark present on mRNA and lncRNA, N6-methyladenosine (m6A) RNA methylation regulates all stages of life in various biological processes disease mechanisms. Computational methods for deciphering modification have achieved great success recent years; nevertheless, their potential remains underexploited. One reason this is that existing models usually consider only sequence transcripts, ignoring regions (or geography) transcripts such as 3′UTR intron, where forms functions. Here, we developed three simple yet powerful encoding schemes to capture submolecular geographic information RNA, which largely independent from sequences. We show m6A prediction based alone can achieve comparable performances classic sequence-based methods. Importantly, substantially enhances accuracy models, enables isoform- tissue-specific sites, improves signal detection direct sequencing data. The exhibited strong interpretability, are applicable not but also N1-methyladenosine (m1A), serve a general effective complement widely used deep learning applications concerning transcripts.
Язык: Английский
Процитировано
23Briefings in Bioinformatics, Год журнала: 2023, Номер 24(3)
Опубликована: Май 1, 2023
Abstract The expanding field of epitranscriptomics might rival the epigenome in diversity biological processes impacted. In recent years, development new high-throughput experimental and computational techniques has been a key driving force discovering properties RNA modifications. Machine learning applications, such as for classification, clustering or de novo identification, have critical these advances. Nonetheless, various challenges remain before full potential machine can be leveraged. this review, we provide comprehensive survey methods to detect modifications using diverse input data sources. We describe strategies train test encode interpret features that are relevant epitranscriptomics. Finally, identify some current open questions about modification analysis, including ambiguity predicting transcript isoforms single nucleotides, lack complete ground truth sets believe review will inspire benefit rapidly developing addressing limitations through effective use learning.
Язык: Английский
Процитировано
15IEEE/ACM Transactions on Computational Biology and Bioinformatics, Год журнала: 2023, Номер 20(3), С. 2177 - 2189
Опубликована: Янв. 17, 2023
Recent work on language models has resulted in state-of-the-art performance various tasks. Among these, Bidirectional Encoder Representations from Transformers (BERT) focused contextualizing word embeddings to extract context and semantics of the words. On other hand, post-transcriptional 2'-O-methylation (Nm) RNA modification is important cellular tasks related a number diseases. The existing high-throughput experimental techniques take longer time detect these modifications, costly exploring functional processes. Here, deeply understand associated biological processes faster, we come up with an efficient method Bert2Ome infer sites sequences. combines BERT-based model convolutional neural networks (CNN) relationship between sequence content. Unlike methods proposed so far, assumes each given as text focuses improving prediction by integrating pretrained deep learning-based BERT. Additionally, our transformer-based approach could across multiple species. According 5-fold cross-validation, human mouse accuracies were 99.15% 94.35% respectively. Similarly, ROC AUC scores 0.99, 0.94 for same Detailed results show that reduces consumed experiments outperforms approaches different datasets species over metrics. learning such 2D CNNs are more promising BERT attributes than conventional machine methods.
Язык: Английский
Процитировано
14Computers in Biology and Medicine, Год журнала: 2023, Номер 164, С. 107238 - 107238
Опубликована: Июль 8, 2023
Язык: Английский
Процитировано
13Biosensors and Bioelectronics, Год журнала: 2025, Номер unknown, С. 117198 - 117198
Опубликована: Янв. 1, 2025
Язык: Английский
Процитировано
0Methods, Год журнала: 2022, Номер 203, С. 399 - 421
Опубликована: Март 3, 2022
Язык: Английский
Процитировано
20