
Published: Dec. 17, 2021
Using the public data set CIFAR-10.
Language: English
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Journal year: 2022, Number: unknown
Published: Jan. 1, 2022
Leilei Gan, Jiwei Li, Tianwei Zhang, Xiaoya Li, Yuxian Meng, Fei Wu, Yi Yang, Shangwei Guo, Chun Fan. Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. 2022.
Language: English
Cited by: 26
Findings of the Association for Computational Linguistics: ACL 2022, Journal year: 2022, Number: unknown, pp. 1720 - 1732
Published: Jan. 1, 2022
Recently, there has been a trend to investigate the factual knowledge captured by Pre-trained Language Models (PLMs). Many works show the PLMs' ability to fill in the missing factual words in cloze-style prompts such as "Dante was born in [MASK]." However, it is still a mystery how PLMs generate the results correctly: by relying on effective clues or on shortcut patterns? We try to answer this question with a causal-inspired analysis that quantitatively measures and evaluates the word-level patterns that PLMs depend on to generate the missing words. We check the words that have three typical associations with the missing words: knowledge-dependent, positionally close, and highly co-occurred. Our analysis shows: (1) PLMs generate the missing factual words more by the positionally close and highly co-occurred words than by the knowledge-dependent words; (2) the dependence on the knowledge-dependent words is more effective than the dependence on the positionally close and highly co-occurred words. Accordingly, we conclude that PLMs capture factual knowledge ineffectively because they depend on inadequate associations.
Language: English
Cited by: 24
IEEE Access, Journal year: 2024, Number: 12, pp. 27225 - 27236
Published: Jan. 1, 2024
In the era of digital communication, social media platforms have experienced exponential growth, becoming the primary channels for information exchange. However, this surge has also amplified the rapid spread of hate speech, prompting extensive research efforts for effective mitigation. These efforts have prominently featured advanced natural language processing techniques, particularly emphasizing deep learning methods that have shown promising outcomes. This article presents a novel approach to address this pressing issue, combining a comprehensive dataset of 18 sources. It includes 0.45 million comments sourced from various social media platforms, spanning different time frames. Two models were utilized to address the diversity in the data and to leverage the distinct strengths found within deep learning frameworks: a CNN and a BiLSTM with an attention mechanism. These models were tailored to handle specific subsets of the data, allowing for a more targeted approach. The unique outputs of both models were then fused into a unified model. This methodology outperformed recent models, showcasing enhanced generalization capabilities even when tested on the largest and most diverse dataset. Our model achieved an accuracy of 89%, while maintaining a high precision of 0.88 and a recall of 0.91.
Language: English
Cited by: 4
Information Fusion, Journal year: 2025, Number: unknown, pp. 103000 - 103000
Published: Feb. 1, 2025
Language: English
Cited by: 0
Automation in Construction, Journal year: 2025, Number: 176, pp. 106237 - 106237
Published: May 16, 2025
Language: English
Cited by: 0
PeerJ Computer Science, Journal year: 2025, Number: 11, pp. e2911 - e2911
Published: May 30, 2025
The proliferation of user-generated content on social networking sites has intensified the challenge of accurately and efficiently detecting inflammatory and discriminatory speech at scale. Traditional manual moderation methods are impractical due to the sheer volume and complexity of online discourse, necessitating automated solutions. However, existing deep learning models for hate speech detection typically function as black-box systems, providing binary classifications without interpretable insights into their decision-making processes. This opacity significantly limits their practical utility, particularly in nuanced content moderation tasks. To address this challenge, our research explores leveraging the advanced reasoning and knowledge integration capabilities of state-of-the-art language models, specifically Mistral-7B, to develop transparent hate speech detection systems. We introduce a novel framework wherein large language models (LLMs) generate explicit rationales by identifying and analyzing critical textual features indicative of hate speech. These rationales are subsequently integrated into specialized classifiers designed to perform explainable content moderation. We rigorously evaluate our methodology on multiple benchmark English-language social media datasets. Results demonstrate that incorporating LLM-generated explanations enhances both the interpretability and the accuracy of hate speech detection. This approach not only identifies problematic content effectively but also clearly articulates the analytical rationale behind each decision, fulfilling the demand for transparency.
Language: English
Cited by: 0
Proceedings of the AAAI Conference on Artificial Intelligence, Journal year: 2024, Number: 38(16), pp. 18243 - 18251
Published: Mar. 24, 2024
Selective rationalization can be regarded as a straightforward self-explaining approach for enhancing model explainability in natural language processing tasks. It aims to provide explanations that are more accessible and understandable to non-technical users by first selecting subsets of input texts as rationales and then predicting based on the chosen subsets. However, existing methods that follow this select-then-predict framework may suffer from the rationalization degeneration problem, resulting in sub-optimal or unsatisfactory rationales that do not align with human judgments. This problem may further lead to rationalization failure, resulting in meaningless rationales that ultimately undermine people's trust in the rationalization model. To address these challenges, we propose a Guidance-based Rationalization method (G-RAT) that effectively improves robustness against failure situations and the quality of rationales by using a guidance module to regularize selections and distributions. Experimental results on two synthetic settings prove that our method is robust to the rationalization degeneration and failure problems, while the results on two real datasets show its effectiveness in providing rationales in line with human judgments. The source code is available at https://github.com/shuaibo919/g-rat.
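The select-then-predict framework mentioned in this abstract can be illustrated with a toy sketch. This is not the G-RAT method or its code; the function names `select_rationale` and `predict`, the per-token scores, and the word list are hypothetical and chosen only to show that the predictor sees nothing but the selected subset.

```python
def select_rationale(tokens, scores, k):
    """Selector: keep the k highest-scoring tokens, preserving input order."""
    top = sorted(range(len(tokens)), key=lambda i: scores[i], reverse=True)[:k]
    return [tokens[i] for i in sorted(top)]


def predict(rationale, positive_words):
    """Predictor: sees only the rationale, never the full input."""
    hits = sum(1 for t in rationale if t in positive_words)
    return "positive" if hits * 2 > len(rationale) else "negative"


# Toy input with hypothetical importance scores (assumed, not learned).
tokens = ["the", "food", "was", "great", "and", "fresh"]
scores = [0.1, 0.4, 0.1, 0.9, 0.1, 0.8]

rationale = select_rationale(tokens, scores, k=2)
label = predict(rationale, positive_words={"great", "fresh"})
print(rationale, label)
```

If the selector picked uninformative tokens such as "the" and "was", the predictor would have nothing useful to work with, which is the kind of degenerate selection the abstract describes.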
Language: English
Cited by: 1
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Journal year: 2022, Number: unknown
Published: Jan. 1, 2022
Zijie Zeng, Xinyu Li, Dragan Gasevic, Guanliang Chen. Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. 2022.
Language: English
Cited by: 4
Frontiers in Bioinformatics, Journal year: 2022, Number: 2
Published: Apr. 26, 2022
Large-scale microbiome studies investigating disease-inducing microbial roles base their findings on differences between microbial count data in contrasting environments (e.g., stool samples from cases and controls). These survey studies are often impeded by small sample sizes and database bias. Combining data from multiple studies results in obvious batch effects, even when DNA preparation and sequencing methods are identical. Relatedly, predictive models trained on one dataset often do not generalize to outside datasets. In this study, we address these limitations by applying word embedding algorithms (GloVe) and PCA transformation to ASV data from the American Gut Project and generating translation matrices that can be applied to any 16S rRNA V4 region gut microbiome study. Because these approaches contextualize microbial occurrences in a larger dataset while reducing the dimensionality of the feature space, they can improve the generalization of models that predict host phenotype from the associated microbiota. The GMEmbeddings R package contains GloVe and PCA transformation matrices at 50, 100 and 250 dimensions, each learned using ∼15,000 samples from the American Gut Project. It currently supports the alignment, matching, and matrix multiplication that allow users to transform their data into these embedding spaces. We show how to correlate the properties of the new embedding space with KEGG functional pathways for biological interpretation of results. Lastly, we provide benchmarking on six gut microbiome datasets describing three phenotypes to demonstrate the ability of embedding-based classifiers to generalize to independent datasets. Future iterations of GMEmbeddings will include other biological systems. Available at: https://github.com/MaudeDavidLab/GMEmbeddings.
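The matching and matrix multiplication steps named in this abstract can be sketched as follows. This is a minimal illustration in Python, not the GMEmbeddings R package API; the function name `embed_samples`, the toy ASV identifiers, and the 2-dimensional embedding values are all hypothetical.

```python
def embed_samples(counts, asv_ids, embedding):
    """Project per-sample ASV counts into an embedding space.

    counts:    list of rows, one per sample, one column per ASV in asv_ids
    asv_ids:   column labels of `counts`
    embedding: dict mapping ASV id -> embedding vector (list of floats)
    ASVs absent from `embedding` are skipped (the "matching" step).
    """
    matched = [(j, embedding[a]) for j, a in enumerate(asv_ids) if a in embedding]
    if not matched:
        raise ValueError("no ASVs matched the embedding vocabulary")
    dim = len(matched[0][1])
    result = []
    for row in counts:
        vec = [0.0] * dim
        # Matrix multiplication restricted to matched columns:
        # sample vector = sum over ASVs of count * ASV embedding.
        for j, emb in matched:
            for k in range(dim):
                vec[k] += row[j] * emb[k]
        result.append(vec)
    return result


# Toy data (hypothetical values, for illustration only).
toy_embedding = {"asv_a": [1.0, 0.0], "asv_b": [0.0, 2.0]}
toy_ids = ["asv_a", "asv_b", "asv_unmatched"]
toy_counts = [[3, 1, 5], [0, 2, 7]]

embedded = embed_samples(toy_counts, toy_ids, toy_embedding)
print(embedded)
```

Each sample ends up with one fixed-length vector regardless of how many ASVs the original study reported, which is what lets a classifier trained on one dataset be applied to another.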
Language: English
Cited by: 3
2022 International Conference on Augmented Intelligence and Sustainable Systems (ICAISS), Journal year: 2022, Number: unknown, pp. 408 - 411
Published: Nov. 24, 2022
Natural language processing (NLP) studies the techniques and procedures that allow a machine to converse using human language. Recent advances in artificial intelligence and communication technology have considerably enhanced natural language processing applications. Due to improvements in deep learning, virtually every aspect of artificial intelligence, including natural language processing, has made substantial progress. Deep learning methods use many layers of neurons to construct a neural network. Recurrent Neural Networks and their variants, like long short-term memory (LSTM) and bidirectional LSTM, are some of the most popular deep learning techniques. This article reviews the deep learning techniques employed in NLP-specific approaches in the last few years. We also studied the issues faced by researchers while trying to apply deep learning techniques to Asian languages. We highlight the critical research work carried out recently for Indian and other Asian languages. In addition, we discuss the fundamental linguistic challenges and suggest future scopes of research on this topic.
Language: English
Cited by: 3