
Published: Dec. 17, 2021
Using the public data set Cifar-10.
Language: Английский
Published: Dec. 17, 2021
Using the public data set Cifar-10.
Language: Английский
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Journal Year: 2022, Volume and Issue: unknown
Published: Jan. 1, 2022
Leilei Gan, Jiwei Li, Tianwei Zhang, Xiaoya Yuxian Meng, Fei Wu, Yi Yang, Shangwei Guo, Chun Fan. Proceedings of the 2022 Conference North American Chapter Association for Computational Linguistics: Human Language Technologies. 2022.
Language: Английский
Citations
26Findings of the Association for Computational Linguistics: ACL 2022, Journal Year: 2022, Volume and Issue: unknown, P. 1720 - 1732
Published: Jan. 1, 2022
Recently, there has been a trend to investigate the factual knowledge captured by Pre-trained Language Models (PLMs). Many works show PLMs' ability fill in missing words cloze-style prompts such as "Dante was born [MASK]." However, it is still mystery how PLMs generate results correctly: relying on effective clues or shortcut patterns? We try answer this question causal-inspired analysis that quantitatively measures and evaluates word-level patterns depend words. check have three typical associations with words: knowledge-dependent, positionally close, highly co-occurred. Our shows: (1) more close co-occurred than knowledge-dependent words; (2) dependence Accordingly, we conclude capture ineffectively because of depending inadequate associations.
Language: Английский
Citations
24IEEE Access, Journal Year: 2024, Volume and Issue: 12, P. 27225 - 27236
Published: Jan. 1, 2024
In the era of digital communication, social media platforms have experienced exponential growth, becoming primary channels for information exchange. However, this surge has also amplified rapid spread hate speech, prompting extensive research efforts effective mitigation. These prominently featured advanced natural language processing techniques, particularly emphasizing deep learning methods that shown promising outcomes. This article presents a novel approach to address pressing issue, combining comprehensive dataset 18 sources. It includes 0.45 million comments sourced from various spanning different time frames. There were two models utilized diversity in data and leverage distinct strengths found within frameworks: CNN BiLSTM with an attention mechanism. tailored handle specific subsets data, allowing more targeted approach. The unique outputs both then fused into unified model. methodology outperformed recent models, showcasing enhanced generalization capabilities even when tested on largest most diverse dataset. Our model achieved impressive accuracy 89%, while maintaining high precision 0.88 recall 0.91.
Language: Английский
Citations
4Information Fusion, Journal Year: 2025, Volume and Issue: unknown, P. 103000 - 103000
Published: Feb. 1, 2025
Language: Английский
Citations
0Automation in Construction, Journal Year: 2025, Volume and Issue: 176, P. 106237 - 106237
Published: May 16, 2025
Language: Английский
Citations
0PeerJ Computer Science, Journal Year: 2025, Volume and Issue: 11, P. e2911 - e2911
Published: May 30, 2025
The proliferation of user-generated content on social networking sites has intensified the challenge accurately and efficiently detecting inflammatory discriminatory speech at scale. Traditional manual moderation methods are impractical due to sheer volume complexity online discourse, necessitating automated solutions. However, existing deep learning models for hate detection typically function as black-box systems, providing binary classifications without interpretable insights into their decision-making processes. This opacity significantly limits practical utility, particularly in nuanced tasks. To address this challenge, our research explores leveraging advanced reasoning knowledge integration capabilities state-of-the-art language models, specifically Mistral-7B, develop transparent systems. We introduce a novel framework wherein large (LLMs) generate explicit rationales by identifying analyzing critical textual features indicative speech. These subsequently integrated specialized classifiers designed perform explainable moderation. rigorously evaluate methodology multiple benchmark English-language media datasets. Results demonstrate that incorporating LLM-generated explanations enhances both interpretability accuracy detection. approach not only identifies problematic effectively but also clearly articulates analytical rationale behind each decision, fulfilling demand transparency
Language: Английский
Citations
0Proceedings of the AAAI Conference on Artificial Intelligence, Journal Year: 2024, Volume and Issue: 38(16), P. 18243 - 18251
Published: March 24, 2024
Selective rationalization can be regarded as a straightforward self-explaining approach for enhancing model explainability in natural language processing tasks. It aims to provide explanations that are more accessible and understandable non-technical users by first selecting subsets of input texts rationales then predicting based on chosen subsets. However, existing methods follow this select-then-predict framework may suffer from the degeneration problem, resulting sub-optimal or unsatisfactory do not align with human judgments. This problem further lead failure, meaningless ultimately undermine people's trust model. To address these challenges, we propose Guidance-based Rationalization method (G-RAT) effectively improves robustness against failure situations quality using guidance module regularize selections distributions. Experimental results two synthetic settings prove our is robust problems, while real datasets show its effectiveness providing line The source code available at https://github.com/shuaibo919/g-rat.
Language: Английский
Citations
1Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Journal Year: 2022, Volume and Issue: unknown
Published: Jan. 1, 2022
Zijie Zeng, Xinyu Li, Dragan Gasevic, Guanliang Chen. Proceedings of the 2022 Conference North American Chapter Association for Computational Linguistics: Human Language Technologies. 2022.
Language: Английский
Citations
4Frontiers in Bioinformatics, Journal Year: 2022, Volume and Issue: 2
Published: April 26, 2022
Large-scale microbiome studies investigating disease-inducing microbial roles base their findings on differences between count data in contrasting environments (e.g., stool samples cases and controls). These survey are often impeded by small sample sizes database bias. Combining from multiple results obvious batch effects, even when DNA preparation sequencing methods identical. Relatedly, predictive models trained one dataset do not generalize to outside datasets. In this study, we address these limitations applying word embedding algorithms (GloVe) PCA transformation ASV the American Gut Project generating translation matrices that can be applied any 16S rRNA V4 region gut study. Because approaches contextualize occurrences a larger while reducing dimensionality of feature space, they improve generalization predict host phenotype associated microbiota. The GMEmbeddings R package contains GloVe at 50, 100 250 dimensions, each learned using ∼15,000 Project. It currently supports alignment, matching, matrix multiplication allow users transform into spaces. We show how correlate properties new space KEGG functional pathways for biological interpretation results. Lastly, provide benchmarking six datasets describing three phenotypes demonstrate ability embedding-based classifiers independent Future iterations will include other systems. Available at: https://github.com/MaudeDavidLab/GMEmbeddings .
Language: Английский
Citations
32022 International Conference on Augmented Intelligence and Sustainable Systems (ICAISS), Journal Year: 2022, Volume and Issue: unknown, P. 408 - 411
Published: Nov. 24, 2022
Natural language processing (NLP) studies the techniques and procedures that allow a machine to converse using human language. Recent advances in artificial intelligence communication technology have considerably enhanced natural applications. Due improvements deep learning, virtually every aspect of intelligence, including processing, has made substantial progress. Deep learning methods use many layers neurons construct neural network. Recurrent Neural Networks their variants like long short term (LSTM), bidirectional LSTM, are some most popular techniques. This article reviews employed NLP- specific approaches last few years. We also studied issues faced by researchers while trying apply Asian languages. highlight critical research work carried out recently for Indian other In addition, we discuss fundamental linguistic challenges suggest future scopes topic.
Language: Английский
Citations
3