Cited by THE ESSENCE OF DEEP LEARNING

Triggerless Backdoor Attack for NLP Tasks with Clean Labels

Leilei Gan, Jiwei Li, Tianwei Zhang, et al.

Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Journal year: 2022, Issue: unknown

Published: Jan. 1, 2022

Leilei Gan, Jiwei Li, Tianwei Zhang, Xiaoya Li, Yuxian Meng, Fei Wu, Yi Yang, Shangwei Guo, Chun Fan. Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. 2022.

Language: English

Cited by

How Pre-trained Language Models Capture Factual Knowledge? A Causal-Inspired Analysis

Shaobo Li, Xiaoguang Li, Lifeng Shang, et al.

Findings of the Association for Computational Linguistics: ACL 2022, Journal year: 2022, Issue: unknown, Pages: 1720-1732

Published: Jan. 1, 2022

Recently, there has been a trend to investigate the factual knowledge captured by Pre-trained Language Models (PLMs). Many works show the PLMs' ability to fill in the missing factual words in cloze-style prompts such as "Dante was born in [MASK]." However, it is still a mystery how PLMs generate the results correctly: relying on effective clues or shortcut patterns? We try to answer this question by a causal-inspired analysis that quantitatively measures and evaluates the word-level patterns that PLMs depend on to generate the missing words. We check the words that have three typical associations with the missing words: knowledge-dependent, positionally close, and highly co-occurred. Our analysis shows: (1) PLMs generate the missing factual words more by the positionally close and highly co-occurred words than the knowledge-dependent words; (2) the dependence on the knowledge-dependent words is more effective than the positionally close and highly co-occurred words. Accordingly, we conclude that the PLMs capture the factual knowledge ineffectively because of depending on the inadequate associations.

Language: English

Cited by

Enhancing Hate Speech Detection in the Digital Age: A Novel Model Fusion Approach Leveraging a Comprehensive Dataset

Waqas Sharif, Saima Abdullah, Saman Iftikhar, et al.

IEEE Access, Journal year: 2024, Issue: 12, Pages: 27225-27236

Published: Jan. 1, 2024

In the era of digital communication, social media platforms have experienced exponential growth, becoming primary channels for information exchange. However, this surge has also amplified the rapid spread of hate speech, prompting extensive research efforts for effective mitigation. These efforts have prominently featured advanced natural language processing techniques, particularly emphasizing deep learning methods that have shown promising outcomes. This article presents a novel approach to address this pressing issue, combining a comprehensive dataset from 18 sources. It includes 0.45 million comments sourced from various social media platforms, spanning different time frames. Two models were utilized to handle the diversity in the data and leverage the distinct strengths found within deep learning frameworks: a CNN and a BiLSTM with an attention mechanism. These models were tailored to handle specific subsets of the data, allowing a more targeted approach. The unique outputs of both models were then fused into a unified model. The methodology outperformed recent models, showcasing enhanced generalization capabilities even when tested on the largest and most diverse dataset. Our model achieved an impressive accuracy of 89%, while maintaining a high precision of 0.88 and a recall of 0.91.
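For readers comparing this result against papers that report a single F1 figure, the precision and recall quoted above can be combined with the standard harmonic-mean formula. The abstract itself does not state an F1 score, so the value computed here is derived only from the two reported numbers; the function and variable names are illustrative.

```python
def f1_score(precision: float, recall: float) -> float:
    """Return the harmonic mean of precision and recall."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Values reported in the abstract above.
reported_precision = 0.88
reported_recall = 0.91

# Derived figure, not a number stated by the authors.
f1 = f1_score(reported_precision, reported_recall)
print(f"Derived F1: {f1:.3f}")
```

The derived value comes out at roughly 0.89, in line with the reported accuracy.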

Language: English

Cited by

DeepFusionSent: A novel feature fusion approach for deep learning-enhanced sentiment classification

Ankit Thakkar, Devshri Pandya

Information Fusion, Journal year: 2025, Issue: unknown, Pages: 103000-103000

Published: Feb. 1, 2025

Language: English

Cited by

Benchmarking methods for classifying space functions and access elements in multi-unit apartment buildings

Amir Ziaee, Georg Suter

Automation in Construction, Journal year: 2025, Issue: 176, Pages: 106237-106237

Published: May 16, 2025

Language: English

Cited by

TARGE: large language model-powered explainable hate speech detection

Muhammad Haseeb Hashir, Memoona, Sung Won Kim, et al.

PeerJ Computer Science, Journal year: 2025, Issue: 11, Pages: e2911-e2911

Published: May 30, 2025

The proliferation of user-generated content on social networking sites has intensified the challenge of accurately and efficiently detecting inflammatory and discriminatory speech at scale. Traditional manual moderation methods are impractical due to the sheer volume and complexity of online discourse, necessitating automated solutions. However, existing deep learning models for hate speech detection typically function as black-box systems, providing binary classifications without interpretable insights into their decision-making processes. This opacity significantly limits their practical utility, particularly in nuanced moderation tasks. To address this challenge, our research explores leveraging the advanced reasoning and knowledge integration capabilities of state-of-the-art language models, specifically Mistral-7B, to develop transparent hate speech detection systems. We introduce a novel framework wherein large language models (LLMs) generate explicit rationales by identifying and analyzing critical textual features indicative of hate speech. These rationales are subsequently integrated into specialized classifiers designed to perform explainable moderation. We rigorously evaluate our methodology on multiple benchmark English-language social media datasets. Results demonstrate that incorporating LLM-generated explanations enhances both the interpretability and the accuracy of hate speech detection. This approach not only identifies problematic content effectively but also clearly articulates the analytical rationale behind each decision, fulfilling the demand for transparency.

Language: English

Cited by

Learning Robust Rationales for Model Explainability: A Guidance-Based Approach

Shuaibo Hu, Kui Yu

Proceedings of the AAAI Conference on Artificial Intelligence, Journal year: 2024, Issue: 38(16), Pages: 18243-18251

Published: March 24, 2024

Selective rationalization can be regarded as a straightforward self-explaining approach for enhancing model explainability in natural language processing tasks. It aims to provide explanations that are more accessible and understandable to non-technical users by first selecting subsets of input texts as rationales and then predicting based on the chosen subsets. However, existing methods that follow this select-then-predict framework may suffer from the degeneration problem, resulting in sub-optimal or unsatisfactory rationales that do not align with human judgments. This problem may further lead to rationalization failure, resulting in meaningless rationales that ultimately undermine people's trust in the model. To address these challenges, we propose a Guidance-based Rationalization method (G-RAT) that effectively improves robustness against failure situations and the quality of rationales by using a guidance module to regularize selections and distributions. Experimental results on two synthetic settings prove that our method is robust to the degeneration and failure problems, while the results on two real datasets show its effectiveness in providing rationales in line with human judgments. The source code is available at https://github.com/shuaibo919/g-rat.
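The select-then-predict framework described in this abstract can be illustrated with a toy sketch. This is not G-RAT and contains no guidance module; the scoring function, the lexicons, and the example sentence are all hypothetical stand-ins for learned components, chosen only to show that the predictor sees the selected rationale rather than the full input.

```python
from typing import Callable, List, Set


def select_rationale(tokens: List[str],
                     score: Callable[[str], float],
                     k: int) -> List[str]:
    """Selector: keep the k highest-scoring tokens, preserving input order."""
    ranked = sorted(range(len(tokens)),
                    key=lambda i: score(tokens[i]),
                    reverse=True)
    keep = sorted(ranked[:k])
    return [tokens[i] for i in keep]


def predict(rationale: List[str], positive: Set[str], negative: Set[str]) -> str:
    """Predictor: sees only the selected rationale, never the full input."""
    pos = sum(t in positive for t in rationale)
    neg = sum(t in negative for t in rationale)
    return "positive" if pos >= neg else "negative"


# Hypothetical toy lexicons standing in for learned parameters.
POSITIVE = {"great", "crisp"}
NEGATIVE = {"stale", "bland"}


def toy_score(token: str) -> float:
    """Hypothetical importance score: 1.0 for lexicon words, else 0.0."""
    return 1.0 if token in POSITIVE | NEGATIVE else 0.0


tokens = "the beer was crisp and the aroma was great".split()
rationale = select_rationale(tokens, toy_score, k=2)
label = predict(rationale, POSITIVE, NEGATIVE)
print(rationale, label)
```

In this setup a selector that picks uninformative tokens still lets the predictor emit a label, which is the kind of degenerate behaviour the abstract says guidance is meant to counter.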

Language: English

Cited by

Do Deep Neural Nets Display Human-like Attention in Short Answer Scoring?

Zijie Zeng, Xinyu Li, Dragan Gašević, et al.

Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Journal year: 2022, Issue: unknown

Published: Jan. 1, 2022

Zijie Zeng, Xinyu Li, Dragan Gašević, Guanliang Chen. Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. 2022.

Language: English

Cited by

GMEmbeddings: An R Package to Apply Embedding Techniques to Microbiome Data

Christine Tataru, Austin Eaton, Maude M. David, et al.

Frontiers in Bioinformatics, Journal year: 2022, Issue: 2

Published: April 26, 2022

Large-scale microbiome studies investigating disease-inducing microbial roles base their findings on differences between count data in contrasting environments (e.g., stool samples from cases and controls). These survey studies are often impeded by small sample sizes and database bias. Combining data from multiple studies results in obvious batch effects, even when DNA preparation and sequencing methods are identical. Relatedly, predictive models trained on one dataset do not generalize to outside datasets. In this study, we address these limitations by applying word embedding algorithms (GloVe) and PCA transformation to ASV data from the American Gut Project and generating translation matrices that can be applied to any 16S rRNA V4 region gut microbiome study. Because these approaches contextualize microbial occurrences in a larger dataset while reducing the dimensionality of the feature space, they improve the generalization of models that predict host phenotype from the associated microbiota. The GMEmbeddings R package contains GloVe and PCA transformation matrices at 50, 100, and 250 dimensions, each learned using ∼15,000 samples from the American Gut Project. It currently supports the alignment, matching, and matrix multiplication needed to allow users to transform their data into these embedding spaces. We show how to correlate properties of the new embedding space with KEGG functional pathways for biological interpretation of results. Lastly, we provide benchmarking on six datasets describing three phenotypes to demonstrate the ability of embedding-based classifiers to generalize to independent datasets. Future iterations will include other systems. Available at: https://github.com/MaudeDavidLab/GMEmbeddings.
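The matching and matrix-multiplication steps mentioned in this abstract can be sketched as a projection of a sample-by-ASV count table through an ASV-by-dimension embedding matrix. The sketch below is written in Python rather than R and does not use the GMEmbeddings package or its API; the ASV identifiers, counts, and embedding values are made up, and two dimensions are used for brevity instead of 50, 100, or 250.

```python
from typing import Dict, List


def embed_samples(counts: List[Dict[str, float]],
                  embedding: Dict[str, List[float]]) -> List[List[float]]:
    """Project per-sample ASV counts into an embedding space.

    Each sample maps an ASV id to a count. ASVs with no row in the
    embedding matrix are skipped, which plays the role of the matching step.
    """
    dims = len(next(iter(embedding.values())))
    result = []
    for sample in counts:
        vec = [0.0] * dims
        for asv, count in sample.items():
            row = embedding.get(asv)
            if row is None:
                continue  # ASV not present in the embedding matrix
            for d in range(dims):
                vec[d] += count * row[d]
        result.append(vec)
    return result


# Made-up ASV ids and values; real matrices would have 50 to 250 columns.
embedding = {"asv_a": [1.0, 0.0], "asv_b": [0.5, 2.0]}
counts = [{"asv_a": 2, "asv_b": 4}, {"asv_b": 1, "asv_unknown": 9}]

embedded = embed_samples(counts, embedding)
print(embedded)
```

Every sample ends up with the same fixed-length vector regardless of which ASVs the original study observed, which is what makes classifiers trained in the embedding space portable across datasets.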

Language: English

Cited by

Recent Trends in Deep Learning for Natural Language Processing and Scope for Asian Languages

Diganta Baishya, Rupam Baruah

2022 International Conference on Augmented Intelligence and Sustainable Systems (ICAISS), Journal year: 2022, Issue: unknown, Pages: 408-411

Published: Nov. 24, 2022

Natural language processing (NLP) studies the techniques and procedures that allow a machine to converse using human language. Recent advances in artificial intelligence and communication technology have considerably enhanced natural language processing applications. Due to improvements in deep learning, virtually every aspect of artificial intelligence, including natural language processing, has made substantial progress. Deep learning methods use many layers of neurons to construct a neural network. Recurrent Neural Networks and their variants, like long short-term memory (LSTM) and bidirectional LSTM, are some of the most popular techniques. This article reviews the deep learning techniques employed in NLP-specific approaches in the last few years. We also studied the issues faced by researchers while trying to apply deep learning techniques to Asian languages. We highlight critical research work carried out recently for Indian and other Asian languages. In addition, we discuss fundamental linguistic challenges and suggest future scopes of the topic.

Language: English

Cited by