THE ESSENCE OF DEEP LEARNING DOI Creative Commons
Wei Wang

Published: Dec. 17, 2021

Using the public data set Cifar-10.

Language: Английский

Triggerless Backdoor Attack for NLP Tasks with Clean Labels DOI Creative Commons
Leilei Gan, Jiwei Li, Tianwei Zhang

et al.

Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Journal Year: 2022, Volume and Issue: unknown

Published: Jan. 1, 2022

Leilei Gan, Jiwei Li, Tianwei Zhang, Xiaoya Yuxian Meng, Fei Wu, Yi Yang, Shangwei Guo, Chun Fan. Proceedings of the 2022 Conference North American Chapter Association for Computational Linguistics: Human Language Technologies. 2022.

Language: Английский

Citations

26

How Pre-trained Language Models Capture Factual Knowledge? A Causal-Inspired Analysis DOI Creative Commons
Shaobo Li, Xiaoguang Li,

Lifeng Shang

et al.

Findings of the Association for Computational Linguistics: ACL 2022, Journal Year: 2022, Volume and Issue: unknown, P. 1720 - 1732

Published: Jan. 1, 2022

Recently, there has been a trend to investigate the factual knowledge captured by Pre-trained Language Models (PLMs). Many works show PLMs' ability fill in missing words cloze-style prompts such as "Dante was born [MASK]." However, it is still mystery how PLMs generate results correctly: relying on effective clues or shortcut patterns? We try answer this question causal-inspired analysis that quantitatively measures and evaluates word-level patterns depend words. check have three typical associations with words: knowledge-dependent, positionally close, highly co-occurred. Our shows: (1) more close co-occurred than knowledge-dependent words; (2) dependence Accordingly, we conclude capture ineffectively because of depending inadequate associations.

Language: Английский

Citations

24

Enhancing Hate Speech Detection in the Digital Age: A Novel Model Fusion Approach Leveraging a Comprehensive Dataset DOI Creative Commons
Waqas Sharif, Saima Abdullah, Saman Iftikhar

et al.

IEEE Access, Journal Year: 2024, Volume and Issue: 12, P. 27225 - 27236

Published: Jan. 1, 2024

In the era of digital communication, social media platforms have experienced exponential growth, becoming primary channels for information exchange. However, this surge has also amplified rapid spread hate speech, prompting extensive research efforts effective mitigation. These prominently featured advanced natural language processing techniques, particularly emphasizing deep learning methods that shown promising outcomes. This article presents a novel approach to address pressing issue, combining comprehensive dataset 18 sources. It includes 0.45 million comments sourced from various spanning different time frames. There were two models utilized diversity in data and leverage distinct strengths found within frameworks: CNN BiLSTM with an attention mechanism. tailored handle specific subsets data, allowing more targeted approach. The unique outputs both then fused into unified model. methodology outperformed recent models, showcasing enhanced generalization capabilities even when tested on largest most diverse dataset. Our model achieved impressive accuracy 89%, while maintaining high precision 0.88 recall 0.91.

Language: Английский

Citations

4

DeepFusionSent: A novel feature fusion approach for deep learning-enhanced sentiment classification DOI
Ankit Thakkar, Devshri Pandya

Information Fusion, Journal Year: 2025, Volume and Issue: unknown, P. 103000 - 103000

Published: Feb. 1, 2025

Language: Английский

Citations

0

Benchmarking methods for classifying space functions and access elements in multi-unit apartment buildings DOI Creative Commons
Amir Ziaee, Georg Suter

Automation in Construction, Journal Year: 2025, Volume and Issue: 176, P. 106237 - 106237

Published: May 16, 2025

Language: Английский

Citations

0

TARGE: large language model-powered explainable hate speech detection DOI Creative Commons

Muhammad Haseeb Hashir,

Memoona,

Sung Won Kim

et al.

PeerJ Computer Science, Journal Year: 2025, Volume and Issue: 11, P. e2911 - e2911

Published: May 30, 2025

The proliferation of user-generated content on social networking sites has intensified the challenge accurately and efficiently detecting inflammatory discriminatory speech at scale. Traditional manual moderation methods are impractical due to sheer volume complexity online discourse, necessitating automated solutions. However, existing deep learning models for hate detection typically function as black-box systems, providing binary classifications without interpretable insights into their decision-making processes. This opacity significantly limits practical utility, particularly in nuanced tasks. To address this challenge, our research explores leveraging advanced reasoning knowledge integration capabilities state-of-the-art language models, specifically Mistral-7B, develop transparent systems. We introduce a novel framework wherein large (LLMs) generate explicit rationales by identifying analyzing critical textual features indicative speech. These subsequently integrated specialized classifiers designed perform explainable moderation. rigorously evaluate methodology multiple benchmark English-language media datasets. Results demonstrate that incorporating LLM-generated explanations enhances both interpretability accuracy detection. approach not only identifies problematic effectively but also clearly articulates analytical rationale behind each decision, fulfilling demand transparency

Language: Английский

Citations

0

Learning Robust Rationales for Model Explainability: A Guidance-Based Approach DOI Open Access

Shuaibo Hu,

Kui Yu

Proceedings of the AAAI Conference on Artificial Intelligence, Journal Year: 2024, Volume and Issue: 38(16), P. 18243 - 18251

Published: March 24, 2024

Selective rationalization can be regarded as a straightforward self-explaining approach for enhancing model explainability in natural language processing tasks. It aims to provide explanations that are more accessible and understandable non-technical users by first selecting subsets of input texts rationales then predicting based on chosen subsets. However, existing methods follow this select-then-predict framework may suffer from the degeneration problem, resulting sub-optimal or unsatisfactory do not align with human judgments. This problem further lead failure, meaningless ultimately undermine people's trust model. To address these challenges, we propose Guidance-based Rationalization method (G-RAT) effectively improves robustness against failure situations quality using guidance module regularize selections distributions. Experimental results two synthetic settings prove our is robust problems, while real datasets show its effectiveness providing line The source code available at https://github.com/shuaibo919/g-rat.

Language: Английский

Citations

1

Do Deep Neural Nets Display Human-like Attention in Short Answer Scoring? DOI Creative Commons
Zijie Zeng, Xinyu Li, Dragan Gašević

et al.

Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Journal Year: 2022, Volume and Issue: unknown

Published: Jan. 1, 2022

Zijie Zeng, Xinyu Li, Dragan Gasevic, Guanliang Chen. Proceedings of the 2022 Conference North American Chapter Association for Computational Linguistics: Human Language Technologies. 2022.

Language: Английский

Citations

4

GMEmbeddings: An R Package to Apply Embedding Techniques to Microbiome Data DOI Creative Commons
Christine Tataru,

Austin Eaton,

Maude M. David

et al.

Frontiers in Bioinformatics, Journal Year: 2022, Volume and Issue: 2

Published: April 26, 2022

Large-scale microbiome studies investigating disease-inducing microbial roles base their findings on differences between count data in contrasting environments (e.g., stool samples cases and controls). These survey are often impeded by small sample sizes database bias. Combining from multiple results obvious batch effects, even when DNA preparation sequencing methods identical. Relatedly, predictive models trained one dataset do not generalize to outside datasets. In this study, we address these limitations applying word embedding algorithms (GloVe) PCA transformation ASV the American Gut Project generating translation matrices that can be applied any 16S rRNA V4 region gut study. Because approaches contextualize occurrences a larger while reducing dimensionality of feature space, they improve generalization predict host phenotype associated microbiota. The GMEmbeddings R package contains GloVe at 50, 100 250 dimensions, each learned using ∼15,000 Project. It currently supports alignment, matching, matrix multiplication allow users transform into spaces. We show how correlate properties new space KEGG functional pathways for biological interpretation results. Lastly, provide benchmarking six datasets describing three phenotypes demonstrate ability embedding-based classifiers independent Future iterations will include other systems. Available at: https://github.com/MaudeDavidLab/GMEmbeddings .

Language: Английский

Citations

3

Recent Trends in Deep Learning for Natural Language Processing and Scope for Asian Languages DOI
Diganta Baishya, Rupam Baruah

2022 International Conference on Augmented Intelligence and Sustainable Systems (ICAISS), Journal Year: 2022, Volume and Issue: unknown, P. 408 - 411

Published: Nov. 24, 2022

Natural language processing (NLP) studies the techniques and procedures that allow a machine to converse using human language. Recent advances in artificial intelligence communication technology have considerably enhanced natural applications. Due improvements deep learning, virtually every aspect of intelligence, including processing, has made substantial progress. Deep learning methods use many layers neurons construct neural network. Recurrent Neural Networks their variants like long short term (LSTM), bidirectional LSTM, are some most popular techniques. This article reviews employed NLP- specific approaches last few years. We also studied issues faced by researchers while trying apply Asian languages. highlight critical research work carried out recently for Indian other In addition, we discuss fundamental linguistic challenges suggest future scopes topic.

Language: Английский

Citations

3