Feature Subset Selection for High-Dimensional, Low Sampling Size Data Classification Using Ensemble Feature Selection With a Wrapper-Based Search DOI Creative Commons
Ashis Kumar Mandal, Md Nadim, Hasi Saha

и другие.

IEEE Access, Год журнала: 2024, Номер 12, С. 62341 - 62357

Опубликована: Янв. 1, 2024

The identification of suitable feature subsets from High-Dimensional Low-Sample-Size (HDLSS) data is paramount importance because this dataset often contains numerous redundant and irrelevant features, leading to poor classification performance. However, the selection an optimal subset a vast space creates significant computational challenge. In domain HDLSS data, conventional methods face challenges in achieving balance between reducing number features preserving high accuracy. Addressing these issues, study introduces effective framework that employs filter wrapper-based strategy specifically designed address inherent data. adopts multi-step approach where ensemble integrates five ranking approaches: Chi-square (χ 2 ), Gini index (GI), F-score, Mutual Information (MI), Symmetric uncertainty (SU) identify top-ranking features. subsequent stage, search method utilized, which Differential Evaluation (DE) metaheuristic algorithm as strategy. fitness during assessed based on weighted combination error rate Support Vector Machine (SVM) classifier cardinality subset. datasets, now with reduced dimensionality, are subsequently employed build models SVM, K-Nearest Neighbors (KNN), Logistic Regression (LR).The proposed was evaluated 13 datasets assess its efficacy selecting appropriate improving Classification Accuracy (ACC) analog Area Under Curve (AUC).The produces smaller (ranging 2 9 for all datasets), while maintaining commendable average AUC ACC (between 98% 100%). comparative results demonstrate outperforms both non-feature approaches terms ACC. Furthermore, when compared several other state-of-the-art approaches, exhibits

Язык: Английский

Machine learning-based global trends and the development prospects of wastewater treatment: A bibliometric analysis DOI
Libo Xia,

Xiaoxuan Hao,

Yun Zhou

и другие.

Journal of environmental chemical engineering, Год журнала: 2024, Номер 12(3), С. 112732 - 112732

Опубликована: Апрель 7, 2024

Язык: Английский

Процитировано

7

Novel Machine Learning-Based Energy Consumption Model of Wastewater Treatment Plants DOI
Shike Zhang, Hongtao Wang, Arturo A. Keller

и другие.

ACS ES&T Water, Год журнала: 2021, Номер 1(12), С. 2531 - 2540

Опубликована: Ноя. 19, 2021

Wastewater treatment plants (WWTPs) can account for up to 1% of a country's energy consumption. Meanwhile, WWTPs have high energy-saving potential. To achieve this, it is necessary establish appropriate consumption models WWTPs. Several recent been developed using logarithmic, exponential, or linear functions. However, the behavior non-linear and difficult fit with simple functions, particularly non-numerical variables. Thus, traditional modeling methods cannot effectively describe relationship between water in Therefore, machine learning method was adopted this study investigate WWTPs; novel model variable (discharge standard) random forest algorithm. The also predict after upgrading discharge standards. We found that unit electricity exhibited an average increase 17% effluent standard increased from class I B A (as per China's classification). correlation coefficient 0.702. provide better understanding efficiency

Язык: Английский

Процитировано

38

Identifying Functional Groups that Determine Rates of Micropollutant Biotransformations Performed by Wastewater Microbial Communities DOI
Stephanie L. Rich, Michael Zumstein, Damian E. Helbling

и другие.

Environmental Science & Technology, Год журнала: 2021, Номер 56(2), С. 984 - 994

Опубликована: Дек. 23, 2021

The goal of this research was to identify functional groups that determine rates micropollutant (MP) biotransformations performed by wastewater microbial communities. To meet goal, we a series incubation experiments seeded with four independent communities and spiked them mixture 40 structurally diverse MPs. We collected samples over time used high-resolution mass spectrometry estimate biotransformation rate constants for each MP in experiment propose structures 46 products. then developed random forest models classify the based on presence specific or observed biotransformations. extracted classification importance metrics from model compared across Our analysis revealed 30 define as either promoters, inhibitors, structural features can be biotransformed uncharacterized community, are not rate-determining. experimental data provide novel insights into more accurately predict inform design new chemical products may readily biodegradable during treatment.

Язык: Английский

Процитировано

36

Future Frameworks for Fluoride and Algorithms for Environmental System DOI

Mridu Kulwant,

Divya Patel,

Saba Shirin

и другие.

Water science and technology library, Год журнала: 2023, Номер unknown, С. 343 - 364

Опубликована: Янв. 1, 2023

Язык: Английский

Процитировано

16

Enhancing compound confidence in suspect and non-target screening through machine learning-based retention time prediction DOI

Dehao Song,

Ting Tang,

Rui Wang

и другие.

Environmental Pollution, Год журнала: 2024, Номер 347, С. 123763 - 123763

Опубликована: Март 14, 2024

Язык: Английский

Процитировано

6

Information Communication and Computation Technologies (ICCT) for Agricultural and Environmental Information Systems for Society 5.0 DOI Open Access
Shubhrajyotsna Aithal, P. S. Aithal

International Journal of Applied Engineering and Management Letters, Год журнала: 2024, Номер unknown, С. 67 - 100

Опубликована: Фев. 28, 2024

Purpose: This paper aims to discover the dynamic landscape of Information Communication and Computation Technologies (ICCT) within agriculture environmental information management, elucidating their evolutionary trajectory resonance Society 5.0 principles in fostering innovative solutions. By scrutinizing core technologies constituting ICCT these sectors, it endeavours shed light on potential for integration framework 5.0, contemplating both possibilities challenges inherent this convergence. Methodology: exploratory chapter delves into evolving its pivotal emphasis address complex management. Analysis/Results: The provides a background evolution establishes rationale exploring role advancing Agricultural Environmental Systems transformative societal framework. are explored through IoT applications precision agriculture, impact blockchain agricultural supply chains, utilization remote sensing Earth observation systems along with data analytics insights. further investigates systems, unveiling how support smart farming practices, citizen engagement decision-making, sustainable resource Case studies highlight successful implementations underscoring best practices lessons learned. Emerging trends science explored, providing insights future developments. Originality/Value: Through lens case showcasing implementations, seeks distill key insights, while also conducting forward-looking assessment emerging applications, thus contributing deeper understanding shaping paradigms context future. Type Paper: Exploratory analysis.

Язык: Английский

Процитировано

6

Predicting removal efficiency of organic pollutants by soil vapor extraction based on an optimized machine learning method DOI
Shuai Zhang, Jiating Zhao, Lizhong Zhu

и другие.

The Science of The Total Environment, Год журнала: 2024, Номер 927, С. 172438 - 172438

Опубликована: Апрель 12, 2024

Язык: Английский

Процитировано

6

Enhancing Data Integration, Interoperability, and Reuse to Address Complex and Emerging Environmental Health Problems DOI Creative Commons

Michelle Heacock,

Adeline R. Lopez, Sara M. Amolegbe

и другие.

Environmental Science & Technology, Год журнала: 2022, Номер 56(12), С. 7544 - 7552

Опубликована: Май 12, 2022

Environmental health sciences (EHS) span many diverse disciplines. Within the EHS community, National Institute of Health Sciences Superfund Research Program (SRP) funds multidisciplinary research aimed to address pressing and complex issues on how people are exposed hazardous substances their related consequences with goal identifying strategies reduce exposures protect human health. While disentangling interrelationships that contribute environmental effects over course life remains difficult, advances in data science sharing offer a path forward explore across disciplines reveal new insights. Multidisciplinary SRP-funded teams well-positioned examine best integrate domains multifaceted problems. As such, SRP supported collaborative projects designed foster enhance interoperability reuse streams. This perspective synthesizes those experiences as landscape view challenges identified while working increase FAIR-ness (Findable, Accessible, Interoperable, Reusable) opportunities them.

Язык: Английский

Процитировано

20

Integration of Per- and Polyfluoroalkyl Substance (PFAS) Fingerprints in Fish with Machine Learning for PFAS Source Tracking in Surface Water DOI
John Stults, Christopher P. Higgins, Damian E. Helbling

и другие.

Environmental Science & Technology Letters, Год журнала: 2023, Номер 10(11), С. 1052 - 1058

Опубликована: Май 23, 2023

Per- and polyfluoroalkyl substances (PFASs) are a class of environmental contaminants that originate from various sources. The unique chemical fingerprints associated with many commercial products industrial applications make PFASs ideal candidates for machine learning (ML)-assisted forensics. Here, we propose novel use PFAS in fish tissue surface water systems to classify exposure multiple sources using proof-of-concept demonstration. Three supervised ML classification techniques (k-nearest neighbors (KNN), decision trees, support vector machines) implementing two predictive features used literature-reported (n = 1057). importance additional was explored brute force optimization multifeature KNN algorithm. multiclass considered aqueous film-forming foam-impacted water, paper industry wastewater, diffuse sources, or undergoing long-range transport. optimized classifiers demonstrated 85%–94% accuracy this first known also 79%–92% set independent external validation data 192). Our results demonstrate may be an effective means source tracking systems. code is provided guidance on best practices ML-assisted

Язык: Английский

Процитировано

13

Operational parameter prediction of electrocoagulation system in a rural decentralized water treatment plant by interpretable machine learning model DOI
Bowen Li,

Chaojie Lu,

Jin Zhao

и другие.

Journal of Environmental Management, Год журнала: 2023, Номер 333, С. 117416 - 117416

Опубликована: Фев. 7, 2023

Язык: Английский

Процитировано

11