Alexandria Engineering Journal, Год журнала: 2025, Номер 125, С. 354 - 366
Опубликована: Апрель 18, 2025
Язык: Английский
Alexandria Engineering Journal, Год журнала: 2025, Номер 125, С. 354 - 366
Опубликована: Апрель 18, 2025
Язык: Английский
Signal Processing, Год журнала: 2023, Номер 208, С. 108962 - 108962
Опубликована: Фев. 5, 2023
Язык: Английский
Процитировано
71Neurocomputing, Год журнала: 2023, Номер 545, С. 126300 - 126300
Опубликована: Май 13, 2023
Язык: Английский
Процитировано
50Information Sciences, Год журнала: 2024, Номер 660, С. 120130 - 120130
Опубликована: Янв. 21, 2024
Язык: Английский
Процитировано
20Electronics, Год журнала: 2023, Номер 12(4), С. 1024 - 1024
Опубликована: Фев. 18, 2023
With the advancement of computer technology, transformer models have been applied to field vision (CV) after their success in natural language processing (NLP). In today’s rapidly evolving medical field, radiologists continue face multiple challenges, such as increased workload and diagnostic demands. The accuracy traditional lung cancer detection methods still needs be improved, especially realistic scenarios. this study, we evaluated performance Swin Transformer model classification segmentation cancer. results showed that pre-trained Swin-B achieved a top-1 82.26% mission, outperforming ViT by 2.529%. Swin-S demonstrated improvement over other terms mean Intersection Union (mIoU). These suggest pre-training can an effective approach for improving these tasks.
Язык: Английский
Процитировано
43Applied Sciences, Год журнала: 2023, Номер 13(13), С. 7566 - 7566
Опубликована: Июнь 27, 2023
Pose recognition in character animations is an important avenue of research computer graphics. However, the current use traditional artificial intelligence algorithms to recognize animation gestures faces hurdles such as low accuracy and speed. Therefore, overcome above problems, this paper proposes a real-time 3D pose system, which includes both facial body poses, based on deep convolutional neural networks further designs single-purpose estimation system. First, we transformed human extracted from input image abstract data structure. Subsequently, generated required at runtime dataset. This challenges conventional concept monocular estimation, extremely difficult achieve. It can also achieve running speed resolution 384 fps. The proposed method was used identify multiple-character using multiple datasets (Microsoft COCO 2014, CMU Panoptic, Human3.6M, JTA). results indicated that improved algorithm performance by approximately 3.5% 8–10 times, respectively, significantly superior other classic algorithms. Furthermore, tested system pose-recognition datasets. attitude reach 24 fps with error 100 mm, considerably less than 2D 60 learning study yielded surprisingly performance, proving deep-learning technology for has great potential.
Язык: Английский
Процитировано
26Scientific Reports, Год журнала: 2025, Номер 15(1)
Опубликована: Март 15, 2025
Denoising is one of the most important processes in digital image processing to recover visual quality and structural integrity images. Traditional methods often suffer from limitations like computational complexity, over-smoothing, inability preserve critical details, particularly edges. This paper introduces a hybrid denoising algorithm combining Adaptive Median Filter (AMF) Modified Decision-Based (MDBMF) address these challenges. The AMF adjusts window sizes dynamically precisely detect noisy pixels, MDBMF selectively recovers corrupted pixels without affecting intact regions, effectively reducing noise while preserving subjective analysis supplemented with objective analyses which proves that approach performance considerably outperforms existing state-of-the-art methods. test conducted on nine benchmark images standard medical dataset, namely, Chest Liver different densities range 10 90%. Quantitative evaluations PSNR, MSE, IEF, SSIM, FOM VIF clearly show superiority when compared approaches. improvement PSNR was up 2.34 dB, IEF more than 20%, MSE 15% over other BPDF, AT2FF, SVMMF. Improvement values SSIM 0.07, confirms improved similarity. Furthermore, metrics demonstrate remarkable approach: both exceeded all techniques evaluated, reaching 0.68 0.61, respectively.
Язык: Английский
Процитировано
1Applied Sciences, Год журнала: 2023, Номер 13(8), С. 5175 - 5175
Опубликована: Апрель 21, 2023
Financial time-series prediction has been an important topic in deep learning, and the of financial time series is great importance to investors, commercial banks regulators. This paper proposes a model based on multiplexed attention mechanisms linear transformers predict series. The transformer faster training efficiency long-time forecasting capability. Using reduces original transformer’s complexity preserves decoder’s mechanism. results show that proposed method can effectively improve accuracy model, increase inference speed reduce number operations, which new implications for
Язык: Английский
Процитировано
18Neurocomputing, Год журнала: 2023, Номер 548, С. 126284 - 126284
Опубликована: Май 3, 2023
Язык: Английский
Процитировано
18Frontiers in Energy Research, Год журнала: 2023, Номер 11
Опубликована: Апрель 13, 2023
Accurate wind power prediction is crucial for the safe and stable operation of grid. However, generation has large random volatility intermittency, which increases difficulty prediction. In order to construct an effective model based on achieve grid dispatch after connected grid, a WT-BiGRU-Attention-TCN proposed. First, wavelet transform (WT) used reduce noises sample data. Then, temporal attention mechanism incorporated into bi-directional gated recurrent unit (BiGRU) highlight impact key time steps results while fully extracting features context. Finally, performance enhanced by further more high-level through convolutional neural network (TCN). The show that our proposed outperforms other baseline models, achieving root mean square error 0.066 MW, absolute percentage 18.876%, coefficient determination (R 2 ) reaches 0.976. It indicates noise-reduction WT technique can significantly improve performance, also shows using TCN accuracy.
Язык: Английский
Процитировано
16Frontiers in Energy Research, Год журнала: 2023, Номер 11
Опубликована: Май 5, 2023
A smart grid is a new type of power system based on modern information technology, which utilises advanced communication, computing and control technologies employs sensors, measurement, communication devices that can monitor the status operation various in real-time optimise dispatch through intelligent algorithms to achieve efficient system. However, due its complexity uncertainty, how effectively perform prediction an important challenge. This paper proposes model attention mechanism convolutional neural network (CNN) combined with bi-directional long short-term memory BiLSTM.The has stronger spatiotemporal feature extraction capability, more accurate capability better adaptability than ARMA decision trees. The traditional models tree often only use simple statistical methods for prediction, cannot meet requirements high accuracy efficiency load so CNN-BiLSTM Bayesian optimisation following advantages suitable compared tree. CNN hierarchical structure containing several layers such as layer, pooling layer fully connected layer. mainly used extracting features from data images, dimensionality reduction features, classification recognition. core operation, locally weighted summation input extract data. In convolution different be extracted by setting kernels BiLSTM capture semantic dependencies both directions. consists two LSTM process sequence forward backward directions combine obtain comprehensive contextual information. access front back inputs at each time step results. It prevents gradient explosion disappearance while capturing longer-distance dependencies. extracts then optimises them Bayes. By collecting system, including power, load, weather other factors, our uses deeply learn grids key future prediction. Meanwhile, algorithm model’s hyperparameters, thus improving performance. provide reference help energy utilisation
Язык: Английский
Процитировано
15