Joint optimization of storage assignment and order batching in robotic mobile fulfillment system with dynamic storage depth and surplus items DOI
Zhi Liu, Jiansha Lu, Chenhao Ren

et al.

Computers & Industrial Engineering, Journal Year: 2024, Volume and Issue: unknown, P. 110767 - 110767

Published: Dec. 1, 2024

Language: Английский

Deep reinforcement learning for dynamic order picking in warehouse operations DOI
Sasan Mahmoudinazlou, Abhay Sobhanan, Hadi Charkhgard

et al.

Computers & Operations Research, Journal Year: 2025, Volume and Issue: unknown, P. 107112 - 107112

Published: May 1, 2025

Language: Английский

Citations

0

Multi-objective optimization enabling CFRP energy-efficient milling based on deep reinforcement learning DOI
Meihang Zhang, Hua Zhang, Wei Yan

et al.

Applied Intelligence, Journal Year: 2024, Volume and Issue: unknown

Published: Sept. 18, 2024

Language: Английский

Citations

2

Online Three-Dimensional Fuzzy Reinforcement Learning Modeling for Nonlinear Distributed Parameter Systems DOI Open Access
Xianxia Zhang,

Runbin Yan,

Gang Zhou

et al.

Electronics, Journal Year: 2024, Volume and Issue: 13(21), P. 4217 - 4217

Published: Oct. 27, 2024

Distributed parameter systems (DPSs) frequently appear in industrial manufacturing processes, with complex characteristics such as time–space coupling, nonlinearity, infinite dimension, uncertainty and so on, which is full of challenges to the modeling system. At present, most DPS methods are offline. When internal parameters or external environment change, offline model incapable accurately representing dynamic attributes real Establishing an online for that reflects real-time dynamics system very important. In this paper, idea reinforcement learning creatively integrated into three-dimensional (3D) fuzzy a learning-based 3D method proposed. The agent improves strategy by continuously interacting environment, can adaptively establish from scratch. Specifically, paper combines deterministic gradient algorithm based on actor critic framework function represented two updated alternately. uses TD (0) target via semi-gradient method; using chain derivation rule behavior value established model. Since continuous problem, proposes average reward, effectively realize modeling. suggested methodology implemented three-zone rapid thermal chemical vapor deposition reactor simulation results demonstrate efficacy methodology.

Language: Английский

Citations

0

Joint optimization of storage assignment and order batching in robotic mobile fulfillment system with dynamic storage depth and surplus items DOI
Zhi Liu, Jiansha Lu, Chenhao Ren

et al.

Computers & Industrial Engineering, Journal Year: 2024, Volume and Issue: unknown, P. 110767 - 110767

Published: Dec. 1, 2024

Language: Английский

Citations

0