Transfer Reinforcement Learning for Mixed Observability Markov Decision Processes with Time-Varying Interval-Valued Parameters and Its Application in Pandemic Control DOI

Mu Du,

Hongtao Yu, Nan Kong

и другие.

INFORMS journal on computing, Год журнала: 2024, Номер unknown

Опубликована: Июнь 6, 2024

We investigate a novel type of online sequential decision problem under uncertainty, namely mixed observability Markov process with time-varying interval-valued parameters (MOMDP-TVIVP). Such data-driven optimization problems learning widely have real-world applications (e.g., coordinating surveillance and intervention activities limited resources for pandemic control). Solving MOMDP-TVIVP is great challenge as system identification reoptimization based on newly observational data are required considering the unobserved states parameters. Moreover, many practical problems, action state spaces intractably large optimization. To address this challenge, we propose transfer reinforcement (TRL)-based algorithmic approach that ingrates (TL) into deep (DRL) in an offline-online scheme. accelerate reoptimization, pretrain collection promising networks fine-tune them acquired system. The hallmark our comes from combining strong approximation ability neural high flexibility TL through efficiently adapting previously learned policy to changes dynamics. Computational study different uncertainty configurations scales shows outperforms existing methods solution optimality, robustness, efficiency, scalability. also demonstrate value fine-tuning by comparing TRL DRL, which at least 21% improvement can be yielded no more than 0.62% time spent pretraining each period instances continuous state-action space modest dimensionality. A retrospective control use case Shanghai, China improved making via several public health metrics. Our first-ever endeavor employing intensive network training solving processes requiring reoptimization. History: Accepted Paul Brooks, Area Editor Applications Biology, Medicine, & Healthcare. Funding: This work was supported part National Natural Science Foundation [Grants 72371051 72201047] first second authors [Grant 1825725] third author. Supplemental Material: software supports findings available within paper its Information ( https://pubsonline.informs.org/doi/suppl/10.1287/ijoc.2022.0236 ) well IJOC GitHub repository https://github.com/INFORMSJoC/2022.0236 ). complete Software Data Repository https://informsjoc.github.io/ .

Язык: Английский

Collusion by mistake: Does algorithmic sophistication drive supra-competitive profits? DOI Creative Commons
Ibrahim Abada, Xavier Lambin,

Nikolay Tchakarov

и другие.

European Journal of Operational Research, Год журнала: 2024, Номер 318(3), С. 927 - 953

Опубликована: Июнь 22, 2024

A burgeoning literature shows that self-learning algorithms may, under some conditions, reach seemingly-collusive outcomes: after repeated interaction, competing earn supra-competitive profits, at the expense of efficiency and consumer welfare. This paper offers evidence such behavior can stem from insufficient exploration during learning process algorithmic sophistication might increase competition. In particular, we show allowing for more thorough does lead otherwise Q-learning to play competitively. We first provide a theoretical illustration this phenomenon by analyzing competition between two stylized in Prisoner's Dilemma framework. Second, via simulations, sophisticated exploit ones. Following these results, argue advancement computational capabilities situations, solution challenge seeming collusion, rather than exacerbate it.

Язык: Английский

Процитировано

7

Review-based recommendation under preference uncertainty: An asymmetric deep learning framework DOI

Yingqiu Xiong,

Yezheng Liu, Yang Qian

и другие.

European Journal of Operational Research, Год журнала: 2024, Номер 316(3), С. 1044 - 1057

Опубликована: Фев. 3, 2024

Язык: Английский

Процитировано

5

A K-means Supported Reinforcement Learning Framework to Multi-dimensional Knapsack DOI Creative Commons
Sabah Bushaj, İ. Esra Büyüktahtakın

Journal of Global Optimization, Год журнала: 2024, Номер 89(3), С. 655 - 685

Опубликована: Фев. 15, 2024

Abstract In this paper, we address the difficulty of solving large-scale multi-dimensional knapsack instances (MKP), presenting a novel deep reinforcement learning (DRL) framework. DRL framework, train different agents compatible with discrete action space for sequential decision-making while still satisfying any resource constraint MKP. This framework incorporates decision variable values in 2D where agent is responsible assigning value 1 or 0 to each variables. To best our knowledge, first model its kind which environment formulated, and an element solution matrix represents item Our configured solve MKP dimensions distributions. We propose K-means approach obtain initial feasible that used agent. four present results comparing them CPLEX commercial solver. The show can learn generalize over sizes shows it medium-sized at least 45 times faster CPU time 10 large instances, maximum gap 0.28% compared performance CPLEX. Furthermore, 95% items are predicted line solution. Computations also provide better optimality respect state-of-the-art approaches.

Язык: Английский

Процитировано

5

Digital Transformation and Organizational Performance DOI
Amira Khelil

Advances in human resources management and organizational development book series, Год журнала: 2025, Номер unknown, С. 109 - 132

Опубликована: Фев. 7, 2025

Although artificial intelligence has driven digital transformation in several countries and sectors, many large companies are still lagging behind adopting these technologies. In fact, business managers remain unaware of the strategic role AI can play. Therefore, explaining potential its implications could be a viable solution to address this issue. context, chapter explores how enhance organizational performance by developing dynamic capabilities. Using survey-based approach, we collected data from multinational firms Tunisia examine indirect effect adoption on performance. Data was gathered 226 analyzed through structural equation modeling. Our findings reveal that positively impacts three key capabilities: exploration innovation, decision-making speed, exploitation innovation. These results highlight benefits firms, fostering capabilities that, turn,

Язык: Английский

Процитировано

0

New Frontiers in Machine Learning Optimization DOI

Pooja Dehankar,

Susanta Das

Advances in computational intelligence and robotics book series, Год журнала: 2025, Номер unknown, С. 427 - 454

Опубликована: Янв. 10, 2025

Machine learning (ML) optimization techniques serve as essential for training models to achieve high performance in a diverse areas. This chapter offers thorough summary of machine techniques. analysis the development over time. A number common constraints are also discussed. Developing model that works effectively and provides accurate predictions certain set instances is main objective ML. We require ML accomplish that. The practice modifying hyper parameters with an technique minimize cost function called optimization. Because indicates difference between actual value estimated parameter predicted by model, it crucial reduce it. will provide general explanation workings drawbacks strategies. Numerous advancements have been put forth this chapter.

Язык: Английский

Процитировано

0

Explainable Artificial Intelligence for Business and Economics: Methods, Applications and Challenges DOI Creative Commons
Qi Lyu, Shaomin Wu

Expert Systems, Год журнала: 2025, Номер 42(4)

Опубликована: Фев. 26, 2025

ABSTRACT In recent years, artificial intelligence (AI) has made significant strides in research and shown great potential various application fields, including business economics (B&E). However, AI models are often black boxes, making them difficult to understand explain. This challenge can be addressed using eXplainable Artificial Intelligence (XAI), which helps humans the factors driving decisions, thereby increasing transparency confidence results. paper aims provide a comprehensive understanding of state‐of‐the‐art on XAI B&E by conducting an extensive literature review. It introduces novel approach categorising techniques from three different perspectives: samples, features modelling method. Additionally, identifies key challenges corresponding opportunities field. We hope that this work will promote adoption B&E, inspire interdisciplinary collaboration, foster innovation growth ensure explainability.

Язык: Английский

Процитировано

0

Data-driven optimization for drone delivery service planning with online demand DOI Creative Commons

Aditya Paul,

Michael W. Levin, S. Travis Waller

и другие.

Transportation Research Part E Logistics and Transportation Review, Год журнала: 2025, Номер 198, С. 104095 - 104095

Опубликована: Апрель 2, 2025

Язык: Английский

Процитировано

0

A Location-Production-Routing Problem for Distributed Manufacturing Platforms: A Neural Genetic Algorithm Solution Methodology DOI Creative Commons

Behrang Bootaki,

Guoqing Zhang

International Journal of Production Economics, Год журнала: 2024, Номер 275, С. 109325 - 109325

Опубликована: Июль 3, 2024

Язык: Английский

Процитировано

3

Optimization of three-dimensional urban underground logistics system alignment: a deep reinforcement learning approach DOI

Longlong Hou,

XU Yuan-xian,

Rui Ren

и другие.

Computers & Industrial Engineering, Год журнала: 2025, Номер unknown, С. 111185 - 111185

Опубликована: Май 1, 2025

Язык: Английский

Процитировано

0

Learning Optimal Solutions via an LSTM-Optimization Framework DOI Open Access
Dogacan Yilmaz, İ. Esra Büyüktahtakın

Operations Research Forum, Год журнала: 2023, Номер 4(2)

Опубликована: Июнь 6, 2023

Язык: Английский

Процитировано

8