Optica Publishing Group

Network-aware compute and memory allocation in optically composable data centers with deep reinforcement learning and graph neural networks

Abstract

Composable data center architectures promise a means of pooling resources remotely within data centers, allowing for both greater flexibility and greater resource efficiency that underpin the increasingly important infrastructure-as-a-service business. This can be accomplished by using an optically circuit-switched backbone in the data center network (DCN), which provides the bandwidth and latency guarantees required to ensure reliable performance when applications run across non-local resource pools. However, resource allocation in this scenario requires both server-level and network-level resources to be co-allocated to requests. The online nature and underlying combinatorial complexity of this problem, alongside the typical scale of DCN topologies, make exact solutions impossible and heuristic-based solutions sub-optimal or non-intuitive to design. We demonstrate that deep reinforcement learning, where the policy is modeled by a graph neural network, can be used to learn effective network-aware and topologically scalable allocation policies end-to-end. Compared to state-of-the-art heuristics for network-aware resource allocation, the method achieves up to a 20% higher acceptance ratio, can achieve the same acceptance ratio as the best-performing heuristic with $3 \times$ fewer networking resources available, and can maintain all-around performance when directly applied (with no further training) to DCN topologies with ${10^2} \times$ more servers than the topologies seen during training.
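
To make the co-allocation problem and the acceptance-ratio metric concrete, the following is a minimal sketch of a greedy, network-aware heuristic on a toy two-tier model. It is not the paper's method (which learns a policy with deep reinforcement learning and a graph neural network), and it is not one of the heuristics the paper compares against. The names `Rack`, `Request`, `try_allocate`, and `acceptance_ratio` are hypothetical, as is the assumption that a placement spanning several racks reserves a fixed amount of bandwidth on each involved rack's uplink.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass
class Rack:
    free_compute: list[int]  # free compute units per server (toy model)
    free_uplink: int         # free bandwidth units on the rack's uplink (assumed)


@dataclass
class Request:
    compute: int    # total compute units requested
    bandwidth: int  # uplink bandwidth reserved per rack if the placement spans racks


def _plan(racks, rack_ids, demand):
    """First-fit plan over the given racks; returns [(rack, server, units)] or None."""
    plan = []
    for r in rack_ids:
        for s, free in enumerate(racks[r].free_compute):
            take = min(free, demand)
            if take > 0:
                plan.append((r, s, take))
                demand -= take
            if demand == 0:
                return plan
    return None  # not enough free compute in these racks


def try_allocate(racks, req):
    """Accept a request only if compute AND network demands can both be met."""
    # Prefer single-rack placements (no network resources), then span all racks.
    candidates = [[r] for r in range(len(racks))] + [list(range(len(racks)))]
    for rack_ids in candidates:
        plan = _plan(racks, rack_ids, req.compute)
        if plan is None:
            continue
        used = {r for r, _, _ in plan}
        spans_racks = len(used) > 1
        if spans_racks and any(racks[r].free_uplink < req.bandwidth for r in used):
            continue  # compute fits, but the network cannot carry it
        # Commit server-level and network-level resources together.
        for r, s, take in plan:
            racks[r].free_compute[s] -= take
        if spans_racks:
            for r in used:
                racks[r].free_uplink -= req.bandwidth
        return True
    return False  # rejected; no state was modified


def acceptance_ratio(racks, requests):
    """Fraction of requests, processed online in arrival order, that are accepted."""
    accepted = sum(try_allocate(racks, q) for q in requests)
    return accepted / len(requests)
```

In this sketch a request is rejected as soon as either resource type is insufficient, which is why scarce uplink bandwidth can lower the acceptance ratio even when plenty of compute remains free.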

© 2023 Optica Publishing Group

More Like This
QoS-aware data center network reconfiguration method based on deep reinforcement learning

Xiaotao Guo, Fulong Yan, Xuwei Xue, Bitao Pan, George Exarchakos, and Nicola Calabretta
J. Opt. Commun. Netw. 13(5) 94-107 (2021)

Experimental evaluation of a latency-aware routing and spectrum assignment mechanism based on deep reinforcement learning

C. Hernández-Chulde, R. Casellas, R. Martínez, R. Vilalta, and R. Muñoz
J. Opt. Commun. Netw. 15(11) 925-937 (2023)

Reconfiguring multicast sessions in elastic optical networks adaptively with graph-aware deep reinforcement learning

Xiaojian Tian, Baojia Li, Rentao Gu, and Zuqing Zhu
J. Opt. Commun. Netw. 13(11) 253-265 (2021)

Figures (6)

Tables (3)
