Techniques for applying reinforcement learning to routing and wavelength assignment problems in optical fiber communication networks

Josh W. Nevin; Sam Nallaperuma; Nikita A. Shevchenko; Zacharaya Shabka; Georgios Zervas; Seb J. Savory

doi:10.1364/JOCN.460629

Journal of Optical Communications and Networking
Vol. 14,
Issue 9,
pp. 733-748
(2022)
•https://doi.org/10.1364/JOCN.460629

Techniques for applying reinforcement learning to routing and wavelength assignment problems in optical fiber communication networks

Josh W. Nevin, Sam Nallaperuma, Nikita A. Shevchenko, Zacharaya Shabka, Georgios Zervas, and Seb J. Savory

Not Accessible

Your library or personal account may give you access

Get PDF
Email
Share
Get Citation
Copy Citation Text
Josh W. Nevin, Sam Nallaperuma, Nikita A. Shevchenko, Zacharaya Shabka, Georgios Zervas, and Seb J. Savory, "Techniques for applying reinforcement learning to routing and wavelength assignment problems in optical fiber communication networks," J. Opt. Commun. Netw. 14, 733-748 (2022)

Export Citation
- BibTex
- Endnote (RIS)
- HTML
- Plain Text
Citation alert
Save article

Abstract

We propose a novel application of reinforcement learning (RL) with invalid action masking and a novel training methodology for routing and wavelength assignment (RWA) in fixed-grid optical networks and demonstrate the generalizability of the learned policy to a realistic traffic matrix unseen during training. Through the introduction of invalid action masking and a new training method, the applicability of RL to RWA in fixed-grid networks is extended from considering connection requests between nodes to servicing demands of a given bit rate, such that lightpaths can be used to service multiple demands subject to capacity constraints. We outline the additional challenges involved for this RWA problem, for which we found that standard RL had low performance compared to that of baseline heuristics, in comparison with the connection requests RWA problem considered in the literature. Thus, we propose invalid action masking and a novel training method to improve the efficacy of the RL agent. With invalid action masking, domain knowledge is embedded in the RL model to constrain the action space of the RL agent to lightpaths that can support the current request, reducing the size of the action space and thus increasing the efficacy of the agent. In the proposed training method, the RL model is trained on a simplified version of the problem and evaluated on the target RWA problem, increasing the efficacy of the agent compared with training directly on the target problem. RL with invalid action masking and this training method outperforms standard RL and three state-of-the-art heuristics, namely, $k$ shortest path first fit, first-fit $k$ shortest path, and $k$ shortest path most utilized, consistently across uniform and nonuniform traffic in terms of the number of accepted transmission requests for two real-world core topologies, NSFNET and COST–239. The RWA runtime of the proposed RL model is comparable to that of these heuristic approaches, demonstrating the potential for real-world applicability. Moreover, we show that the RL agent trained on uniform traffic is able to generalize well to a realistic nonuniform traffic distribution not seen during training, thus outperforming the heuristics for this traffic. Visualization of the learned RWA policy reveals an RWA strategy that differs significantly from those of the heuristic baselines in terms of the distribution of services across channels and the distribution across links.

Full Article | PDF Article

More Like This

Interpreting multi-objective reinforcement learning for routing and wavelength assignment in optical networks

Sam Nallaperuma, Zelin Gan, Josh Nevin, Mykyta Shevchenko, and Seb J. Savory
J. Opt. Commun. Netw. 15(8) 497-506 (2023)

Pre- and post-processing techniques for reinforcement-learning-based routing and spectrum assignment in elastic optical networks

Takafumi Tanaka and Masayuki Shimoda
J. Opt. Commun. Netw. 15(12) 1019-1029 (2023)

Routing in optical transport networks with deep reinforcement learning

José Suárez-Varela, Albert Mestres, Junlin Yu, Li Kuang, Haoyu Feng, Albert Cabellos-Aparicio, and Pere Barlet-Ros
J. Opt. Commun. Netw. 11(11) 547-558 (2019)

Previous Article Next Article

Cited By

You do not have subscription access to this journal. Cited by links are available to subscribers only. You may subscribe either as an Optica member, or as an authorized user of your institution.

Contact your librarian or system administrator
or
Login to access Optica Member Subscription

Figures (16)

You do not have subscription access to this journal. Figure files are available to subscribers only. You may subscribe either as an Optica member, or as an authorized user of your institution.

Contact your librarian or system administrator
or
Login to access Optica Member Subscription

Tables (7)

You do not have subscription access to this journal. Article tables are available to subscribers only. You may subscribe either as an Optica member, or as an authorized user of your institution.

Contact your librarian or system administrator
or
Login to access Optica Member Subscription

Equations (15)

You do not have subscription access to this journal. Equations are available to subscribers only. You may subscribe either as an Optica member, or as an authorized user of your institution.

Contact your librarian or system administrator
or
Login to access Optica Member Subscription

Parameter	Value	Units
Notional carrier wavelength ( $λ_{0}$ )	1550	nm
Symbol rate $(R_{S})$	100	GBd
WDM channel spacing	100	GHz
Total modulated bandwidth $(B)$	10	THz
Loss coefficient $(α)$	0.2	$dB / km$
Fiber GVD coefficient $(β_{2})$	–21.7	$p s^{2} / km$
Nonlinear coefficient $(γ)$	1.2	$/ W / km$
Lumped amplifier spacing $(L_{s})$	100	km
Lumped amplifier noise figure $(NF)$	4.5	dB

	kSP-FF	kSP-MU	FF-kSP	RL
Median	6710	6525	6818	7002
Mean	6701	6543	6820	7002
Min	6545	6234	6674	6857
Max	6831	6841	6964	7159
SD	55	175	63	59
IQR	80	332	75	83

	kSP-FF	kSP-MU	FF-kSP	RL
Median	15,156	14,208	14,624	15,279
Mean	15,156	14,170	14,624	15,283
Min	14,968	12,921	14,308	15,004
Max	15,333	15,345	14,865	15,549
SD	80	1015	126	110
IQR	119	2012	163	141

$ID$	1	2	3	4	5	6	7	8	9	10	11	12	13	14
1	0	47	76	9	17	94	5	39	40	32	30	63	27	2
2	47	0	157	20	36	195	11	80	82	67	63	131	55	4
3	76	157	0	31	57	312	19	129	131	107	101	208	89	7
4	9	20	31	0	7	39	2	16	16	13	13	26	11	1
5	17	36	57	7	0	71	4	28	30	23	23	47	20	2
6	94	195	312	39	71	0	23	160	164	134	126	261	111	8
7	5	11	19	2	4	23	0	10	10	8	8	16	7	1
8	39	80	129	16	28	160	10	0	67	55	52	108	45	2
9	40	82	131	16	30	164	10	67	0	56	53	110	46	4
10	32	67	107	13	23	134	8	55	56	0	43	90	38	2
11	30	63	101	13	23	126	8	52	53	43	0	84	36	2
12	63	131	208	26	47	261	16	108	110	90	84	0	74	5
13	27	55	89	11	20	111	7	45	46	38	36	74	0	2
14	2	4	7	1	2	8	1	2	4	2	2	5	2	0

$ID$	1	2	3	4	5	6	7	8	9	10	11
1	0	177	34	21	30	60	14	19	10	5	20
2	177	0	480	303	432	847	200	267	142	91	283
3	34	480	0	57	83	164	38	51	27	17	54
4	21	303	57	0	52	103	23	32	17	11	34
5	30	432	83	52	0	147	34	46	23	16	49
6	60	847	164	103	147	0	68	91	47	31	95
7	14	200	38	23	34	68	0	21	11	7	22
8	19	267	51	32	46	91	21	0	15	10	30
9	10	142	27	17	23	47	11	15	0	5	16
10	5	91	17	11	16	31	7	10	5	0	10
11	20	283	54	34	49	95	22	30	16	10	0

$ID$	1	2	3	4	5	6	7	8	9	10	11	12	13	14
1	0	47	76	9	17	94	5	39	40	32	30	63	27	2
2	47	0	157	20	36	195	11	80	82	67	63	131	55	4
3	76	157	0	31	57	312	19	129	131	107	101	208	89	7
4	9	20	31	0	7	39	2	16	16	13	13	26	11	1
5	17	36	57	7	0	71	4	28	30	23	23	47	20	2
6	94	195	312	39	71	0	23	160	164	134	126	261	111	8
7	5	11	19	2	4	23	0	10	10	8	8	16	7	1
8	39	80	129	16	28	160	10	0	67	55	52	108	45	2
9	40	82	131	16	30	164	10	67	0	56	53	110	46	4
10	32	67	107	13	23	134	8	55	56	0	43	90	38	2
11	30	63	101	13	23	126	8	52	53	43	0	84	36	2
12	63	131	208	26	47	261	16	108	110	90	84	0	74	5
13	27	55	89	11	20	111	7	45	46	38	36	74	0	2
14	2	4	7	1	2	8	1	2	4	2	2	5	2	0

$ID$	1	2	3	4	5	6	7	8	9	10	11
1	0	177	34	21	30	60	14	19	10	5	20
2	177	0	480	303	432	847	200	267	142	91	283
3	34	480	0	57	83	164	38	51	27	17	54
4	21	303	57	0	52	103	23	32	17	11	34
5	30	432	83	52	0	147	34	46	23	16	49
6	60	847	164	103	147	0	68	91	47	31	95
7	14	200	38	23	34	68	0	21	11	7	22
8	19	267	51	32	46	91	21	0	15	10	30
9	10	142	27	17	23	47	11	15	0	5	16
10	5	91	17	11	16	31	7	10	5	0	10
11	20	283	54	34	49	95	22	30	16	10	0

	kSP-FF	kSP-MU	FF-kSP	RL
Median	6168	6021	6093	6315
Mean	6171	6011	6090	6313
Min	6023	5690	5921	6167
Max	6295	6295	6248	6465
SD	56	170	76	66
IQR	81	318	98	96

	kSP-FF	kSP-MU	FF-kSP	RL
Median	11,610	11,560	11,975	12,106
Mean	11,611	11,542	11,990	12,106
Min	11,455	11,166	11,736	11,976
Max	11,775	11,864	12,198	12,229
SD	65	140	102	70
IQR	84	227	132	74

$ID$	1	2	3	4	5	6	7	8	9	10	11	12	13	14
1	0	47	76	9	17	94	5	39	40	32	30	63	27	2
2	47	0	157	20	36	195	11	80	82	67	63	131	55	4
3	76	157	0	31	57	312	19	129	131	107	101	208	89	7
4	9	20	31	0	7	39	2	16	16	13	13	26	11	1
5	17	36	57	7	0	71	4	28	30	23	23	47	20	2
6	94	195	312	39	71	0	23	160	164	134	126	261	111	8
7	5	11	19	2	4	23	0	10	10	8	8	16	7	1
8	39	80	129	16	28	160	10	0	67	55	52	108	45	2
9	40	82	131	16	30	164	10	67	0	56	53	110	46	4
10	32	67	107	13	23	134	8	55	56	0	43	90	38	2
11	30	63	101	13	23	126	8	52	53	43	0	84	36	2
12	63	131	208	26	47	261	16	108	110	90	84	0	74	5
13	27	55	89	11	20	111	7	45	46	38	36	74	0	2
14	2	4	7	1	2	8	1	2	4	2	2	5	2	0

$ID$	1	2	3	4	5	6	7	8	9	10	11
1	0	177	34	21	30	60	14	19	10	5	20
2	177	0	480	303	432	847	200	267	142	91	283
3	34	480	0	57	83	164	38	51	27	17	54
4	21	303	57	0	52	103	23	32	17	11	34
5	30	432	83	52	0	147	34	46	23	16	49
6	60	847	164	103	147	0	68	91	47	31	95
7	14	200	38	23	34	68	0	21	11	7	22
8	19	267	51	32	46	91	21	0	15	10	30
9	10	142	27	17	23	47	11	15	0	5	16
10	5	91	17	11	16	31	7	10	5	0	10
11	20	283	54	34	49	95	22	30	16	10	0

$ID$	1	2	3	4	5	6	7	8	9	10	11	12	13	14
1	0	47	76	9	17	94	5	39	40	32	30	63	27	2
2	47	0	157	20	36	195	11	80	82	67	63	131	55	4
3	76	157	0	31	57	312	19	129	131	107	101	208	89	7
4	9	20	31	0	7	39	2	16	16	13	13	26	11	1
5	17	36	57	7	0	71	4	28	30	23	23	47	20	2
6	94	195	312	39	71	0	23	160	164	134	126	261	111	8
7	5	11	19	2	4	23	0	10	10	8	8	16	7	1
8	39	80	129	16	28	160	10	0	67	55	52	108	45	2
9	40	82	131	16	30	164	10	67	0	56	53	110	46	4
10	32	67	107	13	23	134	8	55	56	0	43	90	38	2
11	30	63	101	13	23	126	8	52	53	43	0	84	36	2
12	63	131	208	26	47	261	16	108	110	90	84	0	74	5
13	27	55	89	11	20	111	7	45	46	38	36	74	0	2
14	2	4	7	1	2	8	1	2	4	2	2	5	2	0

$ID$	1	2	3	4	5	6	7	8	9	10	11
1	0	177	34	21	30	60	14	19	10	5	20
2	177	0	480	303	432	847	200	267	142	91	283
3	34	480	0	57	83	164	38	51	27	17	54
4	21	303	57	0	52	103	23	32	17	11	34
5	30	432	83	52	0	147	34	46	23	16	49
6	60	847	164	103	147	0	68	91	47	31	95
7	14	200	38	23	34	68	0	21	11	7	22
8	19	267	51	32	46	91	21	0	15	10	30
9	10	142	27	17	23	47	11	15	0	5	16
10	5	91	17	11	16	31	7	10	5	0	10
11	20	283	54	34	49	95	22	30	16	10	0

$ID$	1	2	3	4	5	6	7	8	9	10	11	12	13	14
1	0	47	76	9	17	94	5	39	40	32	30	63	27	2
2	47	0	157	20	36	195	11	80	82	67	63	131	55	4
3	76	157	0	31	57	312	19	129	131	107	101	208	89	7
4	9	20	31	0	7	39	2	16	16	13	13	26	11	1
5	17	36	57	7	0	71	4	28	30	23	23	47	20	2
6	94	195	312	39	71	0	23	160	164	134	126	261	111	8
7	5	11	19	2	4	23	0	10	10	8	8	16	7	1
8	39	80	129	16	28	160	10	0	67	55	52	108	45	2
9	40	82	131	16	30	164	10	67	0	56	53	110	46	4
10	32	67	107	13	23	134	8	55	56	0	43	90	38	2
11	30	63	101	13	23	126	8	52	53	43	0	84	36	2
12	63	131	208	26	47	261	16	108	110	90	84	0	74	5
13	27	55	89	11	20	111	7	45	46	38	36	74	0	2
14	2	4	7	1	2	8	1	2	4	2	2	5	2	0

$ID$	1	2	3	4	5	6	7	8	9	10	11
1	0	177	34	21	30	60	14	19	10	5	20
2	177	0	480	303	432	847	200	267	142	91	283
3	34	480	0	57	83	164	38	51	27	17	54
4	21	303	57	0	52	103	23	32	17	11	34
5	30	432	83	52	0	147	34	46	23	16	49
6	60	847	164	103	147	0	68	91	47	31	95
7	14	200	38	23	34	68	0	21	11	7	22
8	19	267	51	32	46	91	21	0	15	10	30
9	10	142	27	17	23	47	11	15	0	5	16
10	5	91	17	11	16	31	7	10	5	0	10
11	20	283	54	34	49	95	22	30	16	10	0

Abstract

Cited By

Figures (16)

Tables (7)

Equations (15)

Journal of Optical Communications and Networking

$ID$	1	2	3	4	5	6	7	8	9	10	11	12	13	14
1	0	47	76	9	17	94	5	39	40	32	30	63	27	2
2	47	0	157	20	36	195	11	80	82	67	63	131	55	4
3	76	157	0	31	57	312	19	129	131	107	101	208	89	7
4	9	20	31	0	7	39	2	16	16	13	13	26	11	1
5	17	36	57	7	0	71	4	28	30	23	23	47	20	2
6	94	195	312	39	71	0	23	160	164	134	126	261	111	8
7	5	11	19	2	4	23	0	10	10	8	8	16	7	1
8	39	80	129	16	28	160	10	0	67	55	52	108	45	2
9	40	82	131	16	30	164	10	67	0	56	53	110	46	4
10	32	67	107	13	23	134	8	55	56	0	43	90	38	2
11	30	63	101	13	23	126	8	52	53	43	0	84	36	2
12	63	131	208	26	47	261	16	108	110	90	84	0	74	5
13	27	55	89	11	20	111	7	45	46	38	36	74	0	2
14	2	4	7	1	2	8	1	2	4	2	2	5	2	0

$ID$	1	2	3	4	5	6	7	8	9	10	11
1	0	177	34	21	30	60	14	19	10	5	20
2	177	0	480	303	432	847	200	267	142	91	283
3	34	480	0	57	83	164	38	51	27	17	54
4	21	303	57	0	52	103	23	32	17	11	34
5	30	432	83	52	0	147	34	46	23	16	49
6	60	847	164	103	147	0	68	91	47	31	95
7	14	200	38	23	34	68	0	21	11	7	22
8	19	267	51	32	46	91	21	0	15	10	30
9	10	142	27	17	23	47	11	15	0	5	16
10	5	91	17	11	16	31	7	10	5	0	10
11	20	283	54	34	49	95	22	30	16	10	0

$ID$	1	2	3	4	5	6	7	8	9	10	11	12	13	14
1	0	47	76	9	17	94	5	39	40	32	30	63	27	2
2	47	0	157	20	36	195	11	80	82	67	63	131	55	4
3	76	157	0	31	57	312	19	129	131	107	101	208	89	7
4	9	20	31	0	7	39	2	16	16	13	13	26	11	1
5	17	36	57	7	0	71	4	28	30	23	23	47	20	2
6	94	195	312	39	71	0	23	160	164	134	126	261	111	8
7	5	11	19	2	4	23	0	10	10	8	8	16	7	1
8	39	80	129	16	28	160	10	0	67	55	52	108	45	2
9	40	82	131	16	30	164	10	67	0	56	53	110	46	4
10	32	67	107	13	23	134	8	55	56	0	43	90	38	2
11	30	63	101	13	23	126	8	52	53	43	0	84	36	2
12	63	131	208	26	47	261	16	108	110	90	84	0	74	5
13	27	55	89	11	20	111	7	45	46	38	36	74	0	2
14	2	4	7	1	2	8	1	2	4	2	2	5	2	0

$ID$	1	2	3	4	5	6	7	8	9	10	11
1	0	177	34	21	30	60	14	19	10	5	20
2	177	0	480	303	432	847	200	267	142	91	283
3	34	480	0	57	83	164	38	51	27	17	54
4	21	303	57	0	52	103	23	32	17	11	34
5	30	432	83	52	0	147	34	46	23	16	49
6	60	847	164	103	147	0	68	91	47	31	95
7	14	200	38	23	34	68	0	21	11	7	22
8	19	267	51	32	46	91	21	0	15	10	30
9	10	142	27	17	23	47	11	15	0	5	16
10	5	91	17	11	16	31	7	10	5	0	10
11	20	283	54	34	49	95	22	30	16	10	0