Segmentation of anatomical layers and imaging artifacts in intravascular polarization sensitive optical coherence tomography using attending physician and boundary cardinality losses

Mohammad Haft-Javaherian; Mohammad Haft-Javaherian; Martin Villiger; Kenichiro Otsuka; Joost Daemen; Peter Libby; Polina Golland; Brett E. Bouma; Brett E. Bouma

doi:10.1364/BOE.514673

1. Introduction

Despite progress with effective therapies for treating acute coronary events, their prediction and prevention continue to present a major clinical challenge [1]. More than one million individuals suffered from acute coronary events last year in the United States alone [2]. In addition to pharmacological medical therapy, patients suffering an acute coronary event frequently receive percutaneous coronary intervention (PCI). PCI is similarly used in patients with chronic coronary syndrome.

Intravascular (IV) optical coherence tomography (OCT) is increasingly used for guiding PCI. It acquires high resolution images of the subsurface microstructure of coronary atherosclerotic lesions [3,4] and helps with identification of the culprit lesion, stent sizing, and confirming stent implantation. The use of intravascular imaging, such as IV-OCT, is supported by multiple randomized clinical trials [5–7] and has been demonstrated to reduce major adverse cardiac events in complex lesions [8]. Nonetheless, clinical adoption of IV-OCT has been modest [9,10]. This contrasts with its important role as an essential clinical research instrument for investigating the pathophysiology of coronary atherosclerosis [11]. One factor impairing more widespread use may be the image contrast, signal statistics, and speckle characteristics specific to OCT, which complicate image interpretation. Extensive training affords refined interpretation of pullback data sets but is based nearly exclusively on subjective criteria that are difficult to learn, which results in modest intra- and inter-reader agreement [12]. Furthermore, the clinical workflow precludes time-consuming interpretation, emphasizing the need for automated analysis that presents the operator with clear indications to guide the intervention. However, until the recent introduction of Ultreon by Abbott [13], and the HyperView imaging system by SpectraWave [14], image processing algorithms used in the catheterization laboratories have primarily been limited to lumen segmentation, disregarding subsurface vessel morphology. The automatic detection of anatomical layers within the vessel wall and other features beyond the lumen promises to refine the guidance of PCI and simplify translational research in the catheterization laboratory by eliminating the need for extensive training and time-consuming manual segmentation.

Polarization sensitive (PS) OCT is an extension of conventional OCT. It employs polarization-diverse detection to measure the polarization state of the detected light. It furthermore modulates the polarization state of the illumination to probe the polarization-dependence of the vessel wall. In parallel to conventional OCT (Fig. 1.A), the additional polarization-resolved measurements of PS-OCT afford reconstruction of images of tissue birefringence and depolarization (Fig. 1.B-C), providing extra contrast that is complementary to the conventional OCT signal. Microscopic PS-OCT in human aortic plaques reports tissue birefringence that can quantify collagen and smooth muscle cell content features, which play an important role in plaque stability and vascular healing [15]. Intravascular polarimetry with PS-OCT performs similar measurements of coronary atherosclerosis through an IV-OCT catheter [16]. In addition to tissue birefringence, intravascular polarimetry also measures depolarization, which is increased in tissues containing lipid particles, macrophage accumulations, or cholesterol crystals, as confirmed by correlation with histology in a human cadaver heart study [16]. Although intravascular polarimetry requires a dedicated image console, it is compatible with current intravascular imaging catheters, which facilitated its translation into the clinic [17,18].

Fig. 1. Expert annotation of IV PS-OCT images. An example of a multi-channel IV PS-OCT cross-sectional image, including backscatter signal intensity (A), birefringence (B), and depolarization (C) channels. The white arrow highlights a section of the media that exhibits little contrast in the intensity signal, but is readily identified in the birefringence image. D. Manual annotation using all three channels of PS-OCT by an expert using our Matlab graphical user interface. E. Inclusive pixel-level labels derived from the manual annotation (see Data Section). F. Equivalent exclusive labels defined in Table 1. Scale bar: 2 mm.

Download Full Size | PDF

Imaging in patients confirmed the improved image contrast available to polarimetric measurements. In particular, the smooth muscle cell-rich tunica media features consistent and high birefringence, often separated from the adjacent intima and adventitia layers by fine bands of low birefringence, co-locating with the internal and external elastic laminae (IEL, EEL). Depolarization enhances lipid-rich lesions and simplifies their differentiation from calcification, which can have a similar appearance in conventional intensity tomograms. The ability to consistently measure the EEL diameter along the entire coronary would be highly relevant for the sizing of stent diameter and length [19].

Intravascular polarimetry also offers a window of opportunity for the prospective identification of remote lesions with a propensity for causing subsequent acute events. There remains a high rate of recurrent coronary events within only a few years following initial PCI, caused in about 50% of the cases by a lesion not involved in the original event [20]. Plaques that rupture typically are depleted of collagen [21,22], and are expected to be lowly birefringent. The fibrous caps of target lesions in patients with chronic coronary syndrome featured indeed significantly higher birefringence than the caps of patients with acute coronary syndrome [18]. Combined, intravascular polarimetry converts the polarization properties of tissues into endogenous imaging contrast that may facilitate segmentation of subsurface features that are difficult to appreciate in conventional IV-OCT and could in turn enable improved guidance of PCI, as well as refined assessment of remote lesions.

We build on the success of deep learning methods for segmentation tasks, which have been used widely for segmenting conventional IV-OCT images. This paper leverages the additional image contrasts available to intravascular polarimetry (birefringence and depolarization in addition to conventional intensity), which have not previously been used to improve the segmentation of anatomical layers of the coronary artery. This study proposes a convolutional neural network accepting the appended multi-channel polarimetric images as input and optimizes its performance using a new multi-term loss function.

The multi-term loss function includes two common segmentation loss terms, i.e., weighted cross-entropy loss and generalized multi-class Dice loss. In addition to the common segmentation loss terms, a boundary loss term focuses on the accuracy of the model only within the pixels close to the boundaries between the anatomical layers. The boundary loss is suitable for problems requiring precise object boundary detection. Similarly, "boundary cardinality loss" penalizes the model from a topological point of view when the number of anatomical layers is different between the model’s prediction and ground-truth by counting the number of boundary pixels along the radial axis. The boundary cardinality loss imposes a topological prior on the layered tissues. Additionally, a feature denoted “attending physician loss” uses an independently-trained critique model, which distinguishes between low- and high-quality labels. The attending physician loss enables the utilization of the auxiliary information embedded in datasets with heterogeneous manual labeling qualities.

We trained and evaluated our method on a set of 984 images from 57 patients and compared it to the performance of state-of-the-art algorithms reported in the literature. Our work is the first demonstration of automatic segmentation of anatomical layers and the shadow artifacts arising from the guidewire and lipid-rich lesions using intravascular PS-OCT. It improves the boundary detection of the coronary lumen compared to other methods, and identifies the guidewire and plaque shadows in a single step. Furthermore, to the best of our knowledge, only two other studies [23,24] reported on the detection accuracy of intima-media and the media-adventitia boundaries, and only in regions of minimal disease, specifically excluding areas with thickened intima characteristic of atherosclerotic lesions.

2. Related work

The clear need for simplified interpretation of IV-OCT images has motivated the development of automated methods, with a focus on lumen segmentation, that tackle the challenges and artifacts presented by typical IV-OCT data.

OCT A-lines are independently acquired along the radial direction of the vessel in cylindrical coordinates as the probe rotates. Simultaneously, the core of the catheter is pulled back through the vessel, resulting in a helical scan pattern. Contemporary IV-OCT reconstructs individual A-lines from measurements in the frequency domain and visualizes the logarithm of the power of the reconstructed signal (dB). The presence of the guidewire, used to safely deploy the intravascular imaging catheter, casts a shadow on the vessel wall. The resulting intrinsic discontinuity even in the lumen signal creates an artifact that needs to be addressed by all analysis approaches. Prior signal processing techniques aimed to analyze A-lines independently in order to exploit the rich characteristics of the OCT signal [25–29].

Researchers proposed various classical segmentation methods to detect the vessel lumen in IV-OCT images. The region-based active contour segmentation methods with level-set energy functions utilize the prior cross-sectional information [30–32]. While dynamic programming [32,33] or small artificial neural networks [34] can be used to correct the remaining artifacts after the application of the primary methods, level-set methods perform poorly in low signal-to-noise (SNR) regions and produce non-smooth and imprecise lumen boundaries. Graphical models, such as graph-cuts, have also been used for segmentation of OCT images as the various boundaries do not intersect making the graphical models well suited for this problem [35,36]. This approach has been used for the detection of internal anatomical layers, i.e., the inner and outer boundary of the media [24], or also including the outer adventitia boundary [23], in areas of minimal intimal thickness. The counterpart physics-based methods formulate the segmentation as a diffusion problem [37–39]. Despite the success of graphical methods, difficulties arise in low SNR regions with speckle, anatomical anomalies, and external objects, which limit the practical applications of these models and cause a cascade of increasing errors that require follow-up manual corrections done by expert annotators in the post-processing stage [40].

Recently, deep learning models have emerged as a solution to many medical image analysis problems, including IV-OCT. Yong et al. [41] used a regression deep learning network to detect the vessel lumen along the radial direction in polar coordinates. Gharaibeh et al. [42] segmented vessel lumen and coronary calcifications in IV-OCT images using a U-Net architecture and post-processed the output with a conditional random field model. Abdolmanafi et al. [43,44] used pre-trained convolutional neural networks (CNN) to identify and classify several tissue types encountered in the coronary arteries using transfer learning. Specific attention has also been paid to the segmentation of stent struts to confirm stent placement and identify malapposition [30,35,39,45,46]. Our model builds on these previous approaches and extends the state-of-the-art deep learning methods to detect guidewire and plaque shadows as well as anatomical layers not only in minimally diseased vessels but also through thickened intima in coronary arteries imaged with intravascular polarimetry.

3. Methods

This section introduces the loss functions tailored for vessel layer and artifact segmentation, describes the training and evaluation procedures, and provides implementation details. Figure 2 illustrates the architecture of our model. As described below, one of our datasets was revised extensively to provide a curated set of high-quality segmentations. This practice resembles the scenario of an attending physician reviewing and scoring manual annotations performed by resident physicians to provide constructive feedback for training. By analogy, we trained a model to critique the multi-class labels conditioned on their input images by distinguishing between the initial and final revisions of the labels. The trained model was used as the attending physician loss term ($\mathcal {L}_{AP}$) to critique the quality of predicted labels by the main model.

Fig. 2. The proposed model architecture. The main model takes the multi-channel polarimetric image as the input and produces a multi-class probability prediction as the output ($\hat {\boldsymbol {\mathcal {Y}}}$). The auxiliary critique model (a.k.a. Attending Physician) is trained independently with concatenated images ($\boldsymbol {\mathcal {I}}$) and ground-truth labels ($\boldsymbol {\mathcal {Y}}$) as its input to predict the quality level of manual labeling. This critique model then evaluates the segmentation during the main model’s training by providing one of the loss terms.

Download Full Size | PDF

3.1 Segmentation loss

We developed a multi-term multivariate loss function that includes novel loss terms. The first loss term is the weighted cross-entropy function

(1)$$\mathcal{L}_{WCE} (\mathbf{y}, \mathbf{\hat{y}}) ={-} \frac{1}{N} \sum_{c=1}^{N_c} \omega_c \sum_{i,j} \left[ \mathbf{y}_{ijc} \log \mathbf{\hat{y}}_{ijc} \right]$$

that measures the cross-entropy between the target label $y$ and predicted label probabilities $\hat {y}$ of all $N$ pixels where $i$ and $j$ are 2D matrix indices, $c$ is the class index, and $N_c$ is the number of classes. Each pixel’s cross-entropy is then weighted proportionately to the inverse of its class population ($\omega _c=\lVert \mathbf {Y} \rVert _1 / \lVert \mathbf {Y}_c \rVert _1$).

The second loss term is a multi-class version of the generalized Dice loss function [47]

(2)$$\mathcal{L}_{Dice} (\mathbf{y}, \mathbf{\hat{y}}) = 1 - \frac{2}{N_c} \sum_{c=1}^{N_c} \frac{\sum_{i,j} \left[ \mathbf{y}_{ijc} \times \mathbf{\hat{y}}_{ijc} \right] + \epsilon}{\sum_{i,j} \left[\mathbf{y}_{ijc}^2 + \mathbf{\hat{y}}_{ijc}^2 \right] + \epsilon}$$

that measures each label’s segmentation accuracy similar to the Dice coefficient. Dice loss uses the prediction probability (e.g., softmax of logits) instead of the classification result and ranges from zero to one, with zero corresponding to the most accurate result. We add a small constant ($\epsilon$) for numerical stability. These segmentation loss functions are used widely in the field [47,48].

3.2 Boundary loss

The weighted cross-entropy and Dice loss terms are only marginally affected by the errors on the boundary because the boundary pixels are a small portion of the target objects. The third loss term is based on the boundary segmentation accuracy and focuses the network’s attention on the close vicinity of label boundaries. The boundary precision loss term ($\mathcal {L}_{BP}$) utilizes a boundary neighborhood mask,

(3)$$\begin{aligned}\pmb{\beta} (\mathbf{y}) =& \left( \bigvee_{c=1}^{N_c} \left[ \left( \mathbf{1}_{b \times 1} \ast \mathbf{y}_c \right) \oplus \left( \mathbf{y}_c \right) \right] \right) \bigvee\\ &\left( \bigvee_{c=1}^{N_c} \left[ \left( \mathbf{1}_{b \times 1} \ast (1 - \mathbf{y}_c) \right) \oplus \left( 1 - \mathbf{y}_c \right) \right] \right), \end{aligned}$$

which masks the cross-entropy loss values of the pixels that are not in close vicinity of the label boundaries along the radial axis ($b=10$ pixels) using all-ones matrix ($\mathbf {1}$) and two logical operators, i.e., convolution ($\ast$), disjunction (${\scriptstyle \bigvee }$) and exclusive disjunction ($\oplus$). The boundary precision loss term,

(4)$$\mathcal{L}_{BP} (\mathbf{y}, \mathbf{\hat{y}}, \pmb{\beta}) = \frac{-1}{\sum_{i,j} \pmb{\beta}_{ij}} \sum_{c=1}^{N_c} \sum_{i,j} \pmb{\beta}_{ij} \mathbf{y}_{ijc} \log \mathbf{\hat{y}}_{ijc},$$

is differentiable with respect to the model parameters as long as $\pmb {\beta }$ is a function of the ground truth target label.

3.3 Attending physician loss

The training of a critique model involves a loss function that measures the distance of model parameters from the optimal solution in the parameter space. Arjovsky et al. [49] proposed Wasserstein-1 (a.k.a. Earth-Mover) distance,

(5)$$W_1(P,P') = \inf_{\gamma \in \Pi(P,P')} \mathbb{E}_{(x,x') \sim \gamma} \lVert x - x' \rVert_1 ,$$

where $\Pi (P,P')$ is the set of all joint distributions $\gamma (x,x')$ that their marginal distributions are equal to $P$ and $P'$. Wasserstein-1 is the optimal cost of transporting a mass with distribution $P$ to another mass with distribution $P'$ when the transport cost and transport distance are linearly related. Stable learning with a meaningful learning curve that avoids common problems, including mode collapse, can be obtained when Wasserstein-1 distance is adopted [49]. Since the infimum in (5) is intractable, Kantorovich and Rubinstein [50] proposed a tractable dual problem,

(6)$$W_1(P,P') = \sup_{\lVert f \rVert_L \leq 1} \{ \mathbb{E}_{x \sim P} [f(x)] - \mathbb{E}_{x' \sim P'} [f(x')] \},$$

where $f$ is a 1-Lipschitz function, mapping the support of $P$ and $P'$ to real numbers.

Similarly, we can select $f$ from a family of parameterized functions ($\{f_w\}_{w \in \mathcal {W}}$) that are at least K-Lipschitz for a constant K and optimize (5) over the functional parameter space,

(7)$$W_1(P,P') = \max_{w \in \mathcal{W}} \{\mathbb{E}_{x \sim P} [f_w(x)] - \mathbb{E}_{x' \sim P'} [f_w(x')]\},$$

The requirement of $f$ being K-Lipschitz for the function family of deep neural networks can be imposed by clipping the parameter values with an absolute value upper limit [49] or enforcing the gradient of parameters to be 1 almost everywhere through a gradient penalty loss term [51]. Gulrajani et al. [51] showed that the gradient constraining method improves the learning process compared to the weight clipping method. We observed a similar effect while training our Attending Physician model, which is then used as the fourth loss term

(8)$$\mathcal{L}_{AP} = \mathbb{E}_{x \sim P} [f_w(x)] - \mathbb{E}_{x' \sim P'} [f_w(x')].$$

in addition to PS-OCT images, we used final high-quality annotations and lower-quality annotations from an initial round of manual segmentation (see Section 4. on Data below) to train the critique model. After training the critique model, we trained the main model by integrating the critique model with fixed parameters (Fig. 2).

3.4 Topological loss

The last loss term examines the labels from a topological point of view. Ideally, the predicted labels along A-lines are composed of three or four connected components without any void, starting with the lumen label in the center and ending with the outside label. The area between the lumen and outside labels should be occupied by two adjacent solid anatomical layers (i.e., intima and media) or by one of the artifact labels (i.e., guidewire or plaque shadows). These configurations are distinguishable in terms of the number of label boundaries along the radial direction. The soft boundary cardinality loss term ($\mathcal {L}_{BC}$) penalizes the discrepancy between the predicted and ground truth labels based on the number of boundary pixels along the radial axis. We propose to employ

(9)$$\mathbf{S} (\mathbf{y}) = 1 + \tanh{\left( M \left[ \mathbf{y} - \max_{c} \mathbf{y}_{ijc} \right] \right)}$$

that is a differentiable proxy for the arguments of the maxima and a saturated equivalent of the softmax function, i.e., $e^\mathbf {x}/ \lVert e^\mathbf {x_i} \rVert _1$. The soft argmax admits the predicted class probabilities at each pixel and maps the probability of the most probable class and other classes to $\sim 1$ and $\sim 0$, respectively. The level of saturation is controlled by the large number $M$ and the precision of the probability values. Since the value of the soft argmax for a given class changes between two adjacent pixels at the label boundary, the soft boundary set cardinality along the radial axis

(10)$$\mathbf{BC} (\mathbf{S}) = \frac{1}{2} \sum_i \sum_c \lvert \mathbf{S}(\mathbf{y})_{(i+1)jc} - \mathbf{S}(\mathbf{y})_{ijc} \rvert$$

approximates the number of class boundaries in each A-line. The boundary cardinality loss function

(11)$$\mathcal{L}_{BC} (\mathbf{y}, \mathbf{\hat{y}}) = \sigma(\mathbf{BC} (\mathbf{S}(\mathbf{y})), \mathbf{BC} (\mathbf{S}(\mathbf{\hat{y}})))$$

compares the prediction and ground-truth labels with respect to the number of boundaries, where $\sigma$ measures the difference between two $\mathbf {BC}$ vectors (e.g., norm 1). We considered 1, 100, and $100/\epsilon$ for $M$, where $\epsilon$ is the small number used for mathematical stability in (2) and sofmax value clipping. For $\sigma$, we considered $\lVert \cdot \rVert _1$, $\lVert \cdot \rVert _2$, and $\max (\cdot )$. Based on the validation dataset and the convexity of $\mathcal {L}_{BC}$, norm-1 ($\lVert \cdot \rVert _1$) and $100/\epsilon$ are the optimal choices for $\sigma$ and $M$, respectively.

The final loss function combines all five loss terms:

(12)$$\begin{aligned}\mathcal{L} =& \lambda_{WCE} \mathcal{L}_{WCE} + \lambda_{Dice} \mathcal{L}_{Dice} +\\ & \lambda_{BP} \mathcal{L}_{BP} + \lambda_{AP} \mathcal{L}_{AP} + \lambda_{BC} \mathcal{L}_{BC}, \end{aligned}$$

in which loss term weights ($\lambda _.$) are selected within the range $[10^{-3}, 10^{3}]$ and optimized over their logarithmically-spaced multidimensional grid using greedy algorithms.

3.5 CNN architecture

The proposed network architecture scheme is based on the U-Net [52] and deep residual learning [53] models (Fig. 2). The auxiliary critique model was trained independently to distinguish low- and high-quality labels. Subsequently, the main model was trained by combining the trained critique model and other loss terms to segment three anatomical layers and two shadows artifacts (Table 1). The optimized architecture contains multi-scale encoder and decoder sections with skip connections at each scale. The input consists of the three-channel images of conventional intensity, birefringence, and depolarization, in polar coordinates, down sampled to 512 by 512 pixels. The output consists of the six concatenated classes of the same pixel dimension. The convolutional complex contains three convolutional layers with a $3 \times 3$ pixel kernel size and a leaky version of the rectified linear unit (L-ReLU) activation function, which has a negative slope coefficient of 0.3. These three convolutional layers compute the residual values by using an internal skip connection. The max-pooling layers with a $2 \times 2$ pixel kernel size are applied after convolutional complexes in the encoding section for down-scaling while the counterpart deconvolutional layers are applied for bi-linear up-scaling within the decoding section. The encoding output and decoding input are connected through two convolutional complexes that operate at the latent representation level. The layers within each of the three scales and the latent representation layers have 8, 8, 16, and 16 features, respectively.

Table 1. The definition of the six exclusive labels that are based on the manual expert annotations and shown in Fig. 1(D).

View Table | View all tables in this article

The critique model architecture accepts the concatenation of image channels and output label channels as the input and applies three convolutional complexes with 32, 64, and 128 features, respectively. Each complex consists of two convolutional layers with a $3 \times 3$ pixel kernel size and the ReLU activation function followed by a max-pooling with a $2 \times 2$ pixel kernel size. The last complex’s output is flattened and processed by a three-layer dense neural network with 1024, 256, and 128 hidden nodes and ReLU activation function, respectively. The final output has one feature and uses the hyperbolic tangent activation function (Fig. 2).

3.6 Training and Implementation

We randomly divided the annotated dataset between training, validation, and hold-out testing dataset by selecting 45, 6, and 6 patients (80%/10%/10%), respectively. Augmentation included random mirroring, rotation, multi-channel image intensity distribution manipulations ([-0.05, 0.05] brightness and [0.9, 1.1] contrast), and spatial scaling ([0.875, 1.125]). An element from the power set of the image augmentation set was applied to each given PS-OCT cross-section with randomly selected transformation parameters sampled uniformly and independently from the ranges above. The geometric transformations were defined in the Cartesian coordinate system, but they were implemented and applied in the polar coordinate system. The data augmentation methods were implemented and executed on a GPU to improve the model’s runtime.

We implemented our model in Python using Keras and Tensorflow. We commonly used RMSprop optimizer with $10^{-3}$–$10^{-4}$ learning rate and mini-batch size of 20 per GPU. The GPU memory size was the limiting factor in the learning rate and mini-batch size selection. We used two NVIDIA GeForce RTX 2080 Ti or four NVIDIA Tesla V100.

3.7 Post-processing

We investigated a post-processing procedure to the model output to enforce known topology of the multi-class segmentations. Initially, small objects and holes within each class were removed, and their interfaces were smoothed. Then, a set of logical operations was applied to impose the topological relationships between the classes in the polar coordinate system. The proposed set includes the following constraints:

• Lumen is a single connected object without any 2D void. The same rule applies to both guidewire shadow and outside.
• Guidewire and plaque shadows are confined between the lumen, the outside, and two A-lines.
• The order of layers from inside to outside ends is lumen, intima, media, and outside.

3.8 Performance metrics

Based on the ground-truth labels, we evaluated the performance of the multi-class prediction model using accuracy and Dice coefficient,

(13)$$Dice(\mathbf{Y}, \mathbf{\hat{Y}}) = \frac{2}{\lVert \mathcal{C} \rVert}\sum_{c\in \mathcal{C}} \frac{\lVert \mathbf{Y}_c \cap \mathbf{\hat{Y}}_c\rVert}{\lVert \mathbf{Y}_c \rVert + \lVert {\mathbf{\hat{Y}}}_c \rVert},$$

where $\lVert \cdot \rVert$ is the set cardinality, $\mathbf {\hat {Y}}_c$ is the set of predicted pixels as class $c$, $\mathbf {Y}_c$ is the set of pixels in ground-truth as class $c$, $\mathcal {C}$ is the set of classes, and $\cap$ is the intersection operation. Furthermore, we evaluated the precision of inter-class boundaries using the average distance error (ADE) along the radial direction and modified Hausdorff distance (MHD) [54] in 2D within the cross-section:

(14)$$d(a, \mathcal{B}) = \inf_{b \in \mathcal{B}} \{ \lVert a - b \rVert_2 \},$$

(15)$$ADE(\hat{\mathcal{B}}; \mathcal{B}) = \frac{1}{\lVert \hat{\mathcal{B}} \rVert} \sum_{a \in \hat{\mathcal{B}}} d(a, \mathcal{B}),$$

(16)$$MHD(\hat{\mathcal{B}}, \mathcal{B}) = \max \{ ADE(\hat{\mathcal{B}}; \mathcal{B}) , ADE(\mathcal{B}; \hat{\mathcal{B}}) \},$$

where $\mathcal {B}$ and $\hat {\mathcal {B}}$ are the set of boundary pixels in the ground truth and prediction, respectively, and $\lVert. \rVert _2$ is the Euclidean norm. ADE and MHD are measured in $\mu m$.

4. Data

We demonstrate the method on images from an intravascular polarimetry pilot study, which included two cohorts and enrolled a total of 57 patients who underwent percutaneous coronary intervention and PS-OCT imaging at the Erasmus University Medical Center in Rotterdam. Of the 57 pullbacks, only segments of native vessel wall or containing old stents from previous interventions were included in this study. The Ethics Committee of Erasmus Medical Center approved the study protocol, and all procedures were performed in accordance with local and federal regulations and the Declaration of Helsinki.

The imaging system consists of “FastView” intravascular catheters (Terumo Co., Tokyo, Japan) interfaced with our custom-built PS-OCT system [17], operating at 1300 nm central wavelength similar to commercially available clinical IV-OCT systems. The wavelength scanning range was 110 nm, achieving a radial resolution below 10 $\mu$m, assuming a tissue refractive index of 1.34. The dimension of the pixels in the reconstructed tomograms in the radial direction were 4.2 $\mu$m and 4.43 $\mu$m, respectively, for the two cohorts. The repetition rate of the wavelength sweep was 103.6 kHz. The catheter’s rotation speed was just over 100 RPS, with 1024 radial scans per rotation, and pullbacks were performed at 10 mm/s or 20 mm/s, at the operator’s discretion. Non-ionic contrast solution was injected at a rate of 3-4 mL/s during the pullback to displace coronary blood and obtain an unperturbed view of the vessel wall.

Intravascular polarimetry was performed based on our earlier work [16,18,55–57]. Briefly, an electro-optic polarization modulator was used to alternate the polarization state of the light incident on the tissue between consecutive depth scans and a polarization-diverse receiver enabled determination of the detected light’s polarization state and intensity. Polarimetric analysis employed spectral binning [58] to reconstruct maps of tissue birefringence and depolarization. Birefringence is the difference in the refractive index experienced by orthogonal polarization states aligned and orthogonal to the tissue optic axis, respectively. Tissue depolarization measures the randomness of the detected light’s polarization state using the complement to one of the degrees of polarization.

Initially, an expert interventional cardiologist (K.O.) excluded partial segments of 3D pullbacks that were uninterpretable and suffered from severe artifacts caused by insufficient blood clearing. The qualified pullback segments added up to 3936 mm of pullbacks at a 100 or 200 $\mu$m pitch. Subsequently, the expert annotated a total of 984 PS-OCT cross-sections spaced 4 mm apart using our in-house Matlab graphical user interface (Fig. 1.D), using the conventional OCT signal (backscatter signal intensity) as well as the polarization channels. The manual annotations included the outer boundaries of the lumen, tunica intima (i.e., IEL), and tunica media (i.e., EEL). The location of IEL and EEL within the plaque and guidewire shadows were extrapolated based on their visible segments (Fig. 1.E). Additionally, angular segments containing plaque, guidewire, stent struts, side branches, or thrombus were identified and used for segmentation or selective analysis without influencing the main label categories. Consequently, as summarized in Table 1, the manual annotations were converted into six exclusive labels: outside, lumen, visible intima, visible media, plaque shadow, and guidewire shadow (Fig. 1.F).

To manage the workload, we annotated the total dataset in four separate batches and through three phases: initial annotation, high-precision annotation, and annotation approval. One of the batches was revised extensively at the pixel-level, requiring four times as long as other batches. The high-quality annotated batch, in combination with its initial annotation, was utilized to train the proposed critique model and its resulting loss term.

5. Results

We compared the model’s automated annotation results to the expert’s ground-truth annotations in Fig. 3 to qualitatively characterize our model, illustrate the model’s strengths, and identify possible areas of improvement. Our model’s annotations and the ground-truth are overlaid on the gray-scale intensity image in blue and red outlines, respectively. Visualization 1 is showing the segmentations and all individual PS-OCT signal channels.

Fig. 3. Qualitative assessment of PS-OCT segmentation. The annotations of our model and the ground-truth are overlaid on the gray-scale intensity image in blue and red outlines, respectively. See the text for detailed discussion, and Visualization 1 for all individual PS-OCT signal channels. Scale bar: 1 mm.

Download Full Size | PDF

The most common complication for boundary annotation, particularly for the outer intima and outer media, is the presence of thick plaques or calcium (e.g. Figure 3.A green arrow) and thickened vessel walls (e.g. Figure 3.G, green arrow) that cause significant reduction in the detected signal. The background signal and statistical noise characteristics within the plaque regions impede the model’s objective to annotate the anatomical layers and result in higher annotation variability (e.g. Figure 3.F, yellow arrow; Fig. 3.G, yellow arrow; Fig. 3.K, yellow arrow).

Nonetheless, whenever the image information supports the ground-truth boundaries, the model matches well with the expert annotations even in these challenging cases (e.g. Figure 3.D, both arrows). Correspondingly, the boundaries detected by the model may conform with the underlying multi-dimensional images more accurately than the ground-truth annotations (e.g. Figure 3.F, green arrow), suggesting inconsistencies in the manual ground-truth segmentation.

The guidewire obstructs the probing light, causing a fuzzy signal at its boundaries, resulting in imprecise automatic and manual boundary detection (e.g. Figure 3.B, green arrow). Moreover, the physical proximity of the vessel lumen with the guidewire and catheter leads to perturbed pixel-level delineation of the lumen boundary (e.g. Figure 3.B, green arrow; Fig. 3.I, green arrow).

Side branches can appear in various locations of the field of view and could be expected to exhibit confusing features, yet our model analyzes these cases in concordance with the ground truth annotation. Such vessels might appear outside the vessel wall (e.g. Figure 3.B, yellow arrow; Fig. 3.E, green arrow), directly adjacent to the vessel wall boundary (e.g. Figure 3.I, yellow arrow), inside the intima (e.g. Figure 3.C, yellow arrow), or in direct communication with the lumen (e.g. Figure 3.L, yellow arrow).

Even though non-ionic contrast solution is injected during catheter pull-back to displace blood, residues of blood may persist in the vessel lumen vicinity (e.g. Figure 3.A, yellow arrow; Fig. 3.E, yellow arrow). Blood clearance can be incomplete, especially at the onset or the end of contrast injection Fig. 3.H, both arrows). Still, in all these cases, our model successfully detects the lumen outer boundaries. Equivalently, the dark and bright tissue patterns (e.g. Figure 3.C, green arrow; and Fig. 3.K, green arrow) are observed beyond the media layer and mimic the multi-layer vessel wall structures but they do not distract the automatic boundary allocations.

While our study only included intravascular imaging prior to intervention, previously embedded stents are commonly encountered, owing to the high recurrence rate of acute coronary syndrome and myocardial infarction. Depending on the specific stent material and patient history, stents might appear embedded in the vessel wall (e.g. Figure 3.J, green arrow) or protruding into the lumen (e.g. Figure 3.J, yellow arrow; Fig. 3.L, green arrow). Stents generate diverse and strong image artifacts that impede the model’s ability to correctly detect the boundaries. Exact layer segmentation behind stents presents challenges even for expert readers. With the exception of neointimal hyperplasia, previously stented segments are unlikely to reside in the culprit segment. Such segments were included in our data set merely to train the model to ignore the ensuing artifacts. Notably, there exists a distinct class of models designed to detect stent struts and verify correct stent deployment [30,35,39]).

To complement the qualitative assessment of the model with quantitative metrics Table 2 lists the model’s performance for individual label classes, evaluated on the hold-out test set. The lumen segmentation achieved the best scores for all metrics while the plaque shadow performance influenced by the more ambiguous ground-truth labels owing to the lack of clear structural markers. Nonetheless, the individual metrics confirm the overall high quality of segmentation achieved by the model.

Table 2. The performance of our multi-label classification model based on different performance metrics.

View Table | View all tables in this article

To substantiate the design of the model, we conducted an ablation study to examine the individual effects of the various loss terms, i.e.,

1. The soft boundary cardinality loss term ($\mathcal {L}_{BC}$).
2. The Attending Physician (a.k.a. Wasserstein critique model) loss term ($\mathcal {L}_{AP}$),
3. The boundary precision loss term ($\mathcal {L}_{BP}$),
4. The generalized soft multi-class dice loss term ($\mathcal {L}_{Dice}$), and
5. The weighted cross-entropy loss term ($\mathcal {L}_{WCE}$),

We measured the accuracy, Dice coefficient, and modified Hausdorff distance (MHD) in $\mu m$ averaged among all label classes on the hold-out test dataset for models trained with a reduced number of loss terms. The results of the ablation study on the loss terms are tabulated in Table 3 and confirm that each loss term contributes to the performance of the method by achieving their individual objectives, such as controlling the number of boundaries or strengthening humanlike annotation outputs, while attaining persistent overall performance metrics.

Table 3. Loss terms ablation study.

View Table | View all tables in this article

Before developing and refining the individual loss terms, we set out to confirm the advantage of using intravascular polarimetry compared to the conventional IV-OCT for the visualization and segmentation of anatomical layers. Using the proposed architecture we trained the model with only the weighted cross-entropy loss function ($\mathcal {L}_{WCE}$) and compared its performance to an adapted model that was trained with only the single intensity channel as input. The Dice coefficient of the media class using intravascular polarimetry data was 70.7%, while it was only 62.7% when using conventional IV-OCT. The subsequent optimization of the model’s performance improved the Dice coefficient of the media class using PS-OCT to 79.5%. The significant gain in performance achieved by using the polarimetric channels even with only the $\mathcal {L}_{WCE}$ loss term confirms our previous qualitative observations of improved contrast for the media layer [16,17].

Table 4 and Table 5 report the performance of previous studies that developed segmentation methods for the lumen and for those that also segmented the two additional anatomical layers, respectively. None of these studies provide publicly available code that would enable direct comparison on our data sets, and the reported performance values originate from the manuscripts of the respective studies, which ensures optimal application of these methods to the envisioned vessel pathologies.

Table 4. Comparison of our model and other studies that segment the vessel lumen. Other metrics include accuracy (Acc), True Positive Rate (TPR), True Negative Rate (TNR), and absolute distance error (ADE).

View Table | View all tables in this article

Table 5. Comparison of our model and other studies that detect the outer boundary of lumen, intima, and media by reporting mean $\pm$ standard deviation.

View Table | View all tables in this article

There are many methods ([24–30,32,35,38,39,41,42,44,60–62]) that extract the lumen with Dice 95-95% and our method outperforms them all at 99%. Moreover, Table 5 indicates that our model achieves lower absolute distance error (ADE) for both outer intima and media boundaries compared to the two other reports accomplishing and reporting on this task. Here, we excluded thickened vessel walls from evaluation of the outer boundaries in Table 5, in line with the analysis in [23], which only evaluated layer segmentation in ’healthy regions’, and [24], which inspected allograft vessels with minimal intimal thickening. However, thickened vessel walls are the result of coronary atherosclerosis and very common, especially in the population of patients likely to undergo intravascular imaging. Importantly, our model is able to segment cross-sections including thickened vessel wall segments, although imaging through this additional tissue degrades the achieved ADE (2.60, 16.9, and 20.85 $\mu m$ for outer the lumen, intima, and media, respectively). Still, these values are comparable to the previous methods that detect these layers only in segments with minimal disease.

6. Discussion

PS-OCT complements the IV-OCT backscatter intensity signal by measuring the polarization state of reflected light and reconstructing tissue birefringence and depolarization signals. These polarimetric signals provide a more detailed characterization of the vessel wall and can help to differentiate tissue layers that have comparable scattering properties but distinct polarization features. PS-OCT enriches the visualization of anatomical layers and hence facilitates downstream image processing tasks. We proposed a convolutional neural network model with a new multi-term loss function that leverages the increased contrast available to PS-OCT to segment the vessel lumen, as well as the intima-media and the media-adventitia boundary. Furthermore, the model works on all plaque types and correctly segments the inner and outer media boundaries even through thickened vessel walls, as long as the plaque is not opaque. Conversely, angular segments of lipid-rich or calcified plaques that impede detection of the subluminal anatomical layers are identified as plaque shadows. The model, however, continues to estimate the outer media boundary throughout these opaque regions. The model also identifies guidewire shadows without interrupting the lumen and outer media segmentation.

Our comprehensive multi-class image segmentation model can support many downstream image analysis tasks. Automated and objective image segmentation simplifies both clinical research and affords integration into the clinical workflow by removing the workload of manual segmentation. For guidance of PCI, robust and automated measurement of the EEL diameter would simplify stent sizing [63]. Evaluation of the intimal thickness along the vessel could enhance the common simplified visualization of the culprit vessel based on the lumen diameter with complementary information on the location and extent of plaques to select a suitable landing zone. In a clinical research setting, automated segmentation of the intimal thickness along entire coronary vessels would enable the formulation of questions that are currently impractical to address due to the workload of manual segmentation. Crucially, automated segmentation also enables evaluation of tissue polarization properties in distinct anatomical areas, which previously relied on tedious manual segmentation [16,18,55]. We anticipate that such volumetric analysis of polarization properties will offer refined insight into plaque composition and may enable the formulation of polarization-informed plaque index similar to the lipid-core burden index of near infrared spectroscopy [20].

The high-performance segmentation of the lumen and outside classes is an indication that we approached the limits imposed by using a single-reader ground-truth. The media boundaries and shadow classes likely suffer from higher intra-reader ground-truth variability. The anatomical layers beyond the lumen are located in areas of decaying signal quality and the shadows intrinsically have a poorly defined border. The increased boundary to area ratio of the media furthermore deteriorates typical segmentation metrics even without degradation in the boundary precision.

In addition to the use of a single-reader ground truth, the limitations of the current study include a modest number of pullbacks and a limited spectrum of atherosclerotic disease. Also, segmentation was performed on individual cross-sections. While segmentation of adjacent cross-sections enables volumetric segmentation, there may be information embedded in the volumetric data that escapes the current model. Lastly, intravascular polarimetry with PS-OCT uses commercial clinical imaging catheters but currently uses a custom imaging console, which complicates clinical translation. Towards resolving this limitation, Xiong et al. [64] proposed a new method that may be compatible with existing imaging consoles and could accelerate the clinical translation of using anatomical layer segmentation based on tissue polarization properties.

7. Conclusion

We proposed a method for the segmentation of intravascular polarimetry images of coronary arteries. The method performance compares favorably with state-of-the-art baseline algorithms, which operate on conventional IV-OCT images. The additional polarization contrast available to PS-OCT affords improved segmentation across a wide range of atherosclerotic lesion types and significantly improves the segmentation of the media boundaries in diseased vessels. Intravascular polarimetry with automated segmentation could be used for refined lesion characterization and may simplify and improve guidance of percutaneous coronary interventions.

Funding

Massachusetts General Hospital (Bullock Postdoctoral Fellowship); American Heart Association (18CSA34080399); National Institutes of Health (1R01HL134892, 1R01HL163099-01, P41EB-015902, P41EB-015903); RRM Charitable Fund; Simard Fund.

Acknowledgment

This work was done partially using MIT-IBM Satori hardware resources.

Disclosures

Drs. Villiger and Bouma are inventors on patents for OCT technology and methods that are owned by Massachusetts General Hospital and licensed to Terumo Corporation. Dr. Libby is an unpaid consultant to, or involved in clinical trials for Amgen, AstraZeneca, Baim Institute, Beren Therapeutics, Esperion Therapeutics, Genentech, Kancera, Kowa Pharmaceuticals, Medimmune, Merck, Moderna, Novo Nordisk, Novartis, Pfizer, and Sanofi-Regeneron. Dr. Libby is a member of the scientific advisory board for Amgen, Caristo Diagnostics, Cartesian Therapeutics, CSL Behring, DalCor Pharmaceuticals, Dewpoint Therapeutics, Eulicid Bioimaging, Kancera, Kowa Pharmaceuticals, Olatec Therapeutics, Medimmune, Novartis, PlaqueTec, TenSixteen Bio, Soley Thereapeutics, and XBiotech, Inc. Dr. Libby’s laboratory has received research funding in the last 2 years from Novartis, Novo Nordisk and Genentech. Dr. Libby is on the Board of Directors of XBiotech, Inc. Dr. Libby has a financial interest in Xbiotech, a company developing therapeutic human antibodies, in TenSixteen Bio, a company targeting somatic mosaicism and clonal hematopoiesis of indeterminate potential (CHIP) to discover and develop novel therapeutics to treat age-related diseases, and in Soley Therapeutics, a biotechnology company that is combining artificial intelligence with molecular and cellular response detection for discovering and developing new drugs, currently focusing on cancer therapeutics. Dr. Libby’s interests were reviewed and are managed by Brigham and Women’s Hospital and Mass General Brigham in accordance with their conflict-of-interest policies. Dr. Daemen received institutional grant/research support from Abbott Vascular, Boston Scientific, ACIST Medical, Medtronic, Microport, Pie Medical, and ReCor medical, and consultancy and speaker fees from Abbott Vascular, Abiomed, ACIST medical, Boston Scientific, Cardialysis BV, CardiacBooster, Kaminari Medical, ReCor Medical, PulseCath, Pie Medical, Sanofi, Siemens Health Care and Medtronic.

Data availability

Data underlying the results presented in this paper are not publicly available at this time but may be obtained from the authors upon reasonable request. The source code and the trained model are publicly available at [65].

References

1. M. Franco, R. S. Cooper, U. Bilal, et al., “Challenges and opportunities for cardiovascular disease prevention,” The Am. J. Med. 124(2), 95–102 (2011). [CrossRef]

2. C. W. Tsao, A. W. Aday, Z. I. Almarzooq, et al., “Heart disease and stroke statistics—2023 update: a report from the American Heart Association,” Circulation 147(8), e93–e621 (2023). [CrossRef]

3. I.-K. Jang, G. Tearney, and B. Bouma, “Visualization of tissue prolapse between coronary stent struts by optical coherence tomography: comparison with intravascular ultrasound,” Circulation 104(22), 2754 (2001). [CrossRef]

4. T. Okamura, Y. Onuma, H. M. Garcia-Garcia, et al., “First-in-man evaluation of intravascular optical frequency domain imaging (OFDI) of Terumo: a comparison with intravascular ultrasound and quantitative coronary angiography,” EuroIntervention 6(9), 1037–1045 (2011). [CrossRef]

5. N. Meneveau, G. Souteyrand, P. Motreff, et al., “Optical coherence tomography to optimize results of percutaneous coronary intervention in patients with non–st-elevation acute coronary syndrome: results of the multicenter, randomized doctors study (does optical coherence tomography optimize results of stenting),” Circulation 134(13), 906–917 (2016). [CrossRef]

6. Z. A. Ali, U. Landmesser, A. Maehara, et al., ““Optical coherence tomography–guided versus angiography-guided PCI,” N. Engl. J. Med. 389(16), 1466–1476 (2023). [CrossRef]

7. D.-Y. Kang, J.-M. Ahn, S.-C. Yun, et al., “Optical coherence tomography–guided or intravascular ultrasound-guided percutaneous coronary intervention: the OCTIVUS randomized clinical trial,” Circulation 148(16), 1195–1206 (2023). [CrossRef]

8. N. R. Holm, L. N. Andreasen, O. Neghabat, et al., “OCT or angiography guidance for PCI in complex bifurcation lesions,” N. Engl. J. Med. 389(16), 1477–1487 (2023). [CrossRef]

9. Z. A. Ali, K. K. Galougahi, G. S. Mintz, et al., “Intracoronary optical coherence tomography: state-of-the-art and future directions,” EuroIntervention 17(2), e105–e123 (2021). [CrossRef]

10. A. G. Truesdell, M. A. Alasnag, P. Kaul, et al., “Intravascular imaging during percutaneous coronary intervention,” J. Am. Coll. Cardiol. 81(6), 590–605 (2023). [CrossRef]

11. B. E. Bouma, M. Villiger, K. Otsuka, et al., “Intravascular optical coherence tomography,” Biomed. Opt. Express 8(5), 2660–2686 (2017). [CrossRef]

12. O. Manfrini, E. Mont, O. Leone, et al., “Sources of error and interpretation of plaque morphology by optical coherence tomography,” The Am. J. Cardiology 98(2), 156–159 (2006). [CrossRef]

13. R. Januszek, W. Siłka, K. Sabatowski, et al., “Procedure-related differences and clinical outcomes in patients treated with percutaneous coronary intervention assisted by optical coherence tomography between new and earlier generation software (Ultreon™ 1.0 software vs. Aptivue™ software),” J. Cardiovasc. Dev. Dis. 9(7), 218 (2022). [CrossRef]

14. “HyperVue Imaging System,” https://www.spectrawave.com/product

15. S. K. Nadkarni, M. C. Pierce, B. H. Park, et al., “Measurement of collagen and smooth muscle cell content in atherosclerotic plaques using polarization-sensitive optical coherence tomography,” J. Am. Coll. Cardiol. 49(13), 1474–1481 (2007). [CrossRef]

16. M. Villiger, K. Otsuka, A. Karanasos, et al., “Coronary plaque microstructure and composition modify optical polarization: a new endogenous contrast mechanism for optical frequency domain imaging,” JACC: Cardiovasc. Imaging 11(11), 1666–1676 (2018). [CrossRef]

17. M. Villiger, K. Otsuka, A. Karanasos, et al., “Repeatability assessment of intravascular polarimetry in patients,” IEEE Trans. Med. Imaging 37(7), 1618–1625 (2018). [CrossRef]

18. K. Otsuka, M. Villiger, S. K. Nadkarni, et al., “Intravascular polarimetry: Clinical translation and future applications of catheter-based polarization sensitive optical frequency domain imaging,” Front. Cardiovasc. Med. 7, 146 (2020). [CrossRef]

19. L. Räber, G. S. Mintz, K. C. Koskinas, et al., “Clinical use of intracoronary imaging. part 1: guidance and optimization of coronary interventions. an expert consensus document of the European Association of Percutaneous Cardiovascular Interventions,” Eur. Heart J. 39(35), 3281–3300 (2018). [CrossRef]

20. D. Erlinge, A. Maehara, O. Ben-Yehuda, et al., “Identification of vulnerable plaques and patients by intracoronary near-infrared spectroscopy and ultrasound (prospect ii): a prospective natural history study,” The Lancet 397(10278), 985–995 (2021). [CrossRef]

21. A. P. Burke, A. Farb, G. T. Malcom, et al., “Coronary risk factors and plaque morphology in men with coronary disease who died suddenly,” N. Engl. J. Med. 336(18), 1276–1282 (1997). [CrossRef]

22. R. Virmani, F. D. Kolodgie, A. P. Burke, et al., “Lessons from sudden coronary death: a comprehensive morphological classification scheme for atherosclerotic lesions,” Arterioscler., Thromb., Vasc. Biol. 20(5), 1262–1275 (2000). [CrossRef]

23. G. Zahnd, A. Hoogendoorn, N. Combaret, et al., “Contour segmentation of the intima, media, and adventitia layers in intracoronary OCT images: application to fully automatic detection of healthy wall regions,” Int. J. Comput. Assist. Radiol. Surg. 12(11), 1923–1936 (2017). [CrossRef]

24. Z. Chen, M. Pazdernik, H. Zhang, et al., “Quantitative 3d analysis of coronary wall morphology in heart transplant patients: Oct-assessed cardiac allograft vasculopathy progression,” Med. Image Anal. 50, 95–105 (2018). [CrossRef]

25. M. C. Moraes, D. A. C. Cardenas, and S. S. Furuie, “Automatic lumen segmentation in IVOCT images using binary morphological reconstruction,” Biomed. Eng. Online 12(1), 78 (2013). [CrossRef]

26. M. Bologna, S. Migliori, and E. Montin, “Automatic segmentation of optical coherence tomography pullbacks of coronary arteries treated with bioresorbable vascular scaffolds: Application to hemodynamics modeling,” PLoS One 14(3), e0213603 (2019). [CrossRef]

27. M. M. Macedo, W. V. Guimarães, M. Z. Galon, et al., “A bifurcation identifier for IV-OCT using orthogonal least squares and supervised machine learning,” Comput. Med. Imaging Graph. 46, 237–248 (2015). [CrossRef]

28. M. M. G. de Macedo, C. K. Takimura, P. A. Lemos, et al., “A robust fully automatic lumen segmentation method for in vivo intracoronary optical coherence tomography,” Res. Biomed. Eng. 32(1), 35–43 (2016). [CrossRef]

29. A. Akbar, T. Khwaja, A. Javaid, et al., “Automated accurate lumen segmentation using l-mode interpolation for three-dimensional intravascular optical coherence tomography,” Biomed. Opt. Express 10(10), 5325–5336 (2019). [CrossRef]

30. G. J. Ughi, T. Adriaenssens, K. Onsea, et al., “Automatic segmentation of in-vivo intra-coronary optical coherence tomography images to assess stent strut apposition and coverage,” The Int. J. Cardiovasc. Imaging 28(2), 229–241 (2012). [CrossRef]

31. S. Joseph, A. Adnan, and D. Adlam, “Automatic segmentation of coronary morphology using transmittance-based lumen intensity-enhanced intravascular optical coherence tomography images and applying a localized level-set-based active contour method,” J. Med. Imag. 3(04), 1 (2016). [CrossRef]

32. Y. Cao, K. Cheng, X. Qin, et al., “Automatic lumen segmentation in intravascular optical coherence tomography images using level set,” Comput. Math. Methods Med. 2017, 1–11 (2017). [CrossRef]

33. F. Zhu, Z. Ding, K. Tao, et al., “Automatic lumen segmentation using uniqueness of vascular connected region for intravascular optical coherence tomography,” J. Biophotonics 14(10), e202100124 (2021). [CrossRef]

34. H. S. Nam, C.-S. Kim, J. J. Lee, et al., “Automated detection of vessel lumen and stent struts in intravascular optical coherence tomography to evaluate stent apposition and neointimal coverage,” Med. Phys. 43(4), 1662–1675 (2016). [CrossRef]

35. S. Tsantis, G. C. Kagadis, K. Katsanos, et al., “Automatic vessel lumen segmentation and stent strut detection in intravascular optical coherence tomography,” Med. Phys. 39(1), 503–513 (2011). [CrossRef]

36. A. Wang, J. Eggermont, J. H. Reiber, et al., “Fully automated side branch detection in intravascular optical coherence tomography pullback runs,” Biomed. Opt. Express 5(9), 3160–3173 (2014). [CrossRef]

37. A. G. Roy, S. Conjeti, S. G. Carlier, et al., “Lumen segmentation in intravascular optical coherence tomography using backscattering tracked and initialized random walks,” IEEE J. Biomed. Health Inform. 20(2), 606–614 (2016). [CrossRef]

38. M. L. Olender, L. S. Athanasiou, and M. José, “A mechanical approach for smooth surface fitting to delineate vessel walls in optical coherence tomography images,” IEEE Trans. Med. Imaging 38(6), 1384–1397 (2019). [CrossRef]

39. S. Yang, H.-J. Yoon, S. J. M. Yazdi, et al., “A novel automated lumen segmentation and classification algorithm for detection of irregular protrusion after stents deployment,” The Int. J. Med. Robotics Comput. Assist. Surg. 16(1), e2033 (2020). [CrossRef]

40. M. Pazdernik, Z. Chen, H. Bedanova, et al., “Early detection of cardiac allograft vasculopathy using highly automated 3-dimensional optical coherence tomography analysis,” The J. Heart Lung Transplant. 37(8), 992–1000 (2018). [CrossRef]

41. Y. L. Yong, L. K. Tan, R. A. McLaughlin, et al., “Linear-regression convolutional neural network for fully automated coronary lumen segmentation in intravascular optical coherence tomography,” J. Biomed. Opt. 22(12), 1 (2017). [CrossRef]

42. Y. Gharaibeh, D. S. Prabhu, C. Kolluru, et al., “Coronary calcification segmentation in intravascular OCT images using deep learning: application to calcification scoring,” J. Med. Imag. 6(04), 1 (2019). [CrossRef]

43. A. Abdolmanafi, L. Duong, N. Dahdah, et al., “Characterization of coronary artery pathological formations from OCT imaging using deep learning,” Biomed. Opt. Express 9(10), 4936–4960 (2018). [CrossRef]

44. A. Abdolmanafi, N. Dahdah, L. Duong, et al., “Fully automatic artificial intelligence diagnostic model of coronary artery lesions using OCT imaging,” Can. J. Cardiol. 35(10), S61–S62 (2019). [CrossRef]

45. G. F. Attizzani, D. Capodanno, Y. Ohno, et al., “Mechanisms, pathophysiology, and clinical aspects of incomplete stent apposition,” J. Am. Coll. Cardiol. 63(14), 1355–1367 (2014). [CrossRef]

46. P. Wu, J. L. Gutiérrez-Chico, H. Tauzin, et al., “Automatic stent reconstruction in optical coherence tomography based on a deep convolutional model,” Biomed. Opt. Express 11(6), 3374–3394 (2020). [CrossRef]

47. F. Milletari, N. Navab, and S.-A. Ahmadi, “V-net: Fully convolutional neural networks for volumetric medical image segmentation,” in 2016 Fourth International Conference on 3D Vision (3DV), (IEEE, 2016), pp. 565–571.

48. G. Litjens, T. Kooi, B. E. Bejnordi, et al., “A survey on deep learning in medical image analysis,” Med. Image Anal. 42, 60–88 (2017). [CrossRef]

49. M. Arjovsky, S. Chintala, and L. Bottou, “Wasserstein GAN,” arXiv, arXiv:1701.07875 (2017). [CrossRef]

50. C. Villani, Optimal Transport: Old and New, vol. 338 (Springer Science & Business Media, 2008).

51. I. Gulrajani, F. Ahmed, M. Arjovsky, et al., “Improved training of wasserstein gans,” in Advances in Neural Information Arocessing Systems, (2017), pp. 5767–5777.

52. O. Ronneberger, P. Fischer, and T. Brox, “U-Net: Convolutional networks for biomedical image segmentation,” in International Conference on Medical Image Computing and Computer-assisted Intervention, (Springer, 2015), pp. 234–241.

53. K. He, X. Zhang, S. Ren, et al., “Deep residual learning for image recognition,” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, (2016), pp. 770–778.

54. M.-P. Dubuisson and A. K. Jain, “A modified hausdorff distance for object matching,” in Proceedings of 12th International Conference on Pattern Recognition, vol. 1 (IEEE, 1994), pp. 566–568.

55. K. Otsuka, M. Villiger, S. K. Nadkarni, et al., “Intravascular polarimetry for tissue characterization of coronary atherosclerosis,” Circ. Rep. 1(12), 550–557 (2019). [CrossRef]

56. K. Otsuka, M. Villiger, L. J. van Zandvoort, et al., “Polarimetric signatures of vascular tissue response to drug-eluting stent implantation in patients,” Cardiovasc. Imaging 13(12), 2695–2696 (2020). [CrossRef]

57. K. Otsuka, M. Villiger, A. Karanasos, et al., “Intravascular polarimetry in patients with coronary artery disease,” Cardiovasc. Imaging 13(3), 790–801 (2020). [CrossRef]

58. M. Villiger, E. Z. Zhang, S. K. Nadkarni, et al., “Spectral binning for mitigation of polarization mode dispersion artifacts in catheter-based optical frequency domain imaging,” Opt. Express 21(14), 16353–16369 (2013). [CrossRef]

59. G.-A. Cheimariotis, Y. S. Chatzizisis, V. G. Koutkias, et al., “ARCOCT: Automatic detection of lumen border in intravascular OCT images,” Comput. Methods Programs Biomed. 151, 21–32 (2017). [CrossRef]

60. M. L. Olender, L. S. Athanasiou, M. José, et al., “Simultaneous multi-surface fitting for vessel wall layer delineation,” in 2019 IEEE EMBS International Conference on Biomedical & Health Informatics (BHI), (IEEE, 2019), pp. 1–4.

61. K.-P. Tung, W.-Z. Shi, R. De Silva, et al., “Automatical vessel wall detection in intravascular coronary OCT,” in 2011 IEEE International Symposium on Biomedical Imaging: From Nano to Macro, (IEEE, 2011), pp. 610–613.

62. Z. Wang, H. Kyono, H. G. Bezerra, et al., “Automatic segmentation of intravascular optical coherence tomography images for facilitating quantitative diagnosis of atherosclerosis,” in Optical Coherence Tomography and Coherence Domain Optical Methods in Biomedicine XV, vol. 7889 (International Society for Optics and Photonics, 2011), p. 78890N.

63. Z. A. Ali, A. Maehara, P. Généreux, et al., “Optical coherence tomography compared with intravascular ultrasound and with angiography to guide coronary stent implantation (ILUMIEN III: OPTIMIZE PCI): a randomised controlled trial,” The Lancet 388(10060), 2618–2628 (2016). [CrossRef]

64. G. L. Jones, Q. Xiong, X. Liu, et al., “Single-input polarization-sensitive optical coherence tomography through a catheter,” Biomed. Opt. Express 14(10), 4609–4626 (2023). [CrossRef]

65. M. Haft-Javaherian, “OCTseg,” GitHub, 2019, https://github.com/mhaft/OCTseg

Class	Label	Definition
1	Outside	external elastic lamina and deeper tissues
2	Lumen	interior of the convex vessel lumen
3	Intima	visible tunica intima
4	Media	visible tunica media
5	G-Shadow	guidewire shadows between lumen and EEL
6	P-Shadow	plaque shadows between lumen and EEL

Label	Sensitivity	Specificity	Accuracy	Dice
Outside	99.0%	99.5%	99.3%	99.3%
Lumen	99.7%	99.9%	99.8%	99.7%
Intima	86.1%	99.7%	98.4%	91.0%
Media	78.2%	99.5%	99.0%	79.5%
G-Shadow	94.6%	97.5%	97.3%	83.8%
P-Shadow	86.7%	80.2%	82.4%	76.7%

$L_{W C E}$	$L_{D i c e}$	$L_{B P}$	$L_{A P}$	$L_{B C}$	Accuracy	Dice	MHD ( $μ m$ )
✓	✓	✓	✓	✓	96.0%	88.3%	2.3
✓	✓	✓	✓	✕	95.3%	86.5%	3.2
✓	✓	✓	✕	✕	96.1%	88.3%	3.8
✓	✓	✕	✕	✕	94.3%	84.7%	6.0
✓	✕	✕	✕	✕	94.6%	83.8%	7.2

Model	Dice Coefficient	Other Metrics
Abdolmanafi et al. [44]	-	Acc=96%
Akbar et al. [29]	-	TPR=93.1%
Bologna et al. [26]	-	TPR=97.4%
Bologna et al. [26]	-	TNR=99.5%
Cao et al. [32]	98.1%	-
Cheimariotis et al. [59]	93.5%	-
Chen et al. [24]	-	ADE=2.37 $μ m$
Gharaibeh et al. [42]	98%	Acc=98%
Joseph et al. [31]	78%	-
Macedo et al. [27]	97.5%	-
Macedo et al. [28]	97.0%	-
Moraes et al. [25]	97.1%	-
Olender et al. [38]	95.9%	-
Olender et al. [60]	-	Acc=94.9%
Tsantis et al. [35]	96.7%	-
Tung et al. [61]	97%	-
Ughi et al. [30,32]	96.9%	-
Wang et al. [62]	97%	-
Yang et al. [39]	97.6%	-
Yong et al. [41]	98.5%	-
Zhu et al. [33]	94.6%	Acc=98.0%
Our method	99.7%	ADE=2.36 $μ m$
		Acc=99.8%
		TPR=99.7%
		TNR=99.9%

Study	Absolute Distance Error ( $μ$ m)
Study	Outer Lumen	Outer Intima	Outer Media
Zahnd et al. [23]	-	$29 \pm 46$	$30 \pm 50$
Chen et al. [24]	$2.37 \pm 1.84$	$13.61 \pm 27.22$	$16.43 \pm 30.32$
Our method	$2.36 \pm 3.88$	$6.89 \pm 9.99$	$7.53 \pm 8.64$

Segmentation of anatomical layers and imaging artifacts in intravascular polarization sensitive optical coherence tomography using attending physician and boundary cardinality losses

Abstract

1. Introduction

2. Related work

3. Methods

3.1 Segmentation loss

3.2 Boundary loss

3.3 Attending physician loss

3.4 Topological loss

3.5 CNN architecture

3.6 Training and Implementation

3.7 Post-processing

3.8 Performance metrics

4. Data

5. Results

6. Discussion

7. Conclusion

Funding

Acknowledgment

Disclosures

Data availability

References

Supplementary Material (1)

Data availability

Cited By

Figures (3)

Tables (5)

Equations (16)

Biomedical Optics Express

Mohammad Haft-Javaherian	https://orcid.org/0000-0002-8551-5008
Martin Villiger	https://orcid.org/0000-0003-3819-1271