Anomaly Detection of Light-Emitting Diodes Using the ... - IEEE Xplore

Report 4 Downloads 82 Views
1852

IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, VOL. 10, NO. 3, AUGUST 2014

Anomaly Detection of Light-Emitting Diodes Using the Similarity-Based Metric Test Moon-Hwan Chang, Chaochao Chen, Diganta Das, Member, IEEE, and Michael Pecht, Fellow, IEEE

Abstract—Today’s decreasing product development cycle time requires rapid and cost-effective reliability analysis and testing. Qualification is the process of demonstrating that a product is capable of meeting or exceeding specified requirements. Lightemitting diode (LED) qualification tests are often as long as 6000 h, but this length of time does not guarantee the typically required lifetime of 10 years or more. This paper presents a prognostics-based technique that reduces the LED qualification time. An anomaly detection technique called the similarity-based metric test is developed to identify anomalies without utilizing historical libraries of healthy and unhealthy data. The similaritybased metric test extracts features from the spectral power distributions (SPDs) using peak analysis, reduces the dimensionality of the features using principal component analysis, and partitions the data set of principal components into groups using a k-nearest neighbor (KNN)-kernel density-based clustering technique. A detection algorithm then evaluates the distances from the centroid of each cluster to each test point and detects anomalies when the distance is greater than the threshold. From this, the dominant degradation processes associated with the LED die and phosphors in the LED package can be identified. In our case study, anomalies were detected at less than 1200 h using the similarity-based metric test. Thus, our method could decrease the amount of LED qualification testing time by providing users with an earlier time to begin remaining useful life prediction without waiting 6000 h as required by industrial standards. Index Terms—Anomaly detection, clustering, light-emitting diode (LED), prognostics and health management (PHM), prognostics-based qualification, similarity-based metric test.

I. I NTRODUCTION

L

IGHT-EMITTING diodes (LEDs) have been implemented in a wide range of applications, such as medical devices, automobiles, and general lighting, because they enable design flexibility, have low power consumption, and are eco-friendly (e.g., no mercury) [1]–[3]. The U.S. Energy

Manuscript received February 26, 2014; revised April 22, 2014; accepted May 23, 2014. Date of publication June 30, 2014; date of current version August 05, 2014. This work was supported in part by the National Natural Science Foundation of China (NSFC) under Grant 71231001. Paper no. TII14-0238. M.-H. Chang was with the Center for Advanced Life Cycle Engineering, University of Maryland, College Park, MD 20742 USA. He is now with IT Optical and Mechanical Development Group, Samsung Display Co., Ltd., Yongin-si 446-711, South Korea (e-mail: [email protected]). C. Chen is with the DEI Group, Millersville, MD 21108 USA, on leave from the Center for Advanced Life Cycle Engineering, University of Maryland, College Park, MD 20742 USA (e-mail: [email protected]). D. Das and M. Pecht are with the Center for Advanced Life Cycle Engineering, University of Maryland, College Park, MD 20742 USA (e-mail: [email protected]; [email protected]). Color versions of one or more of the figures in this paper are available online at http://ieeexplore.ieee.org. Digital Object Identifier 10.1109/TII.2014.2332116

Independence and Security Act of 2007 encourages the use of energy-efficient alternatives, such as LEDs, over incandescent bulbs [4]. However, customers want LED manufacturers to guarantee the lifetimes of their LED products. This has created a competitive edge for LED manufacturers who can guarantee the reliability of their products for the widest array of applications, while getting their products to market more quickly than their competitors. Qualification tests are conducted to meet the reliability expectations of commercial end users [5], [6]. The performance of LEDs is evaluated during qualification tests, where the performance of the product consists of quality (which includes function) and reliability, and the requirements are set during product design. Qualification test times vary depending on the manufacturer [7]–[9]. Some qualification tests require testing the products to failure in order to provide reliability information about the product [10]. As a result, it can take several years to observe failures under the operating life test conditions. Accelerated testing is used to reduce the qualification test time and predict the lifetime of LEDs by multiplying the estimated lifetime based on the projection of light output degradation (up to 70% of the original light output) by an acceleration factor, often based on the Arrhenius model [11], [12]. In addition to light output degradation, LED manufacturers track color shift during qualification tests. LED lifetime is defined by the general lighting industry as the time to reach 70% light output [13] and 0.007 color shift [14] on the Commission Internationale De L’eclairage (CIE) 1976 chromaticity diagram. The Illuminating Engineering Society (IES) recommends that LED manufacturers collect color data from LEDs over 6000 h of operation while collecting light output data [13]. The U.S. Department of Energy (DOE) also published the “Energy Star Program Requirements: Product Specification for Luminaires” with the Environmental Protection Agency (EPA) to address LED color failure [15]. It describes color shift using the sevenstep standard deviation of color matching (SDCM) in product qualification tests, where the change in chromaticity over the first 6000 h of LEDs should be within 0.007 on the CIE 1976 chromaticity diagram. Color degradation is a critical issue for some application areas of LEDs, such as museum lighting or decorative lighting, as one of the benefits of LEDs is that they can produce a wide range of colors from 2500 K to 12 000 K [1]. If an LED product cannot maintain its initial color properties, the product will lose its advantage over traditional lighting sources, such as fluorescent lamps or incandescent bulbs. The qualification

c 2014 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission. 1551-3203  See http://www.ieee.org/publications standards/publications/rights/index.html for more information.

CHANG et al.: ANOMALY DETECTION OF LED USING THE SIMILARITY-BASED METRIC TEST

1853

Fig. 1. SPD of white LED with die SPD and phosphor SPD.

testing method for color failure utilizes the 0.007 color shift without identifying whether it was die degradation or phosphor degradation. However, if qualification testing could identify the parts in an LED that degrade, such as the die, phosphors, or encapsulant, then the product design, including the geometric dimensions and material properties, can be optimized to meet the specified targets for products. Light and color output are evaluated from the LED spectral power distribution (SPD). An example SPD for a phosphorconverted white LED is shown in Fig. 1. The phosphors convert some portion of the short wavelength light from the LED (i.e., leaked blue light) into long wavelength light (phosphorconverted light), and the LED light combined with the downconverted light produces the desired white light. Depending on the product design, application, cost, and material properties, the number of SPD components can be two or more. LED phosphors are embedded inside a resin that surrounds the LED die. The two SPDs in the LED SPD show the effects of die degradation, phosphor degradation, and package degradation. A life prediction method based on the color failure of LEDs has not yet been developed. It is difficult to extrapolate color change because of the differences in the design, materials, manufacturing processes, and optics applied to LEDs and in their use conditions. LED color change has been studied under different current loads [16]–[18], but there is no model that provides detailed information on color degradation. Therefore, an anomaly detection technique is needed for qualification testing to obtain an earlier time to predict the remaining useful life of the products so that accurate prognostics-based qualification can be achieved that does not require 6000 h or increase the prediction error. Prognostics and health management (PHM) is an enabling discipline consisting of technologies and methods to assess the reliability of a product in its actual life cycle conditions to determine the advent of failure and mitigate system risk by detecting anomalies and predicting the remaining useful life of products [19]–[21]. Several previous studies have been conducted on anomaly detection in LEDs. Sutharssan et al. [22] conducted real-time monitoring and detected anomalies with Mahalanobis distance (MD) and Euclidean distance (ED) using applied voltage and three (current, light, and temperature) sensor voltages. The thresholds were identified as the points where the light output started to decrease continuously for each LED, and the minimum values for detection were the

Fig. 2. Anomaly detection scheme with the similarity-based metric test over the entire SPD.

thresholds of MD and ED. Avenel [23] developed a method to identify at least one faulty LED in a string of LEDs using voltage responses. Fan et al. [24] studied anomaly detection of LEDs for chromaticity shift of high-power white LEDs using the MD approach. However, there has been little work on anomaly detection of LED color degradation by utilizing SPD analysis to provide more detailed information than simply chromaticity shift. A method for conducting anomaly detection on the color failure of LEDs is presented in Section II. Features were extracted by analyzing the data of SPDs, which are defined as the power per unit area per unit wavelength of a radiant existence. The features were reduced by principal component analysis. Then, the principal components were partitioned into clusters. Anomalies were detected when the similarity distance was greater than the predetermined thresholds. Sections III and IV present a discussion and conclusion. II. A NOMALY D ETECTION F RAMEWORK Anomaly detection was performed by following the steps in Figs. 2 and 3. Fig. 2 shows an approach that uses the entire SPDs for anomaly detection. Fig. 3 shows an approach that utilizes each specific SPD, including the die SPD and phosphor SPD. Individual anomaly detection with each SPD (i.e., die and phosphor) in the entire SPD was conducted to identify the die and the phosphor degradation or to determine whether the die or phosphors degrade faster depending on the failure mechanisms of the LEDs. The die SPD and phosphor SPD were separated based on the wavelength range (in this case study, 380–495 nm for the die SPD and 495–745 nm for the phosphor SPD) in the entire SPD. Song and Han [25], [26] also stated that an SPD is required to analyze the performance of LEDs, since the die and phosphor SPDs are changed by different degradation mechanisms. The detection method is described in Section II as follows. Twelve features [e.g., peak area, full width at half maximum (FWHM), and peak centroid] were extracted from each die and phosphor SPD (see Section II-A). Then, the features from all

1854

IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, VOL. 10, NO. 3, AUGUST 2014

Fig. 5. Data collection.

Fig. 3. Anomaly detection scheme with similarity-based metric test identifying parts that degraded in the LED package.

Fig. 6. Color failure result (seven-step SDCM).

Fig. 4. LEDs mounted on the MCPCB.

the component SPDs (i.e., using the entire SPD) in Fig. 2 and the features from each separate SPD in Fig. 3 were reduced to three principal components to reduce the dimensionality (see Section II-B). Each data set (the entire SPD, die SPD, and phosphor SPD) was partitioned into clusters using a k-nearest neighbors (KNN)-kernel density-based clustering technique (see Section II-C). The similarity-based metric test was developed to evaluate the distance from each centroid in each individual cluster to the test data points to conduct anomaly detection (see Section II-D). If the distance was greater than the predetermined threshold, an anomaly was detected. Otherwise, the algorithm continued to measure the distance between the centroids and the test data points in threedimensional PC space. A. Feature Extraction Data from an aging test were used to develop and validate the anomaly detection process. The test samples in this study were 3 W InGaN white LEDs. Sixteen LEDs samples were mounted on an aluminum metal core-printed circuit board (MCPCB), as shown in Fig. 4. LEDs were in contact with the surface of the top copper (Cu) trace layer of the aluminum MCPCB by SnPb solder. The MCPCB dimensions were

223 mm × 105 mm × 1.6 mm, and the pitch between each LED was 25 mm in each direction. The test setup is shown in Fig. 5. The test condition was a 350-mA constant current (i.e., the typical current recommended by its manufacturer) and a chamber temperature of 40 ◦ C (as the third temperature by the IES-LM-80-08 standard). The junction temperature was expected to stay below the absolute maximum rating junction temperature of 135 ◦ C [27]. The SPDs of all 16 LEDs were measured after every 22.5 h of exposure to this condition. The relative humidity conditions in both the aging and test environments were controlled in the same conditions and chamber. The failure criterion was a seven-step SDCM for color shift. The times to color failure data are plotted with a threeparameter Weibull in Fig. 6. The shape parameter was 1.2, the scale parameter was 91.3, and the location parameter was 1859.6. The time-to-failure (TTF) range for color failure was 1891–2206 h. The mean time to color failure was 1945.5 h (referred to as the unadjusted plot) in Fig. 6. Features were extracted from both the LED die SPD and phosphor SPD, as shown in Fig. 7. The LED die SPD was the wavelength range between 380 and 495 nm, and the phosphor SPD was the wavelength range between 495 and 745 nm. Twelve features were extracted from each SPD: peak area (denoted as v1), average of the peak (v2), peak centroid (v3), peak height (v4), root-mean-square (RMS) (v5), crest factor (v6), standard deviation (v7), skewness (v8), kurtosis (v9), FWHM (v10), peak wavelength (v11), and left half width (v12). The entire SPD was defined by a function f (x) in the range of wavelength x of a and b (i.e., a < x < b), as

CHANG et al.: ANOMALY DETECTION OF LED USING THE SIMILARITY-BASED METRIC TEST

1855

v8 =

v9 =

1 N 1 N

1 N 1 N

N

− v2)3

3/2 N 2 (f (x ) − v2) i i=1 i=1 (f (xi )

(8)

N

− v2)4

2 − 3 N 2 (f (x ) − v2) i i=1 i=1 (f (xi )

(9)

v10 = x2 − x1

(10)

v11 = peak wavelength

(11)

v12 = v11 − x1

(12)

where x1 , x2 , v4, v10, and v11 are shown in Fig. 8, and N is the number of data points in the SPD by optical measurements.

Fig. 7. Feature extraction from SPD.

B. Dimensionality Reduction Using Principal Component Analysis

Fig. 8. Description of features in the SPD.

shown in Fig. 8. All features are described in (1)–(12) in mathematical terms b v1 =

f (x)dx

(1)

a

1 v2 = b−a

b f (x)dx

(2)

a

b

xf (x)dx v3 = a b f (x)dx a

(3)

v4 = h

(4)

  b   1 v5 = RMSx =  f (x)2 dx b−a

(5)

a

v6 =

h RMSx

(6) C. KNN-Kernel Density-Based Clustering

 ⎤2 ⎡ ⎤2 ⎡  b b  1 1 f (x)2 dx⎦ − ⎣ f (x)dx⎦ (7) v7 = ⎣ b−a b−a a

Principal component analysis, an exploratory data analysis technique, was used to transform the 24 extracted features (12 from the die SPD and 12 from the phosphor SPD) into principal components. A number of training data points (i.e., the initial test data points) can be applied to detect anomalies. Having a larger number of training data points reduces the error in anomaly detection because more training data points include more information on the degradation trend of LEDs. To analyze the detection accuracy into the number of training data points, anomaly detection using the entire SPD, with die and phosphor SPDs together, was conducted with four different training data sets: 10 data points, 20 data points, 30 data points, and 40 data points from each LED. For each LED, 123 data points were collected. The data points that were not used in the data set were used for testing (i.e., anomaly detection). In addition to detection with the entire SPD, anomaly detection was conducted for each SPD from each LED with 10 data points, 20 data points, 30 data points, and 40 data points. Each training data set was used to evaluate the loading matrix and variances for the principal component analysis. The principal component scores for the test data points were obtained by multiplying the loading matrix and the feature vectors. The 12 features for the die SPD and the phosphor SPD were reduced to three principal components using the Scree test results for dimensionality reduction, as shown in Figs. 9 and 10, respectively. The Scree test plots the principal components in the x-axis and their corresponding eigenvalues (i.e., variances of principal components) in the y-axis. All of the points along the level part of the line, including the transition point, are dropped, and three points are counted along the precipitously dropping part of the line.

a

Clustering is the process of partitioning a set of data (or objects) into sets of meaningful subclasses called clusters. A cluster is a collection of data (or objects) that are similar (based on proximity measures) to one another, and thus can

1856

IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, VOL. 10, NO. 3, AUGUST 2014

Fig. 11. KNN-kernel density-based clustering.

Fig. 9. Scree test results for the die SPD using 480 data points (i.e., 30 data points from each LED).

1  K((x − xj )./Hx ) (14) ni Vx x ∈ω j i where ni is the size of cluster ωi and ni = N . Hence, (13) pˆ(x|ωi ) =

i

can be modified using (14) as    1 K((x − xl )./Hx ) p(ωi ) ni Vx x ∈ω l ⎛i ⎞ 1 ⎝ > K((x − xl )./Hx )⎠p(ωj ) nj Vx x ∈ω l

Fig. 10. Scree test results for the phosphor SPD using 480 data points (i.e., 30 data points from each LED).

where p(x|ωi ) is the class-conditional density function at x of each class ωi , and p(ωi ) is the prior probability function. The class-conditional density function for a nonparametric KNNkernel density-based clustering technique is such that

∀j = i. (15)

If the prior probability functions of p(ωi ) and p(ωj ) are n simplified as nNi and Nj , respectively, we obtain Bayes’ decision rule for KNN-kernel density-based clustering   K((x − xl )./Hx )> K((x − xl )./Hx ) ∀j = i. xl ∈ωi

be treated collectively as a group [28], [29]. It is difficult to define clusters and to determine the number of clusters in data when clusters have different sizes, densities, and shapes [30], [31]. Also, it is challenging to find clusters in data when the data contain a large amount of noise and outliers, particularly with higher dimensional data. Density-based clustering algorithms define clusters based on the density of data points in a region. In other words, density-based algorithms define a cluster as a region in the data space that exceeds a given density threshold. One advantage of density-based clustering is that it can identify clusters of arbitrary shapes. For anomaly detection in this paper, clusters were partitioned with a KNN-kernel density-based clustering technique. The KNN-kernel density-based clustering method is based on a combination of nonparametric KNN and kernel density estimation methods [32]. The KNN-kernel density estimation technique makes it possible to model clusters of different densities in high-dimensional data sets. The KNN-kernel density-based clustering satisfies the classification rule (i.e., a method to assign objects, or data, to clusters) based on KNN-kernel density estimates [33], [34]. The common classification rule is based on Bayes’ decision rule (13) p(x|ωi )p(ωi ) > p(x|ωj )p(ωj ) ∀j = i

j

xl ∈ωj

(16)

The algorithm of KNN-kernel density-based clustering follows the steps shown in Fig. 11. The first step is to input data with N single clusters. A KNN table (N × k) is then formed, which lists the kth nearest neighbor distances for each of the N data sets. Cluster memberships are assigned using the class condition shown in (15), and the cluster memberships are recalculated if the cluster memberships change in order to maximize the total class conditional density of the point xi being assigned to cluster c as D=

N 

pˆ(xi |c).

(17)

i=1

The KNN-kernel density-based algorithm adjusts and selects all parameters and conditions except for the user-defined parameter k, which is the number of neighborhood points. The number of clusters decreases with a larger k. In this study, the optimal number for k was selected by trying multiple values of k and picking the one that captured the features of the data set with three-dimensional (3-D) graphical exploratory data analysis. The optimal numbers for k for 160, 320, 480, and 640 training data points with the entire SPD were 12, 19, 22, and 28, respectively. The number of partitioned clusters from the four training data sets was 9 for a data set with 160 training data points, 10 for a data set with 320 training data points, 8 for a data set with 480 training data points, and 14 for a data set with 640 training data points. The results show that having more training data results in better cluster formation.

CHANG et al.: ANOMALY DETECTION OF LED USING THE SIMILARITY-BASED METRIC TEST

Fig. 12. WCSS for die SPD training data with 480 data points.

1857

Fig. 14. KNN-kernel density-based clustering for die SPD training data with 480 data points.

Fig. 13. WCSS for phosphor SPD training data with 480 data points.

To identify the degradation components in the LED package, the optimal numbers for k were 60 (for 480 training data points) and 68 (for 640 training data points) using the triangular kernel function for both the die SPD and the phosphor SPD. In addition to the 3-D graphical exploratory data analysis, a plot similar to the Scree test was used to determine the correct number of clusters in the data set. The “protrusion of the curve” in the plot (along the sharply declining part of the line) was considered based on within-cluster sum of squares (WCSS) to choose the number of clusters. The WCSS was defined for given a set of observations (x1 , x2 , . . . , xn ), where each observation is a d-dimensional real vector. The KNN-kernel density-based clustering partitions the n observations into k sets (k ≤ n) of clusters C = {C1 , C2 , . . . , Ck } with the centroid νj in cluster Cj [35], [36] WCSS =

k 



2

xj − vj  .

(18)

i=1 xj ∈Cj

Two training data sets (one for die SPD and the other for phosphor SPD) were partitioned with seven clusters by the algorithm for a data set with 480 training data points, as shown in Figs. 12 and 13. Elbows (i.e., protrusion of the curve) were observed in the third and seventh clusters shown in Fig. 12 and in the second and seventh clusters shown in Fig. 13. Due to multiple bins (i.e., classification) of LEDs based on optical properties (such as light output and color) in the specification sheet, the degradation paths and patterns of LEDs were grouped in seven clusters rather than in two or three clusters.

Fig. 15. KNN-kernel density-based clustering for phosphor SPD training data with 480 data points.

Anomaly detections were made with three clusters for die SPD training data and two clusters for phosphor SPD training data with 480 data points. The detection results missed the anomalies. Seven clusters were utilized for both die SPD, as shown in Fig. 14, and phosphor SPD training data, as shown in Fig. 15. With the same algorithm, two training data sets with 640 training data points were partitioned with eight clusters.

D. Similarity-Based Metric Test for Anomaly Detection of LEDs After partitioning the training data into clusters using the KNN-kernel density-based clustering technique, the similaritybased metric test (depicted in Fig. 16) was conducted to detect anomalies in the color change. When the training data were partitioned with m clusters, each centroid (Vj ) from each cluster was evaluated. Then, the distance (D) between one data point and each cluster was evaluated with the Euclidian distance from the data point to the centroid. The distance from each cluster was evaluated and compared to a predetermined threshold. The detection threshold (Tj ) for anomaly detection was the threshold of the cluster with the shortest distance (Dj ) from the threshold (i.e., the maximum Tj −Dj distance). This procedure was repeated for the next test data point if the distance (Dj ) was smaller than the detection threshold (Tj ). If the distance (Dj ) was greater than the detection threshold (Tj ), then the algorithm detected an anomaly.

1858

IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, VOL. 10, NO. 3, AUGUST 2014

Fig. 16. Similarity-based metric test for anomaly detection.

The mean radius (R(Cj )) is the average distance from member points in the cluster to the centroid of each cluster. In other words, the mean radius (R(Cj )) is a measure of the tightness of the cluster around the centroid. Given m-dimensional data vectors vi in a cluster Cj = {vi |j = 1, 2, . . . , m }, the centroid vj and mean radius R(Cj ) are evaluated as [37], [38] m vi (19) vj = i=1 m  m 2 i=1 (vi − vj ) . (20) R(Cj ) = m If the data form a spherical shape, the detection threshold (Tj ) is ideally the mean radius (R(Cj )) of the closest cluster. The data shown in Figs. 14 and 15 are not spherical, but elliptical. The elliptical shape has regions of clusters wider than the mean radius (R(Cj )) and regions of clusters narrower than the mean radius (R(Cj )). So, the accuracy of the detection is determined by the detection threshold. If the cluster shapes are not spherical, then the threshold is in a range between the minimum distance from the centroid (i.e., the distance between the closest member point and the centroid) and the maximum distance from the centroid (i.e., the distance between the farthest member point and the centroid) in the cluster. The threshold changes the sensitivity of the anomaly detection. In other words, the fault detection rate increases if the detection threshold (Tj ) is close to the minimum distance from the centroid in the cluster, whereas the missed alarm rate increases if the detection threshold (Tj ) is close to the maximum distance from the centroid in the cluster. To improve the accuracy of anomaly detection, the distributions of points in a cluster are considered to define a new threshold for each cluster. In this study, a detection threshold metric was developed as Detection threshold (T ) = mean radius (r) +

standard deviation (σ) dimensional factor (d)

(21)

where the standard deviation (σ) is the standard deviation of all distances of member points from the centroid in the cluster

and the dimensional factor (d) is the number of dimensions of the data space. In this study, the dimensional factor (d) was 3, because data analysis was conducted in 3-D principal component space, as shown in Figs. 2 and 3. This metric minimizes the error from the cluster shape and the data distribution of the cluster. The anomaly detection threshold is determined with data characteristics in terms of the similarity-based metric (i.e., unit) based on feature extraction from SPD, dimensionality reduction in principal components, and grouping with the KNN-kernel density-based clustering technique. The data characteristics are based on the degradation of LEDs in each test condition and on the device under test (DUT). Anomalies are detected with the early signature of degradation characteristics in terms of the new similarity-based metric. For this reason, the detection threshold does not change in terms of n-step SDCM (e.g., even if the conventional color failure threshold does change from 0.007 threshold to 0.004). In the case of 0.004 color shift, 94% (i.e., 15 of 16) of anomalies was successfully detected using the same detection thresholds as in the case of 0.007 color shift. This means that the developed threshold defining method is more consistent than the conventional n-step SDCM method. 1) Anomaly Detection Using the Entire SPD: For the entire SPD, including both the die SPD and the phosphor SPD, 160 training data points were used to detect anomalies, with the thresholds of both the mean radius in (20) and the developed detection threshold in (21). The centroids of the clusters were evaluated using (19) for the ten clusters. Data were collected for 123 days in total, with 113 days for anomaly detection after the initial data points and 10 days to create the clusters. When a new data point was collected during the accelerated test, the closest cluster was found by evaluating the distance (Dj ) of the data point from the each centroid (Cj ). Anomalies were detected when Dj was greater than Tj , as shown in Fig. 16. Data points from LED 15 are used to illustrate this detection scheme. Detailed distance plots were constructed for the entire data collection period from clusters 1 to 8 for LED 15 utilizing the developed detection thresholds. Clusters 1 and 2 gave false alarms, as shown in Figs. 17 and 18, respectively, with an alarm at 11 days immediately after starting the detection algorithm. An anomaly was detected at day 72 from cluster 8, as seen in Fig. 19. The anomaly was detected in cluster 8 because the maximum Tj − Dj distance came from cluster 8. The actual TTF for LED 15 was day 88. The distance plots of clusters 3, 4, and 5 are similar to cluster 1, as shown in Fig. 17, while the distance plots of clusters 6, 7, and 9 are similar to cluster 2, as shown in Fig. 18. Application of the developed detection threshold improved the anomaly detection rate from 25% (4 of the 16 LEDs) with the mean radius metric to 50% (8 of the 16 LEDs) for the case with 160 training data points, as shown in Table I. A false alarm was defined as when anomalies were detected immediately after beginning the algorithm (within 5% of the ratio of the detection time divided by the actual TTF).

CHANG et al.: ANOMALY DETECTION OF LED USING THE SIMILARITY-BASED METRIC TEST

1859

TABLE I A NOMALY D ETECTION R ESULTS U SING S IMILARITY-BASED M ETRIC T EST

Fig. 17. Distance measure of cluster 1 from LED 15.

Fig. 18. Distance measure of cluster 2 from LED 15. Fig. 20. Anomaly detection results with 640 training data points using the entire SPD.

Fig. 19. Distance measure of cluster 8 from LED 15.

Detection results with 320, 480, and 640 training data points were also improved using the detection threshold in (21). Table I summarizes the anomaly detection results for LEDs based on the similarity-based metric test. As the number of training data points increases, the anomaly detection rate increases. Anomalies were successfully detected in 81.25% of LEDs using 14 clusters with 640 training data points, as shown in Fig. 20. The algorithm missed an alarm for LED 14, where the anomaly was detected at day 88, but the actual TTF was day 84 (Fig. 20). The detection rate increases with the number of training data points. The initial data points have variation caused by the short-term aging effect of the color degradation parameters. When the number of training data points increases, the short-term aging effect is decreased and the detection accuracy increases. The detection results showed that

the similarity-based metric test can provide advance warning of failures. 2) Anomaly Detection Using Die SPD and Phosphor SPD: An advanced technique that does not necessarily wait until the signature of the anomalies is close to the TTF, as shown in Fig. 20, is required for the fast qualification of LEDs. The die SPD and the phosphor SPD were independently utilized for anomaly detection to identify the die and phosphor degradation and to determine whether the die or phosphors degrade faster. As a case study, the training data set with 480 data points (i.e., 30 data points from each of the 16 LEDs) was partitioned into seven clusters, as shown in Fig. 14 for the die SPD and Fig. 15 for the phosphor SPD, and 93 data points from each LED were used for anomaly detection. LED 3 is used to illustrate the detection scheme. First, the centroids of the clusters were evaluated using (19) for seven clusters. Then the developed threshold (Tj ) of each cluster from the die SPD and phosphor SPD were calculated. The distances from the centroids of the clusters to the new data points of LED 3 were evaluated for the entire data collection period. The Tj − Dj distances over time were greatest at cluster 4 for the die SPD and cluster 2 for the phosphor SPD. An anomaly was detected at day 59 from cluster 4 for the die SPD, as seen in Fig. 21. An anomaly was detected at day 83 from cluster 2 for the phosphor SPD, as shown in Fig. 22. The actual TTF for LED 3 was day 91.

1860

IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, VOL. 10, NO. 3, AUGUST 2014

Fig. 24. SPD changes at different detection times for LED 3. Fig. 21. Distance measure of cluster 4 from LED 3 for the die SPD.

Fig. 22. Distance measure of cluster 2 from LED 3 for the phosphor SPD.

Fig. 23. Anomaly detection using 480 data points (i.e., 30 data points from each LED).

Fig. 23 shows the anomaly detection results for all LEDs based on the similarity-based metric test using 480 data points. The results showed that die degradation is an earlier sign of color failure of LEDs than phosphor degradation. Therefore, die degradation initiates color failure of LEDs. As a result, future package designs must consider how to strengthen LED die performance. The die and phosphor degradation can be explained by SPD changes over time during the aging test. The SPDs for LED 3 at day 0, the anomaly detection times for die and phosphor SPDs, and the actual TTF are shown in Fig. 24. The phosphor SPD degraded together with the die SPD due to die degradation. All LEDs degraded in a manner similar to the result in Fig. 24. In Fig. 23, most of the anomalies from

Fig. 25. Anomaly detection using 640 data points (i.e., 40 data points from each LED).

the die SPDs were detected in less than 1000 h. This shows that this test can reduce the time needed for predicting the remaining useful life of LEDs during qualification tests. The starting point for early anomaly detection is the earliest time at which users can begin to predict the remaining useful life of LEDs. For all LEDs, the die SPD increased from day 0 until detection (i.e., days 59 and 83), and then decreased to the initial day 0 value at the actual TTF. On the other hand, the phosphor SPDs decreased from day 0 until the actual TTF. The amount of die degradation was less (i.e., 0.066) than the amount of phosphor degradation (i.e., 0.143), but the die degradation detected the anomalies earlier than the phosphor degradation. As seen in Fig. 25, using the same algorithm with 640 data points, the anomalies were first detected from the die degradation and later from the phosphor degradation. The anomalies were detected at about 1200 h in the die SPD data. The detection time for both the die and phosphor SPDs increased due to the larger amount of training data (640 data points vs. 480 data points), and therefore provided more information about the degradation trends.

III. D ISCUSSION This is the first study to use SPD for anomaly detection in qualification tests. SPD, which denotes the radiant power at each wavelength per wavelength interval in the visible spectrum, can be used to obtain degradation information for LEDs. The similarity-based metric test utilizes features from

CHANG et al.: ANOMALY DETECTION OF LED USING THE SIMILARITY-BASED METRIC TEST

the SPDs as leading indicators. The entire SPD of the tested LEDs includes two parts: 1) SPD from the LED die; and 2) SPD from the phosphors. LED phosphors are embedded inside a resin that surrounds the LED die. The blue light emitted from the die excites the phosphors, which then emit yellow light; the blue light and yellow light combine to emit white light. Previous research driven by industrial standards has utilized u v color shift in the CIE 1976 chromaticity diagram to explain color change in LEDs. Color shift can be used to identify color change; however, it cannot explain the internal phenomena of how each optical element in the entire SPD degraded. The aging time varies depending on the range of the color temperatures (usually having different spectral shapes), the test condition with applied current, the ambient temperature, and the humidity. In the test conditions recommended by the IES standards, the measurement has to be continued for at least 6000 h. When the test conditions are overstress test conditions (i.e., more severe than the IES test conditions), the number of training data points can be modified based on the level of stress and possible type of failure mechanisms for quick anomaly detection. The similarity-based metric test was developed to diagnose degradation in the die and phosphors inside LEDs using the KNN-kernel density-based technique to cluster the data with independent optical components in the SPDs and measure the distance from the centroid. The underlying assumption is that die degradation reduces the phosphor SPD as well as the LED die SPD. As the amount of photons extracted from the LED die is reduced, the amount of phosphor light converted from the short wavelength (i.e., the LED die SPD) is also reduced. When phosphors are only degraded by phosphor thermal quenching, the phosphor SPD decreases and the LED die SPD will not change shape, since the LED die emits light as a normal condition independent of phosphor degradation. The similarity-based metric test does not require historical data. It integrates the advantages of anomaly detection under wearout performance degradation and is suitable for the qualification of new products. LED manufacturers can use the similarity-based metric test to increase the quality of their LED products by detecting anomalies early on and fixing potential problems. Combined with a remaining useful life (RUL) prediction method, this process will make product development, design improvement, and qualification better and will reduce qualification testing time. This will enhance the competitiveness of LED manufacturers that employ the qualification method by enabling them to quickly identify bad batches or designs during tests. As LED lighting is a growing field in the world, the similarity-based metric test will enable new technologies to be assessed and improved more rapidly.

IV. C ONCLUSION The similarity-based metric test for LED anomaly detection presented in this paper extracted features from the SPD, reduced the dimensionality of the features by using principal component analysis, and grouped the data set into clusters using the KNN-kernel density-based clustering technique.

1861

Then, the distances from the centroid of each cluster to each test point were measured. The algorithm detected anomalies when the distance was larger than the predetermined threshold. This technique can be applied to any type of phosphorconverted white LED composed of two or more optical elements in terms of different types of dies and phosphors in the entire SPD with various correlated color temperatures such as warm white, neutral white, and cool white LEDs. The similarity-based metric test can significantly decrease the amount of time needed for LED qualification by using prognostic techniques. This anomaly detection method will provide users with an earlier time to begin remaining useful life prediction than LED qualification tests, such as those based on the IES LM-80-08 standard and the U.S. DOE’s “Energy Star Program Requirements: Product Specification for Luminaires,” which require at least 6000 h of test time. The similarity-based metric test presented in this paper can potentially decrease the amount of time needed for qualification testing to about 1000 h. ACKNOWLEDGMENT The authors would like to thank more than 100 companies and organizations that annually support research activities at the Center for Advanced Life Cycle Engineering, University of Maryland, College Park, MD, USA. R EFERENCES [1] M.-H. Chang, D. Das, P. V. Varde, and M. Pecht, “Light emitting diodes reliability review,” Microelectron. Rel., vol. 52, no. 5, pp. 762–782, May 2012. [2] H. Chen and S. Y. Hui, “Dynamic prediction of correlated color temperature and color rendering index of phosphor-coated white lightemitting diodes,” IEEE Trans. Ind. Electron., vol. 61, no. 2, pp. 784–797, Feb. 2014. [3] K. I. Hwu and W. C. Tu, “Controllable and dimmable AC LED driver based on FPGA to achieve high PF and low THD,” IEEE Trans. Ind. Informat., vol. 9, no. 3, pp. 1330–1342, Aug. 2013. [4] 110th United States Congress. (2007, Dec.). Energy Independence and Security Act of 2007. Washington, DC, USA: U.S. Congress. [Online]. Available: http://www.gpo.gov/fdsys/pkg/BILLS-110hr6enr/pdf/BILLS110hr6enr.pdf [5] D. C. Hoang, P. Yadav, R. Kumar, and S. K. Panda, “Real-time implementation of a harmony search algorithm-based clustering protocol for energy-efficient wireless sensor networks,” IEEE Trans. Ind. Informat., vol. 10, no. 1, pp. 774–783, Feb. 2014. [6] A. Koubaa, R. Severino, M. Alves, and E. Tovar, “Improving qualityof-service in wireless sensor networks by mitigating hidden-node collisions,” IEEE Trans. Ind. Informat., vol. 5, no. 3, pp. 299–313, Aug. 2009. [7] H. Chen, B. Y. Huang, and Y. C. Chu, “Degradation mechanisms in GaN light-emitting diodes undergoing reverse-bias operations in water vapor,” Appl. Phys. Lett., vol. 103, pp. 173502-1–173502-3, Oct. 2013. [8] P.-H. Lee, J.-H. Chou, and H.-C. Tseng, “Thermal stability of highpower LEDs analyzed with efficient nondestructive methodology,” IEEE Trans. Device Mater. Rel., vol. 13, no. 3, pp. 401–406, Sep. 2013. [9] Specifications for White LED Model: NCSW119T-H3, Nichia STS-DA12296. Nichia Corp., Tokushima, Japan, 2012, pp. 1–16. [10] D. M. E. Ingram, P. Schaub, R. R. Taylor, and D. A. Campbell, “Performance analysis of IEC 61850 sampled value process bus networks,” IEEE Trans. Ind. Informat., vol. 9, no. 3, pp. 1445–1454, Aug. 2013. [11] Y. Deshayes, L. Bechou, F. Verdier, and Y. Danto, “Long-term reliability prediction of 935 nm LEDs using failure laws and low acceleration factor ageing tests,” Qual. Rel. Eng. Int., vol. 21, no. 6, pp. 571–594, Mar. 2005. [12] J.-S. Jeong, J.-K. Jung, and S.-D. Park, “Reliability improvement of InGaN LED backlight module by accelerated life test (ALT) and screen policy of potential leakage LED,” Microelectron. Rel., vol. 48, no. 8-9, pp. 1216–1220, Aug. 2008.

1862

IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, VOL. 10, NO. 3, AUGUST 2014

[13] Illuminating Engineering Society (IES), Approved Method: Measuring Lumen Maintenance of LED Light Sources, IES LM-80-08. New York, NY, USA: Illuminating Engineering Society (IES), 2008, pp. 1–4. [14] Energy Star Program Requirements for Solid State Lighting Luminaires, Eligibility Criteria—Version 1.1. Washington, DC, USA: U.S. DOE, 2008, pp. 1–23. [15] Energy Star Program Requirements: Product Specification for Luminaires (Lighting Fixtures), Eligibility Criteria—Version 1.2. Washington, DC, USA: U.S. DOE, 2012, pp. 1–36. [16] E. Jung, M. S. Kim, and H. Kim, “Analysis of contributing factors for determining the reliability characteristics of GaN-based white lightemitting diodes with dual degradation kinetics,” IEEE Trans. Electron Devices, vol. 60, no. 1, pp.186–191, Jan. 2013. [17] M. Meneghini, M. D. Lago, N. Trivellin, G. Meneghesso, and E. Zanoni, “Thermally activated degradation of remote phosphors for application in LED lighting,” IEEE Trans. Device Mater. Rel., vol. 13, no. 1, pp. 316–318, Mar. 2013. [18] M. B¨urmen, F. Pernus, and B. Likar, “Accelerated estimation of spectral degradation of white GaN-based LEDs,” Meas. Sci. Technol., vol. 18, no. 1, pp. 230–238, Jan. 2007. [19] S. Cheng, M. H. Azarian, and M. G. Pecht, “Sensor systems for prognostics and health management,” Sensors, vol. 10, pp. 5774–5797, Jun. 2010. [20] M. G. Pecht, “Chapter 1. Introduction,” in Prognostics and Health Management of Electronics, 1st ed. Hoboken, NJ, USA: Wiley, 2008, pp. 1–24. [21] S. Cheng, K. Tom, and M. Pecht, “Anomaly detection of polymer resettable circuit protection devices,” IEEE Trans. Device Mater. Rel., vol. 12, no. 2, pp. 420–427, Jun. 2012. [22] T. Sutharssan, S. Stoynanov, C. Bailey, and Y. Rosunally, “Prognostics and health monitoring of high power LED,” Micromachines, vol. 3, no. 1, pp. 78–100, Feb. 2012. [23] J.-J. M. Avenel, “Technique for identifying at least one faulty light emitting diode in a string of light emitting diodes,” U.S. Patent 2012/0206146 A1, Aug. 9, 2012. [24] J. Fan, K. C. Yung, and M. Pecht, “Anomaly detection for chromaticity shift of high power white LED with Mahalanobis distance approach,” in Proc. EMAP, Hong Kong, 2012, pp. 13–26. [25] B. Song and B. Han, “SPD deconvolution scheme for phosphor converted white LED using multiple Gaussian functions,” Appl. Opt., vol. 52, no. 5, pp. 1016–1024, Feb. 2013. [26] B. Song and B. Han, “Analytical/experimental hybrid approach based on spectral power distribution for quantitative degradation analysis of phosphor converted LED,” IEEE Trans. Device Mater. Rel., vol. 14, no. 1, pp. 365–374, Mar. 2013. [27] Avago Technologies, ASMT-Jx3x 3W Mini Power LED Light Source: Data Sheet, AV02-1941EN. CA, USA: Avago Tech., 2009, pp. 1–15. [28] S. B. Kotsiantis and P. E. Pintelas, “Recent advances in clustering: A brief survey,” WSEAS Trans. Inf. Sci. Appl., vol. 1, pp. 73–81, 2004. [29] P. Berkhin. (2001). Survey of Clustering Data Mining Techniques. San Jose, CA, USA: Accrue Software Inc. [Online]. Available: http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.145.895&rep= rep1&type=pdf [30] A. K. Jain, M. N. Murty, and P. J. Flyn, “Data clustering: A review,” ACM Comput. Surv., vol. 31, no. 3, pp. 264–323, Sep. 1999. [31] L. Ert¨oz, M. Steinbach, and V. Kumar, “Finding clusters of different sizes, shapes, and densities in noisy,” presented at the 2nd SIAM Int. Conf. Data Mining, Feb. 2003, pp. 1–12 [Online]. Available: http://wwwusers.cs.umn.edu/∼kumar/papers/kdd02 snn 28.pdf [32] T. N. Tan, R. Wehrens, and L. M. C. Buydens, “KNN-kernel densitybased clustering for high-dimensional multivariate data,” Comput. Stat. Data. Anal., vol. 51, no. 2, pp. 513–525, Nov. 2006. [33] A. Webb, “Density estimation—Nonparametric,” in Statistical Pattern Recognition, 2nd ed. Hoboken, NJ, USA: Wiley, 2002, pp. 81–122. [34] S. Zhong and J. Ghosh, “A unified framework for model-based clustering,” J. Mach. Learn. Res., vol. 4, pp. 1001–1037, Nov. 2003. [35] A. W. F. Edwards and L. L. Cavalli-Sforza, “A method for cluster analysis,” Biometrics, vol. 21, no. 2, pp. 362–375, 1965. [36] J. A. Hartigan and M. A. Wong, “Algorithm AS 136: A k-means clustering algorithm,” J. Roy. Stat. Soc. Ser. C., vol. 28, no. 1, pp. 100–108, 1979. [37] T. Zhang, R. Ramakrishnan, and M. Livny, “BIRCH: An efficient data clustering method for very large databases,” in Proc. 1996 ACM SIGMOD Int. Conf. Manage. Data, Montreal, Canada, Jun. 1996, pp. 103–114.

[38] K. Burbeck and S. Nadjm-Tehrani, “ADWICE—Anomaly detection with real-time incremental clustering,” in Proc. 7th ICISC 2004, 2004, pp. 407–424.

Moon-Hwan Chang received the B.S. degree in mechanical engineering from Sogang University, Seoul, South Korea, in 2002; the M.S. degree in mechanical and aeronautical engineering from the University of California at Davis, Davis, CA, USA, in 2004; and the Ph.D. degree in mechanical engineering from the University of Maryland, College Park, MD, USA, in 2014. He is a Senior Engineer with Samsung Display Co., Ltd., Yongin-si, South Korea. He has more than 15 published international patents with regard to light-emitting diode (LED) backlighting systems and LED reliabilities in the United States. His research interests include failure mechanism analysis, anomaly detection, fault diagnostics, and prognostics of electronics, and advanced light source development. Dr. Chang has been a Student Member of the Illuminating Engineering Society of North America (IESNA) for 6 years. He is a Six Sigma Green Belt holder. He received the National Science Foundation (NSF) Fellowship for his contribution in biosensor development. He received the Outstanding Student Paper Award in International Microelectronics Assembly and Packaging Society (IMAPS) Mid-Atlantic Microelectronics Conference, Atlantic City, NJ, USA, in 2011.

Chaochao Chen received the Ph.D. degree in mechanical engineering from the Kochi University of Technology, Kochi Prefecture, Japan, in 2007. He was a Research Fellow with the University of Michigan, Ann Arbor, MI, USA, the Georgia Institute of Technology, Atlanta, GA, USA, and the University of Maryland, College Park, MD, USA, working in prognostics and health management (PHM) areas in collaboration with multiple organizations in industry and the military for over 5 years. He is a Senior Reliability Engineer with the DEI Group, Millersville, MD, USA. He has served as a Session Chair for multiple international conferences, and has been invited to give talks in several international conferences and institutes. He has published over 20 scientific papers in his areas of interest. His research interests include development and implementation of PHM, data mining, reliability analytics, and decision support methods to ensure system-level asset availability and reliability.

Diganta Das (M’00) received the B.Tech. (Hons.) degree in manufacturing science and engineering from the Indian Institute of Technology, Kharagpur, India, in 1987, and the Ph.D. degree in mechanical engineering from the University of Maryland, College Park, MD, USA, in 1999. He is a Member of the Research Staff at the Center for Advanced Life Cycle Engineering (CALCE), College Park. His research interests include reliability, environmental, and operational ratings of electronic parts, uprating, electronic part reprocessing, technology trends in the electronic parts, and parts selection and management methodologies. He has published more than 50 articles on these subjects. He is a Six Sigma Black Belt, and a Member of the International Microelectronics Assembly and Packaging Society (IMAPS). He is an editorial board Member for Microelectronics Reliability and Circuit World. He was the Technical Editor for two IEEE standards, and is currently Vice Chairman of the IEEE Reliability Society Standards Working Group.

CHANG et al.: ANOMALY DETECTION OF LED USING THE SIMILARITY-BASED METRIC TEST

Michael Pecht (S’78–M’83–SM’90–F’92) received the B.S. degree in acoustics, the M.S. degrees in electrical engineering and engineering mechanics, and the Ph.D. degree in engineering mechanics from the University of Wisconsin at Madison, Madison, WI, USA, in 1982. He is the Founder of the Center for Advanced Life Cycle Engineering (CALCE) and the Electronic Products and Systems Consortium, University of Maryland, College Park, MD, USA, where he is also a Chair Professor. He has been leading a research team in the area of prognostics for the past ten years, and has now formed a new Electronics Prognostics and Health Management Consortium at the University of Maryland. He has consulted for over 50 major international electronics companies, providing expertise in strategic planning, design, test, prognostics, international patents (IPs), and risk assessment of electronic products and systems. He has written 18 books on electronic products development, use, and supply chain management. Dr. Pecht is a Professional Engineer and an American Society of Mechanical Engineers (ASME) Fellow. He has received the 3M Research Award for Electronics Packaging, the IEEE Undergraduate Teaching Award, and the International Microelectronics Assembly and Packaging Society (IMAPS) William D. Ashman Memorial Achievement Award for his contributions in electronics reliability analysis. He served as Chief Editor of the IEEE T RANSACTIONS ON R ELIABILITY for eight years and on the advisory board of IEEE S PECTRUM. He is Chief Editor for Microelectronics Reliability, and an Associate Editor for the IEEE T RANSACTIONS ON C OMPONENTS AND PACKAGING T ECHNOLOGY.

1863