# An Efficient Quality-Related Fault Diagnosis Method for Real-Time Multimode Industrial Process.

1. IntroductionWith modern industrial processes getting increasingly complex and large, prevention monitoring and fault diagnosis have become the key to ensure safe operation, improve product quality, and gain economic benefits. Due to the complex operation mechanism, sheer size, complex conditions, chaotic environment, and vague boundary conditions in complex industrial systems, it is quite tough to implement effective process monitoring. As a result, the data-driven process monitoring technology has become one of the research hotspots in the field of fault diagnosis. The core idea of this technique is to establish the data model by means of using historical data, mining useful information, and getting the features of normal and fault operation mode, so as to realize process monitoring. In the last decades, basic multivariate statistical monitoring techniques, such as principal component analysis (PCA) and partial least squares (PLS), have been established and successfully applied in practice [1].

However, PCA or PLS model is established with data which follow the basis hypothesis of data subject to stable single Gaussian mode. Due to the reasons of fluctuation of raw materials, product specifications, and differences among batches, process data show the characteristic of multimode in actual industrial processes especially for batch processes. Considering the problems existing in the multimode process, traditional fault detection methods and their improved algorithms are difficult to be applied directly; otherwise, the performance of data model in process monitoring will be reduced.

Many scholars have studied a lot and made some progress on those problems [1]. Hwang and Han proposed a hierarchical clustering based on the PCA modeling method [2]. Lane et al. proposed a pooled principal component analysis method [3]. However, the ensemble modeling methods, in which the common feature of subspace in each mode is extracted as a unified model, are unable to fully or accurately depict all operation models. Particularly, when there are many differences among various modes, the model characterization in their methods is often biased. Chen and Liu used the heuristic smoothing clustering algorithm to classify data automatically, which can get multiple operating modes [4]. Zhao et al. applied multiple PCA and multiple PLS method to fault monitoring for multimode processes [5], in which the similarity index between different operating models was established and used to analyze the shift between the models. In view of stage division, Doan and Srinivasan modeled different stages of the process, respectively, for fault monitoring [6]. Dealing with the multimode problem of the process, the former divided the process data using the clustering method and then established independent models, so as to make fault monitoring more targeted. However, the above independent modeling methods are often complex, have large calculating quantity, and are usually based on the experience of mode division. Whether the division is reasonable or not will directly affect the quality of monitoring results. All the above increase the difficulties of online monitoring.

Considering the unique advantages in dealing with non-Gaussian data, the Gaussian mixture model (GMM) has not been explored in multimode process monitoring until recently. Choi et al. integrated PCA and DA with GMM to detect and isolate the faults in a process with non-linearity, multistates, or dynamics [7]. Yoo et al. applied a similar strategy into multiway PCA to monitor biological batch processes [8]. However, these methods ignore the possibility that the monitored sample may come from other Gaussian components of lower posterior probabilities, which may lead to biased monitoring results. Yu and Qin proposed a new method that combines finite mixture Gaussian models with Bayesian inference to characterize different operation modes through Gaussian components and then realized fault detection [9]. In recent years, many scholars had proposed different methods to solve multimode monitoring [9].

The main contribution of this paper is summarized as follows. (1) An efficient method for multimode process monitoring based on finite Gaussian mixture models is proposed. (2) A gradient contribution rate is proposed to measure the contribution to the combined index and find out the variable

which should be in charge of the fault in quality. This rate can better show the changes of variables contribution rate over time after fault occurrence.

The remainder of this paper is organized as follows. In Section 2, the descriptions of traditional PCA and PLS models in covariance form are provided, and then the covariance description form of the total projection to latent structures (TPLS) model is derived. Multimode information is extracted from the principal component space by GMM and a new multimode total projection to potential structure (MTPLS) model is established in Section 3. A unified monitoring framework based on MTPLS in combination with Bayesian inference is constructed and quality-related fault monitoring is implemented using a combined index in Section 4. In Section 5, a hot strip mill process is taken as an example to verify the superiority of our new method in fault monitoring and diagnosis over traditional methods. The conclusions and future works are given in Section 6.

2. Multivariate Statistical Theory

2.1. PCA and Covariance Description Form. Principal component analysis model is one of the most basic projection models in multivariate statistical analysis. Let X [member of] [R.sup.N*m] be the dataset of m-dimensional process variables, where N stands for the number of samples. Matrix X can be decomposed into a score matrix and a loading matrix as follows [10]:

X = [??] + E = T[P.sup.T] + E, T = XP, (1)

where T [member of] [R.sup.N*A] and P [member of] [R.sup.m*A] stand for score matrix and loading matrix, respectively, and A is the number of principal components. The covariance matrix of normalized data can be defined as follows:

[[SIGMA].sub.X] [approximately equal to] 1/N-1 [X.sup.T]X. (2)

The PCA loading matrix P can be obtained by eigenvalue decomposition on the covariance matrix [[SIGMA].sub.X].

Based on the projection model, monitoring statistics indexes [T.sup.2] and SPE can be constructed. Let [x.sub.new] [member of] [R.sup.m]; the indexes can be designed as follows:

[mathematical expression not reproducible], (3)

where a denotes the principal component covariance matrix and [T.sup.2.sub.[alpha]] and [[delta].sup.2.sub.[alpha]] are the control limit with the confidence level of [alpha].

When the residual error is subject to normal distribution, Jackson and Mudholkar pointed out that the control limit can be calculated as follows:

[mathematical expression not reproducible], (4)

where [h.sub.0] = 1 - 2[[theta].sub.1][[theta].sub.3]/3[[theta].sup.2.sub.1], [[theta].sub.i] = [[summation].sup.m.sub.j = A+1] [[lambda].sup.i.sub.j] (i = 1,2,3), [c.sub.[alpha]] represents the threshold of standard normal distribution under the confidence level of [alpha], and [[lambda].sub.j] represents the eigenvalue of covariance matrix [[SIGMA].sub.X].

Similarly, in order to apply the sample covariance information into the monitoring index, the principal component covariance matrix can be expressed as

[mathematical expression not reproducible]. (5)

2.2. PLS and Covariance Description Form. In the actual industrial production, the changes of quality variables Y are of more concern, especially for the faults which can cause the change of quality variables. PLS model uses the quality variables to guide the decomposition of sample space.

PLS decomposition of X and Y results in the following:

X = T[P.sup.T] + E, Y = T[Q.sup.T] + F, (6)

where X [member of] [R.sup.N*m], Y [member of] [R.sup.N*l], and score matrix T can be formulated with X as T = XR.

Parameter matrix R can be obtained by the loading matrix P and weight matrix W in PLS iterative calculation, R = W[([P.sup.T]W).sup.-1].

According to the iterative process of the PLS model, Peng et al. proposed a model construction method using data covariance information [11], in which the covariance matrix of data was introduced into the iterative process, and model parameter matrices can be obtained at the same time. Compared with conventional PLS, the model construction method using data covariance information reduced the calculation amount although the intrinsic properties essence was not changed.

Different from the PCA projection model, the decomposition structure of space X in PLS is defined by two matrices P and R, and an oblique projection structure is induced in input space. It is the quality that guides the decomposition of sample space, so that the principal component space is changed. The covariance matrix of the principal component space can be expressed as

[LAMBDA] = 1/N-1 [T.sup.T]T = 1/N-1 [R.sup.T][X.sup.T]XR = [R.sup.T][[SIGMA].sub.X] R. (7)

Similar to PCA model monitoring, the monitoring sample statistics can be constructed by using the covariance matrix of the above formula as follows:

[mathematical expression not reproducible]. (8)

The control limit of the residual statistic can be calculated as follows:

[[delta].sup.2.sub.[alpha]] = g[[chi square].sub.h,a], (9)

where g = S/2[micro]i, h = 2[[mu].sup.2]/S, [mu] represents the sample mean of residual statistic Q, S represents the sample variance of Q, and g[[chi square].sub.h,a] is the threshold of [chi square] variables with scale factor g and free degree h.

3. TPLS Monitoring Model

3.1. TPLS. PLS algorithm uses two variable spaces to describe process change. However, the main component of samples contains the part which is orthogonal to Y, and this part cannot reflect the variations related to Y. On the other hand, PLS decomposition structure makes the residual in X remain very large, which is not suitable to be monitored by index Q. Therefore, Li et al. proposed a kind of total projection algorithm [12], which is based on traditional PLS decomposition. The original latent variable space is decomposed into one subspace relevant to quality variables directly and another subspace orthogonal to quality variables. At the same time, the residual space is decomposed into subspaces with large variance and residual subspace containing noise only, using the PCA orthogonal projection technique.

By further decomposition, we can model X and Y as follows:

X = [X.sub.y] + [X.sub.o] + [X.sub.r] + [E.sub.r],

Y = [T.sub.y][Q.sup.T.sub.y] + F, (10)

where [X.sub.y] = [T.sub.y][P.sup.T.sub.y], [X.sub.o] = [T.sub.o][P.sup.T.sub.o], and [X.sub.r] = [T.sub.r][P.sup.T.sub.r]. [X.sub.y] stands for the part which is relevant to Y directly in [??], [X.sub.o] stands for the part which is orthogonal to Y in [??], and [X.sub.r] stands for the part with large variance component in E.

At the same time, based on the structure of PLS projection, Li et al. also performed a detailed analysis of the space structure of TPLS and drew a good conclusion [12]. Similar to PLS, TPLS also exhibits an oblique projection, but TPLS projects x to four different spaces, which reflect different relationship among quality variables.

For a new measurement of sample [x.sub.new], the corresponding score and residual part can be calculated as follows [12]:

[mathematical expression not reproducible]. (11)

Compared with PLS, TPLS model is easy to be explained and suitable for process monitoring. Similar to PLS in monitoring strategy, TPLS uses two statistic indexes [T.sup.2] and Q in process monitoring. In TPLS, [X.sub.y], [X.sub.o], and [X.sub.r] represent the main variation in the process, and thus they are suitable for [T.sup.2] statistic, and [E.sub.r] represents the residual part of the process which is

suitable to be monitored by using statistic Q.

3.2. The Covariance Description of TPLS. The four spaces in TPLS can get a more detailed description of the different relationships between X and quality variables Y. Based on the covariance matrix of the PLS model, the parameter matrices P, Q, and W will be obtained. Then, parameter matrix R is calculated by R = W[([P.sup.T]W).sup.-1].

Combining with the covariance description form of PCA and PLS model, we can do space decomposition in the following form.

In PCA decomposition of [??], characteristic vectors of the covariance matrix [[SIGMA].sub.[??]] are extracted to construct [Q.sub.y], [[SIGMA].sub.[??]] can be expressed as

[mathematical expression not reproducible]. (12)

Similarly, in PCA decomposition of [[??].sub.0] and E, we can extract characteristic vectors of each covariance matrix to form a loading matrix in corresponding space. Covariance matrices can be expressed as

[mathematical expression not reproducible], (13)

where

[mathematical expression not reproducible], (14)

According to the score and the residual structure model of new measurement samples, let

[mathematical expression not reproducible]. (15)

It can be easily proved that this form is equivalent to the standard one.

The following part shows the calculation process of TPLS model using covariance information.

Covariance Description Form of TPLS Algorithm. Obtain [[SIGMA].sub.X] and [[SIGMA].sub.XY]:

(1) Use GMM-PLS algorithm, and obtain parameter matrix: P =[[p.sub.1], ..., [p.sub.A]] [member of] [R.sup.m*A], W =[[w.sub.1], ..., [w.sub.A]][member of] [R.sup.m*A], Q = [[q.sub.1], ..., [q.sub.A]] [member of] [R.sup.l*A], R = W[([P.sup.T]W).sup.-1].

(2) Calculate PCA decomposition of [??]: do an eigenvalue decomposition on [[SIGMA].sub.[??]]; obtain the loading matrix [mathematical expression not reproducible] and principal component number: Ay = rank(Q).

(3) [mathematical expression not reproducible].

(4) Calculate PCA decomposition of X0: do an eigenvalue decomposition on [mathematical expression not reproducible]; obtain the loading matrix [mathematical expression not reproducible] and principal component number: [A.sub.o] = A - [A.sub.y].

(5) Calculate PCA decomposition of E = X - [??]: do an eigenvalue decomposition on [[SIGMA].sub.E]; obtain loading matrix [mathematical expression not reproducible] and principal component number [A.sub.r]: based on the PCA method.

4. Multimode Process Monitoring and Fault Diagnosis

4.1. Mode Division of Principal Components. According to industrial process data with the characters of multimode, we need to determine a mixed model based on historical data firstly and then design a monitoring framework. Considering covariance information required for the statistical model, multimode modeling data can be processed by GMM. It is the assumption that data are made up of different Gaussian distributions. That is, for any sample data x, it is possible to take a certain probability from K different Gaussian distributions. As a result, global probability distribution can be expressed by the mixed model of the K Gaussian elements. It can be expressed as

p(x | [theta]) = [K.summation over (i=1)], p(x | [[theta].sub.i]), (16)

where K is the number of mixture components, [w.sub.i] denotes the weight of the ith Gaussian component, and [[summation].sup.K.sub.i=1] [w.sub.i] = 1, [[theta].sub.i] = {[[mu].sub.i], [[SIGMA].sub.i]} represents the statistical parameters. Parameters estimation usually adopts EM iterative algorithm. The corresponding multivariate Gaussian density function for the ith component is given by

[mathematical expression not reproducible]. (17)

According to the rule of Bayes inference, the posterior probability of x belonging to the ith Gaussian component is

[mathematical expression not reproducible]. (18)

However, due to factors such as production flow, batch, and specification, the quality variables of the final products have some certain degree of difference in real production processes. It may be the root cause that process data is with multimode and multistage features. Therefore, considering that the PLS algorithm is with the space decomposition under guidance of quality variables, this paper first performs mode division with principal component space T and acquires the mode label [C.sub.k] of [t.sub.i]. This method carried out with the projection of training data can highlight the influence of quality variables better.

Based on advantages of GMM in processing multimode problems, we deal with principal components matrix T with GMM for acquiring [[summation].sup.K.sub.i=1] [w.sub.ti] = 1 and [[theta].sub.ti] = {[[mu].sub.ti], [[SIGMA].sub.ti]}. The total number of estimated parameters is K((1/2)[A.sup.2] + (3/2)A+1)-1, where A is the number of the principal components. Usually, A is far less than process variables number m, which can reduce the number of estimated parameters greatly and speed up the calculation.

After mode division, principal component space model based on GMM is established, where each Gauss component corresponds to different mode characteristics. For training samples, [x.sub.i] can be divided into the modes whose principal variable belongs to

[mathematical expression not reproducible]. (19)

Taking process variables x [member of] [R.sup.m] and output variables y [member of] R into account, we construct a new vector z which stands for the process information as follows:

[mathematical expression not reproducible]. (20)

Assuming that variable z is satisfied with mixed Gauss distribution, the distribution parameters [[theta].sub.i] = {[[mu].sub.i], [[SIGMA].sub.i]} can be acquired by mean(z) and cov(z) directly; prior probabilities [[omega].sub.i] are the same as principal space distribution [[omega].sub.ti].

[mathematical expression not reproducible]. (21)

Divide [[mu].sup.(k).sub.z] and [[SIGMA].sup.(k).sub.z] into the forms of [11]

[mathematical expression not reproducible]. (22)

As above, it can be noted that mode classification will be under the guidance of quality variables. Then, because the number of principal components is far less than that of process variables, this has a great advantage in the treatment of estimated parameters calculation. In addition, after the mode division of original training data, multimode information such as covariance matrices can be directly calculated, which reduce the amount of calculation and improve calculation accuracy.

4.2. Multimode TPLS Based Fault Detection. According to the principle of building PLS and TPLS, the essence is to use data information, variance, and covariance to represent process characteristics. As far as PLS is concerned, the modeling process is to maximize the covariance of linear combinations of process variables and quality variables, so the modeling process can be converted into a covariance form through the initial data X and Y. Therefore, in order to adapt to the multimode characteristic of industrial process data better, we can extend multivariate statistical methods to multimode scope by covariance strategy which will improve the performance of the monitoring model.

Based on the above analysis, we can make a rational division of training data to obtain the multimode information in the process of fault monitoring. When the sample is collected and is ready for being monitored, it can be divided into corresponding models with the probability, using Bayes classification ability under the data pretreatment. Then, we can calculate the monitoring statistic of the sample to justify which mode it belongs to. We treat the posterior probability of the monitoring sample belonging to each Gauss component as the membership degree of the corresponding model.

By using data information of probability [w.sub.i] and parameters [theta] to monitor the process, the comprehensive monitoring index is constructed, which can be used to monitor the fault reasonably.

For a new monitoring sample [x.sub.new] [member of] [R.sup.m], the probability of sample data belonging to different modes is P([[theta].sub.k] | [x.sub.new]).

4.3. Comprehensive Monitoring Index. According to PCA decomposition of [??] in TPLS, [T.sub.y] = [??][Q.sub.y], the covariance matrix of principal components in space [X.sub.y] can be expressed as [13]

[mathematical expression not reproducible]. (23)

Available by [??] = T[Q.sup.T] and T = XR,

[mathematical expression not reproducible]. (24)

In the same way, the covariance matrices of principal components in spaces [X.sub.o] and [X.sub.r] can be done as in the above proof:

[mathematical expression not reproducible]. (25)

In order to realize the multimode fault monitoring, the monitoring index based on the MTPLS model is obtained by using the probability information and Bayesian inference:

[mathematical expression not reproducible]. (26)

Similarly,

[mathematical expression not reproducible]. (27)

The threshold can be inferred by the setting in standard TPLS.

In summary, we make use of covariance information mainly to calculate and then to achieve process monitoring in MTPLS. Compared with standard TPLS, the covariance model is more suitable for monitoring multimode processes and making full use of data information in the process of model construction and fault monitoring. Avoiding direct classification on data, the covariance model reduces the effect of classification on the final performance monitoring of the process.

4.4. Quality-Related Combined Index. In TPLS based process monitoring, space [X.sub.y] represents the change part related to quality variable, while space [E.sub.r] represents the uncertain parts related to quality variable. They reflect two different kinds of quality-related faults. Therefore, it is necessary to observe two subspaces at the same time. In practice, a unified monitoring index is more popular than the two separate ones. In PCA based fault detection, Yue and Qin proposed a combined index [14]. Li et al. proposed a combined one for TPLS based process monitoring [12]. Similarly, a combined index which incorporates [T.sup.2.sub.y] and [Q.sub.r] is proposed in a way as follows:

[[phi].sub.y] = [T.sup.2.sub.y]/[[delta].sub.y] + [Q.sub.r]/[[delta].sub.rr] = [x.sup.T][PHI]x, (28)

where

[mathematical expression not reproducible]. (29)

[[zeta].sup.2] is the threshold of this combined index which can be obtained by approximate distribution [[zeta].sup.2] = g[[chi square].sub.h,a]. It is supposed that there is no fault in the process when the monitoring result is [[phi].sub.y] < [[zeta].sup.2].

Scale factor g and free degree h are calculated in

[mathematical expression not reproducible], (30)

where S = cov(x) = [[SIGMA].sub.X], which is the covariance matrix of process variable x. Using this combined index, we can simultaneously monitor the anomalies in the two subspaces and thus monitor the faults associated with the quality variables Y.

4.5. Gradient Contribution Rate for Fault Diagnosis. It is necessary to isolate the faulty variables after a fault is detected. As a common fault separation method, the contribution plot assumes that the variables which have greater contribution to the monitoring statistics are very likely to be faulty variables. According to the description framework of complete decomposition of contribution proposed by Alcala and Qin, contribution to the combined index can be described as the following form [15]:

[mathematical expression not reproducible], (31)

where [[gamma].sub.i] represents the ith row of matrix [[PHI].sup.1/2], [[xi].sub.i] represents the ith row of identity matrix, and m represents the number of variables in one sample.

Traditional contribution plot method is used for analyzing a specific sample when the fault is detected, which shows the contribution value of each variable to one monitoring index in bar chart. After that, the variables with greater contribution will be selected as the possible cause of fault. Westerhuis et al. put forward a generalized contribution to statistics form and a method of obtaining the control limits for variable contributions [16]. Choi et al. proposed specific statistical methods to set the upper limit of the variable contribution to the four monitoring statistics [7]. Li et al. proposed a kind of contribution plot based on TPLS, which describes the contribution of all variables to monitoring index [T.sup.2.sub.y] and [Q.sub.r] in a unified way [12].

For the fault diagnosis method based on traditional contribution figure for one single sample after fault occurrence, there are some flaws that cannot well describe fault source and the change of other malfunction variables caused by fault source. In order to combine the idea of analyzing the contribution rate of faulty variables along the time coordinates with the change of the variable itself, reducing the impact of variable magnitude of value on the contribution rate, we refer to the gradient contribution rate to solve the fault variable analysis.

First, we introduce a mathematical symbol [dot encircle] and a scale factor vector v = [[[v.sub.1], ..., [v.sub.m]].sup.T], where [dot encircle] indicates element product. x[dot encircle]v = [[[x.sub.1][v.sub.1], ..., x, [v.sub.m], ..., [x.sub.m][v.sub.m]].sup.T], and x, v, indicates the change of variable [x.sub.i]. As can be segmented, if [v.sub.i] > 1, then [absolute value of ([x.sub.i][v.sub.i])] > [absolute value of ([x.sub.i])]; if [v.sub.i] = 1, then [x.sub.i][v.sub.i] = [x.sub.i]; if [v.sub.i] < 1, then [absolute value of ([x.sub.i][v.sub.i])] < [absolute value of ([x.sub.i])]. So, equation [mathematical expression not reproducible] can be established.

It can be seen from the first-order Taylor series expansion of [phi](v [dot encircle] x) near v = [1.sub.m] that

[mathematical expression not reproducible]. (32)

Based on the above conclusion, the contribution rate may be defined as follows.

For a monitoring sample [mathematical expression not reproducible] indicates the contribution rate of the ith variable to index [phi].

As described above, the contribution rate represents the gradient of each variable to detection index under the same abnormal changes. Variables which are with great contribution will be considered with great influence to index [phi], the same to quality variable.

For a new monitoring sample [x.sub.new], the contribution rate of the ith variable can be calculated as

[mathematical expression not reproducible]. (33)

As a result, the gradient contribution rate based on comprehensive monitoring index [[phi].sub.y] can be expressed as follows:

[mathematical expression not reproducible], (34)

where [x.sub.new,i], represents the value of the ith variable in monitoring sample [x.sub.new].

Due to the diffusion effect of fault, the method of setting absolute control limits using absolute value of variable contribution for fault diagnosis is not with good effect. Therefore, we use relative contribution rate; namely,

[mathematical expression not reproducible], (35)

where relative contribution rate satisfies

[m.summation over (i=1)][C.sub.r] ([x.sub.new], i) = 1. (36)

As described above, in index [[phi].sub.y] based quality-related fault diagnosis, the contribution rate can reflect contribution gradients of variables to the monitoring index. Therefore, those variables which have a larger contribution rate are able to affect combined index and quality variables significantly.

4.6. Framework of Fault Detection and Diagnosis. The schematic diagram of the proposed process monitoring and diagnosis is shown in Figure 1. Detailed procedures for multimode process detection can be summarized below:

(1) Collect a set of historical training data under all possible operating modes and determine the number of modes.

(2) Use EM algorithm to learn the Gaussian mixture model of principal component space and estimate the model parameter set [[THETA].sub.T] based on the iterative steps.

(3) Do multimode division and multimode information acquisition of process data according to [C.sub.k]. Then, for each monitored sample [x.sub.new], compute its posterior probabilities belonging to all Gaussian components through Bayesian inference strategy.

(4) Calculate local monitoring statistics for the monitored sample [x.sub.new] within each Gaussian component and integrate them into the comprehensive index with probabilities.

(5) Integrate the quality-related monitoring statistics into a quality-related combined index [[phi].sub.y].

(6) Specify a confidence level (1 - [alpha]) 100% for determining control threshold [[zeta].sup.2], and generate the monitoring plot for all the monitored samples.

(7) Detect the abnormal operating condition at the monitored samples satisfying [[phi].sub.y] > [[zeta].sup.2] which is helpful for fault diagnosis.

(8) Calculate the relative contribution rate of variables to the combined index [[phi].sub.y] before and after fault occurrence and generate the contribution rate plot for fault diagnosis analysis.

5. Application to HSMP

5.1. Hot Strip Mill Process. HSMP (hot strip mill process) is an extremely complex industrial production process. In the process of production, improving the quality of products can bring about higher economic and social benefits for the factory. Typical HSM machine production line is mainly composed of reheating furnace, roughing mill, transfer table, crop shear, finishing mill group, run-out table cooling, and coiler. Figure 2 shows the whole process flow chart. The reheating furnace can ensure the temperature of the strip reaches 1200 degrees Celsius before roughing mill. A slab of thickness of 100-200 mm is sent to the roughing mill group after cutting off the scales, eventually forming 28-45 mm thick middle slab through several times of rolling. Through the transport of transfer table and in turn with insulation cover, crop shear, and high pressure water descaling, the slab runs into seven stands of finishing mill group. In order to enhance the performance of the final product, the steel plate needs to go through laminar cooling. This paper focuses on fault monitoring in the part of finishing mill process (FMP).

As shown in Figure 2, FMP consists of seven stands. Every stand contains two working rolls and backup rolls, which are driven by their own power drive units. The distance between two working rolls is called roll gap, which can be adjusted by the hydraulic device. A detailed structure diagram of the finishing roll is shown in Figure 3. This means that the strip will go through all the seven stands during the finishing mill process.

In whole FMP, it is noted that the stands are actually not working independently but are coupled with each other by different control schemes. The thickness in the exit of the last stand is the key factor which directly affects the quality of products. Whole finishing mill process is controlled by automatic thickness control system. It can be seen that there is an obvious hysteresis control of the exit thickness. Not until the abnormal value of the exit thickness is detected, caused by some fault of front stands, can the thickness control system be started. Therefore, establishing real-time acquisition of the relationship between the process variables and exit thickness and then monitoring the thickness by real-time measuring process variables become very meaningful [14].

5.2. Fault Detection Simulation Analysis. Production specification can be determined by different thicknesses of the steel strip in HSMP which should meet different industrial demands. We select the steel plate data of two specifications for modeling: one is the thickness of 2.70 mm and the other is 3.95 mm. The sampling interval for the variable is 0.01 s and 4000 samples are used for training modeling.

In the actual finishing mill process, we can collect the data information including roll gap, milling force between working rolls, and bending force in every stand. Generally speaking, the exit thickness has more relationships with roll gap and milling force than with bending force. Using data collected under normal operating conditions, GMM iterative learning is performed in principal component space which is under the guidance of quality variables. With the model division result, the process of multimode parameters calculation of the original data is carried out. Figure 4 shows the clustering distribution of two kinds of normal production. In this part, the clustering numbers are K1 = 3 and K2 = 5 which are fixed according to Yu et al. The K-means algorithm is applied to roughly calculate [[mu].sup.k.sub.i]. Randomly initialize the value of [[sigma].sup.k.sub.i] before GMM iterative learning. Then, we establish the proposed PLS-MTPLS model. Variables concerned in FMP are as follows.

Process Variables

[x.sub.1] ~ [x.sub.7]: average gap of [F.sub.i] stand, i = 1, ..., 7, [micro]m

[x.sub.8] ~ [x.sub.14]: the force between supporting and working roll in [F.sub.i] stand, i = 1, ..., 7, KN

[x.sub.15] ~ [x.sub.20]: the bending force in the working roll of [F.sub.i] stand, i = 1, ..., 7, KN

Quality Variable

y: thickness of strip at the exit of FMP, mm

For different types of faults that may occur in FMP, we select three encountered faults as a detected object in this section which are shown in Table 1.

According to the exit thickness value of the strip steel under three types of fault condition, it is obvious that fault 1 and fault 3 are quality-related, while fault 2 is quality-unrelated. As Figures 6 and 9 show, the method based on PLSMTPLS gives a higher fault detection rate for fault 1 and fault 3. And for the quality-unrelated fault 2, PLS-MTPLS inherits the effect of space division in traditional TPLS method, making the monitoring index [T.sub.y.sup.2] which is directly related to the quality have a relatively low rate of false alarm.

To examine the advantages of our proposed approach, a comparison research has been done using two evaluating indexes: FDR and FAR.

FAR

Number of samples ([[phi].sub.y] > [[zeta].sup.2] | quality is normal)/total fault - free samples, (37)

FDR

Number of samples ([[phi].sub.y] > [[zeta].sup.2] | quality is faulty)/total faulty samples

False detection rates and false alarm rates are counted for three types of fault and statistical results are shown in Table 2. It shows that PLS-MTPLS method performs better.

Fault 1 represents the failure of hydraulic roll gap control structure. Fault occurs at about 20 s, namely, the 2000th monitoring sample. The values of roll gap [x.sub.4] in the fourth stand are directly affected, and then the sampling values of milling force [x.sub.11] in the fourth stand have also been affected.

Because of the influence of feedback control system, roll gaps and milling forces will be changed in the following stands, and then finally the exit thickness is affected. As shown in Figure 5, there is a delay for change of exit thickness value,

namely, quality variables with respect to fault occurrence. But for fault detection results, as shown in Figure 6, there is almost no delay. From the point of view of this analysis, detection results can be a good reference for field staffs, in order to take response measurements timely.

Figure 7 gives the observation of change of relative contribution rate for fault 1. As is shown, we can clearly see that many contribution rate values of related variables have changed since the 2000th monitoring sample. When the fault is detected, according to the observation of relative contribution rate, variable [x.sub.4] which has the largest relative contribution rate is diagnosed firstly. As a result, we can conclude that roll gap of F4 stand is the source of fault. At the same time, variables [x.sub.11] and [x.sub.5] - [x.sub.7] are subsequently diagnosed which are affected by the fault source, thereby causing fault propagation. Figure 7 shows curves of variables change in real data of fault source variable and variables affected. The diagnostic analysis is in accordance with the actual production situation. As a result, the relative contribution rate can not only diagnose fault variables, but also show the order of fault variables transmission. Then, it can help in finding out the real source of fault with causal relationship among these variables.

Fault 2 represents the fault of sampling value of bending force in F5 stand, which is a kind of step transition. When the fault occurs, the value of variable [x.sub.18] will increase greatly. Then, with feedback regulation of automatic control system, the bending force value in F6 and F7 stands will be changed correspondingly. But this kind of fault will only cause the change of strip plate, not thickness as shown in Figure 8.

Fault 3 is a kind of fault in cooling valve between F2 and F3 stands, which is usual in the process of finishing mill. It will make the rolling force and roll gap in stands following F3 stand be changed. Based on the monitoring results of comprehensive index, fault 3 can be detected timely as shown in Figure 9. The change of relative contribution rate is shown in Figure 10. It can be noted that variable [x.sub.3] is affected firstly, followed by variable [x.sub.10] and others. From the above analysis, we can draw a conclusion that the fault diagnosis method based on the relative contribution rate can be applied to FMP effectively.

In this section, we focus on the research of exit thickness of the strip. Twenty variables among measured variables in FMP were selected for building PLS-MTPLS model. Based on it, a kind of comprehensive monitoring index and a kind of relative contribution rate were established for fault monitoring and diagnosis, respectively, for three common faults. Results of monitoring and diagnosis verified that PLS-MTPLS has higher FDR and lower FAR than traditional multivariate statistics methods shown in Table 2. In addition, compared with MTPLS which clusters with original data directly, this method is with better monitoring effects in statistics [T.sub.y.sup.2] of principal component which can be seen in Table 2.

6. Conclusion

In this paper, a new PLS-MTPLS method is proposed on the basis of covariance descriptions of PCA and PLS algorithm for multimode process monitoring. After mode division of quality-related principal components, multimode information is embedded into the monitoring model by integrating GMM with TPLS, which avoids the direct use of process training data for modeling. Based on the quality-related multimode monitoring model PLS-MTPLS, a kind of comprehensive monitoring index is applied to execute real-time online monitoring. Then, a combined index is constructed for improving monitoring efficiency and extended to fault diagnosis by relative gradient contribution rate calculation.

The efficiency and superiority of PLS-MTPLS are demonstrated through application to the monitoring of HSMP. As can be seen from the comparison and analysis, the proposed approach can reduce computational complexity and be more suitable for multimode processes.

https://doi.org/ 10.1155/2017/9560206

Competing Interests

The authors declare that there are no competing interests regarding the publication of this paper.

Acknowledgments

This workis supported by the National Natural Science Foundation of China (NSFC) under Grants 61473033 and 61673032 and by Beijing Natural Science Foundation (4142035), China.

References

[1] L. H. Chiang, R. D. Braatz, and E. L. Russell, Fault Detection and Diagnosis in Industrial Systems, Springer, London, UK, 2001.

[2] D.-H. Hwang and C. Han, "Real-time monitoring for a process with multiple operating modes," Control Engineering Practice, vol. 7, no. 7, pp. 891-902, 1999.

[3] S. Lane, E. B. Martin, R. Kooijmans, and A. J. Morris, "Performance monitoring of a multi-product semi-batch process," Journal of Process Control, vol. 11, no. 1, pp. 1-11, 2001.

[4] J. Chen and J. Liu, "Mixture principal component analysis models for process monitoring," Industrial and Engineering Chemistry Research, vol. 38, no. 4, pp. 1478-1488, 1999.

[5] S. J. Zhao, J. Zhang, and Y. M. Xu, "Monitoring of processes with multiple operating modes through multiple principle component analysis models," Industrial and Engineering Chemistry Research, vol. 43, no. 22, pp. 7025-7035, 2004.

[6] X.-T. Doan and R. Srinivasan, "Online monitoring of multiphase batch processes using phase-based multivariate statistical process control," Computers and Chemical Engineering, vol. 32, no. 1-2, pp. 230-243, 2008.

[7] S. W. Choi, J. H. Park, and I.-B. Lee, "Process monitoring using a Gaussian mixture model via principal component analysis and discriminant analysis," Computers and Chemical Engineering, vol. 28, no. 8, pp. 1377-1387, 2004.

[8] C. K. Yoo, K. Villez, I.-B. Lee, C. Rosen, and P. A. Vanrolleghem, "Multi-model statistical process monitoring and diagnosis of a sequencing batch reactor," Biotechnology and Bioengineering, vol. 96, no. 4, pp. 687-701, 2007.

[9] J. Yu and S. J. Qin, "Multiway gaussian mixture model based multiphase batch process monitoring," Industrial and Engineering Chemistry Research, vol. 48, no. 18, pp. 8585-8594, 2009.

[10] R. A. Johnson and D. W. Wichern, Applied Multivariate Statistical Analysis, Prentice-Hall, New York, NY, USA, 1998.

[11] K. Peng, K. Zhang, B. You, and J. Dong, "Quality-related prediction and monitoring of multi-mode processes using multiple PLS with application to an industrial hot strip mill," Neurocomputing, vol. 168, pp. 1094-1103, 2015.

[12] G. Li, S.-Z. Qin, Y.-D. Ji, and D.-H. Zhou, "Total PLS based contribution plots for fault diagnosis," Acta Automatica Sinica, vol. 35, no. 6, pp. 759-765, 2009.

[13] K. Zhang, H. Hao, Z. Chen, S. X. Ding, and K. X. Peng, "A comparison and evaluation of key performance indicator-based multivariate statistics process monitoring approaches," Journal of Process Control, vol. 33, pp. 112-126, 2015.

[14] H. H. Yue and S. J. Qin, "Reconstruction-based fault identification using a combined index," Industrial and Engineering Chemistry Research, vol. 40, no. 20, pp. 4403-4414, 2001.

[15] C. F. Alcala and S. J. Qin, "Reconstruction-based contribution for process monitoring," Automatica, vol. 45, no. 7, pp. 1593-1600, 2009.

[16] J. A. Westerhuis, S. P. Gurden, and A. K. Smilde, "Generalized contribution plots in multivariate statistical process monitoring," Chemometrics and Intelligent Laboratory Systems, vol. 51, no. 1, pp. 95-114, 2000.

Kaixiang Peng, Bingzheng Wang, and Jie Dong

Key Laboratory for Advanced Control of Iron and Steel Process, Ministry of Education, School of Automation and Electrical Engineering, University of Science and Technology Beijing, Beijing 100083, China

Correspondence should be addressed to Jie Dong; dongjie@ies.ustb.edu.cn

Received 22 December 2016; Accepted 13 February 2017; Published 12 March 2017

Academic Editor: Zhijie Zhou

Caption: FIGURE 1: Schematic diagram of the proposed PLS-MTPLS and Bayesian-based process monitoring and diagnosis method.

Caption: FIGURE 2: The schematic of HSMP.

Caption: FIGURE 3: The structure of mill stand.

Caption: FIGURE 4: Principal components clustering distribution where = 3 and K2 = 5 (pc: principal component).

Caption: FIGURE 5: Curve of fault source [x.sub.4] and affected variables.

Caption: FIGURE 6: Detection result of fault 1.

Caption: FIGURE 7: Diagnosis result of fault 1.

Caption: FIGURE 8: Detection result of fault 2.

Caption: FIGURE 9: Detection result of fault 3.

Caption: FIGURE 10: Diagnosis result of fault 3.

TABLE 1: Faults that occurred in FMP. Fault number Fault description Duration Type 1 Malfunction of gap 20 s-30 s Quality-related control loop in F4 stand 2 Fault of roll 10 s-20 s Quality-unrelated bending force measuring sensor in F5 stand 3 10% stiction of the 10 s-20 s Quality-related cooling valve between F2 and F3 stands TABLE 2: Detection performance comparison. Fault number Type MPLS TPLS ([T.sup.2]) ([[phi].sub.y]) 1 FDR 0.7968 0.8977 2 FAR 0.3357 0.4335 3 FDR 0.7780 0.8510 Fault number MTPLS MTPLS ([T.sub.y.sup.2]) ([[phi].sub.y]) 1 0.8620 0.9995 2 0.1335 0.2610 3 0.8848 0.9729 Fault number PLS-MTPLS PLS-MTPLS ([T.sub.y.sup.2]) ([[phi].sub.y]) 1 0.9435 0.9995 2 0.0080 0.2605 3 0.9092 0.9735

Printer friendly Cite/link Email Feedback | |

Title Annotation: | Research Article |
---|---|

Author: | Peng, Kaixiang; Wang, Bingzheng; Dong, Jie |

Publication: | Journal of Control Science and Engineering |

Date: | Jan 1, 2017 |

Words: | 7124 |

Previous Article: | Thermal Model Parameter Identification of a Lithium Battery. |

Next Article: | Synchronization of Two Fractional-Order Chaotic Systems via Nonsingular Terminal Fuzzy Sliding Mode Control. |