# Power Cell SOC Modelling for Intelligent Virtual Sensor Implementation.

1. IntroductionAn electrical energy storage system is the most frequently used technology to solve the problem of intermittent energy production in renewable energy systems [1]. In renewable energy systems, the electric energy storage is used as a method of storing the energy when production is higher than demand and returning the energy when demand is higher than production. Also, these systems are used to supply portable devices or replace the fossil fuels in vehicles, such as in electric vehicles [1].

Nowadays, an increasing number of studies aim to improve electrical energy storage systems. Some research is focused on dealing with intermittent energy generation in renewable energy such as solar and wind power systems, proposing to implement those methods in Smart Grid systems [2]. Furthermore, the increasing use of portable devices like tablets or smartphones, which improve quality of life, demands higher autonomy, less charging time, or less weight at the energy storage components [3].

On the other hand, the electric vehicle forces the development and improvement of storage systems. Currently, a noticeable problem of the electric vehicle is the driving range, which is related to the storage system. Although the internal combustion engine is inefficient, it does at least provide long driving range due to the high energy density of fossil fuels [4].

This paper is focused on battery storage systems and especially on a Lithium-Ion (Li-Ion) power cell, which is commonly utilized in electric vehicles. The frequent use of this type of cells makes it possible to identify different characteristics, such as ageing forecasts, charging, and discharging behavior. This information is very important to model the power cells properly [5].

Li-Ion power cells are good candidates for electromobility and the renewable energy industry. The reasons why this battery is suitable for electric drive vehicles are its high power density, high voltage of the cell, low self-discharge, no memory effect, and long-life cycle [6]. Also, the Li-Ion batteries can be found in different chemistries and each one has its specific characteristics, such as energy cells or power cells, fitting the different requirements of the vehicle industry [7].

Due to the extensive use of these batteries in portable devices as well as in electric vehicles and the need for safety, the failure analysis and modelling of these batteries are very important [8].

Multiple Regression Analysis (MRA) methods are based on the traditional regression models [9]. These methods are used in several applications in different subjects due to their versatility [10-12]. References [10, 13] show a cost forecast in the early stage of projects, while [11, 14] evaluate the suppliers performance. The MRA methods are limited in some cases, like in [9] and in [15]. In these articles, MRA techniques were used, but due to the nonlinearity of the systems, a different approach was used. However, regression techniques based on soft computing can solve the problems of nonlinear systems, as was done in [16, 17], where meta-classifiers were applied in a predictive control system with the aim of predicting the state of the model. In [18, 19], artificial neural networks were combined with multiregression in order to optimize overbreak prediction. As shown in [20], failures in wind turbines can be detected and predicted using intelligent model techniques.

Although some regression problems are addressed with the new methods mentioned above, in several cases it is difficult to obtain accurate models, for various reasons. For instance, a model based on a highly nonlinear system can lead to a bad performance. Dividing the initial dataset into different groups before carrying out the regression could be a complementary solution [21, 22]. With this objective in mind, the clustering algorithm K-means is commonly used [23-25]. This method consists of splitting up the entire dataset into a number of clusters. Each piece of data will belong to a particular cluster, depending on its characteristics. After this step, the regression technique is made over each cluster. Prior researches like [26-29] solve other physical problems by using similar techniques.

This study develops an intelligent model that is able to predict the value of different system variables during a power cell test. Several clustering techniques were considered for grouping the data with similar behavior to implement the model. For each cluster, the three aforementioned regression methods were tested. The technique selection follows the minimum mean squared error criteria.

The present document is organized as follows. First, an introduction is described. Then, the case of study explains the test carried out and the process followed for obtaining the dataset. After the case study, the tested algorithms and the model approach considered in the study are presented. The obtained hybrid model and its configuration are shown in the Results. Finally, in the last section, future works and conclusions are presented.

2. Case Study

This research presents the SOC model of a LFP power cell used in electric vehicle battery systems. This power cell was tested as shown in Figure 1, obtaining different measurements.

The test carried out and power cell are explained in the subsections in more detail.

2.1. The Battery. One possible method of storing electric energy is by means of batteries. These devices store energy within an electrochemical medium, capable of storing and then supplying electrical energy due to electrochemical reactions [1]. Batteries are composed of two electrodes and an electrolyte, where a redox reaction happens. During the discharging process, the cathode is reduced while the anode is oxidized, producing a current flow [1]. During the charging process, the reaction occurs in reverse order. The number of times that the cycle can be repeated is determined by the use of the battery, up to the point at which the capacity decreases to unusable levels [30].

The most common batteries used in portable devices are the Lithium-Ion batteries. They are widely used in electronic devices and, more recently, in electric vehicles [31]. The Li-Ion cells are characterized by their high energy density, due to the high voltage they can produce in comparison with other battery types, while weighing less [1]. They also show a long-life cycle, low self-discharge, and no memory effect [30, 31].

2.2. Capacity Confirmation Test. The capacity confirmation test, which was performed on the battery, measures the capacity of the battery in ampere-hour [32]. The measurement of the capacity was carried out at constant current. The test starts with the battery at full charge, so the SOC is at its maximum. Then, the battery starts to supply constant current until it reaches the discharge voltage, which is determined by the manufacturer [32]. At this point, the battery rests till the voltage is recovered, so the charging process can begin. Once the voltage has recovered to a certain value, the battery is again charged to its maximum voltage, also specified by the manufacturer [32]. These processes are performed with constant current and the present SOC is calculated at each instant.

A battery tester was used for the capacity confirmation test. This device is able to charge and discharge the cell within the test parameters, in this case using a constant current and making the corresponding measurements. The measurements made are the battery voltage, the current delivered or absorbed by the battery, the temperature at two different points, and how long the test lasts.

Figure 1 shows the test scheme. This figure also shows the different measurement devices used and where they are placed: ammeter (A), voltmeter (V), and temperature sensors (T1 and T2). Given the fact that the temperature of the Li-Ion batteries is uniform, the sensors location does not affect theoretically the temperature measured. Two redundant temperature sensors are used to detect wrong measurements.

The testing device, which supplies or absorbs the current, is represented as a current source i(t).

During this research, the power cell used was a LiFeBATT X-1P [33], which is a LiFePO4 cell. It is characterized by a nominal capacity of 8 Ah and nominal voltage of 3.3 V.

Four different steps can be distinguished in the test (Figure 2):

(1) Charge: the initial voltage is 3 V and it is increased up to 3.65 V due to the constant current flowing from the tester to the battery.

(2) Rest after the charging process: there is no current flowing. During this step, the battery voltage reaches 3.3 V, which is the nominal value.

(3) Discharge: the 3.3 V achieved in the previous step is decreased to 2 V due to the constant current flowing from the battery to the tester.

(4) Rest after the discharging: there is no current flowing. The voltage of the battery goes from 2 V to 3 V. A new cycle is ready to be started again.

Figure 2 shows the voltage and current variation measured in one cycle. During the charging, the current is flowing from the tester to the battery, so it is considered positive. On the other hand, when the current flows from the battery to the tester, it is considered negative.

The energy provided or absorbed during the process can be obtained by integrating the current over time, obtaining the energy in ampere-hour. The representation of energy, at the top of Figure 3, shows how energy in the battery increases during the charging process and decreases during the discharging process. The representation of energy is shown as the SOC of the battery, which ranges from 100% with the battery fully charged, up to 0% with the battery completely discharged.

The two temperature sensors measure the temperature while the test is running. The measurement is made at two different locations. Their values also vary cyclically according to the state of the battery, depending on whether it is charging, discharging, or resting and also depending on the battery voltage.

The bottom of Figure 3 shows how the temperatures evolve in each operating region.

The full dataset used in this research has been obtained by testing a power cell, as explained before. The battery state is analyzed using the different parameters measured. Current and time are used to calculate the SOC of the battery at every moment. Furthermore, the temperatures were measured in order to detect a battery failure during the test. If values are far outside a typical range, a malfunction can be detected. All the data measured throughout the capacity battery test was labelled with the aim of determining the SOC.

3. Model Approach

The model approach is defined by the scheme shown in Figure 4. Taking into consideration the magnitudes measured during the capacity battery test and how the system behaves, the dataset can be split into several operation ranges. Thus, different clusters are obtained and regression techniques are applied for each one. A global model scheme is presented in Figure 4. The two inputs of this model are the voltage and current, and its output is the SOC. The cluster selector links the selected model with the output. Each cluster represents the best model obtained. The criterion used to assign input data to a specific cluster is the minimum Euclidean distance between the cluster centroids and the input data.

The model is obtained as per the diagram of Figure 5. The figure shows only the data division for testing and training. However, a cross-validation technique (k-fold) was implemented to process the dataset with the aim of achieving better results. The validation of the k-fold model gives a more real performance and a better generalization [34].

3.1. Techniques Used. In this research, different techniques were used in order to accomplish the best model. These techniques are described below.

3.1.1. Clustering

Principal Component Analysis. PCA is created by Pearson [35] and Hotelling [36]. This statistical technique describes the variation of multivariate data in terms of variables that are not correlated. Uncorrelated variables can be expressed as a linear combination of the initial variables. The aim of this technique is to derive new uncorrelated variables as a linear combination of the initial variables. This method may help to reduce the number of variables that represent the data.

Data Clustering: The K-Means Algorithm. Clustering is technique of unsupervised data grouping used to measure similarity [37, 38]. The main aim of the data clustering technique is to organize unlabelled vectors into different groups. Thus, all similar data will belong to the same cluster [39]. The K-means algorithm is frequently used to divide datasets into clusters. This algorithm organizes the samples by minimizing the function shown in (1), which represents the squared error function.

[mathematical expression not reproducible] (1)

Each cluster is set around a centroid, so the final clustering will depend on the amount of clusters and on the initial position of their centroids. The selection of the number of clusters is very important and requires an initial study of the data composition. This value is extremely hard to estimate. The partitional algorithm K-means performs well when the data is near its cluster and if each cluster is hyperspherical in shape and sufficiently separated in hyperspace.

Neural PCA and Maximum Likelihood Hebbian Learning (MLHL). As mentioned above, PCA is a statistical compressing data method. The best linear compression of the data can be achieved in terms of LMSE. Some techniques of artificial neural networks (ANN) can be used to carry out PCA, for example, [40, 41].

A PCA with a negative feedback network [40] can be used, according to the following definition. Given N x 1 input matrix x and M x 1 output matrix y, a weight matrix [W.sub.ij] links each output i with the input j, [eta] being a learning rate. At the beginning of the training process, the network does not have activation. The weights from the input layer neurons to the output layer neurons feed back the input data. The output neuron activation value is implemented using a linear sum. It can be expressed as shown in

[y.sub.i] = [N.summation over (j=1)][W.sub.ij] [x.sub.j], [[for all].sub.i]. (2)

A feedback of the activation function is performed with the same weights. Then it is subtracted from the inputs and the inhibition is performed:

[e.sub.j] = [x.sub.j] - [N.summation over (i=1)][W.sub.ij] [y.sub.i] [[for all].sub.j]. (3)

Then, simple Hebbian learning is carried out between the outputs and the inputs, as shown in

[DELTA][W.sub.ij] = [eta][e.sub.j][y.sub.i]. (4)

The network learning stability is a consequence of the negative feedback. In a similar way to Oja's Subspace algorithm [41], this network can find the main components of the input data [40]. Therefore, the weights will find a subspace basis spanned by the current principal components whereas they will not find the theses components. MLHL [42] is described following the learning rules shown in the following equations: a feedforward step (2), a feedback step (3), and finally, a weight modification, which is presented in

[DELTA][W.sub.ij] = [eta][y.sub.i] sign ([e.sub.j])[[absolute value of ([e.sub.j])].sup.p-1]. (5)

MLHL [42] is connected to the statistical standard method Exploratory Projection Pursuit (EPP) method [43].

3.1.2. Regression Techniques

Polynomial Regression. A model obtained with polynomial regression [34] can be described as a linear combination of basis functions. The number of inputs and the polynomial degree will affect the number of basis functions.

With a first-degree polynomial, the linear summation performed can be described as shown in (6) (for two inputs). If the polynomial degree increases, the model obtained will become more complex. A second-degree polynomial is shown in (7).

f(x) = [b.sub.0] +[b.sub.i][y.sub.i] +[b.sub.2][y.sub.2] (6)

f(x) = [b.sub.0] + [b.sub.1][y.sub.1] + [b.sub.2] +[y.sub.2] + [b.sub.3][y.sub.1][y.sub.2] + [b.sub.4] [y.sup.2.sub.1] + [b.sub.5][y.sup.2.sub.2]. (7)

Multilayer Perceptron (MLP). The Multilayer Perceptron can be defined as a feedforward artificial neural network (ANN) [44,45]. Its robustness and relative simplicity make this ANN the one most frequently used. Despite these advantages, the good performance of this technique depends heavily on the ANN architecture selected. The MLP architecture has three different layers: the input, the output, and one or more hidden ones.

A layer has a selected quantity of neurons, with a specific activation function. Commonly, the same activation function is used in all neurons of the same layer. Activation functions can be of three different types: linear, step, tan-sigmoid, or log-sigmoid. In MLP, the neurons have weighted links between layers.

The MPL output can be defined as in the following equation [46].

[f.sub.[??]](x) = [xi] + [j.summation over (i=1)], [a.sub.1][phi]([v.sup.T.sub.i] y + [b.sub.i]), (8)

where y = [(y(1), ..., y(d)).sup.T] [member of] [R.sup.d] is the inputs vector; j is the hidden layers quantity; [phi] is the transfer function bounded; [??] = ([xi], [a.sub.1], ..., [a.sub.k], [b.sub.1], ..., [b.sub.k], [v.sub.11], ..., [v.sub.kd]) is the model parameter vector; [v.sub.i] = [([v.sub.i1], ..., [v.sub.id]).sup.T] [member of] [R.sup.d] is the hidden unit i parameter vector.

Least Square Support Vector Regression (LS-SVR). Support vector regression (SVR) is an adaptation of the support vector machines (SVM) classification algorithm. The principal aim of SVR is to plot the data in a high-dimensional space F using nonlinear mapping. Linear regression is then applied in the space F [47, 48].

The Least Square formulation of SVM, known as LS-SVM, is an approximation of the solution of a system. Its behavior is very similar to SVM in terms of general performance [49, 50]. The LS-SVR is the use of LS-SVM for regression [51, 52]. In LS-SVR, a typical squared loss function replaces an insensitive loss function. This squared loss function set up the Lagrangian matrix by solving a linear Karush-Kuhn-Tucker equation:

[mathematical expression not reproducible], (9)

where [i.sub.n] is the n ones vector; T is the matrix or vector means transpose; [gamma] is the weight vector; a is the regression vector; [a.sub.0] is the model offset.

To implement this technique two parameters are required: [gamma] and [sigma], where a is the kernel width [53].

3.2. Preprocessing the Dataset. The dataset was obtained during the described capacity confirmation test performed on the power cell. The State of Charge was determined by the current and the time for each test. Voltage and current were monitored to determine the SOC. As explained in previous sections, temperatures were measured at two points of the battery in order to find malfunctions.

The SOC parameter of the battery should go sequentially from 0% to 100% and then from 100% to 0%. All incomplete cycles which did not fulfill this condition were discarded. After the data filtering, only completed cycles were taken into consideration for the implementation of the model. The measured data was registered with a sample of 1 Hertz. After this discard, the dataset for implementing the model was reduced from 18131 to 16370 samples.

The cycle consists of four different states: charge (1), rest (2), discharge (3), and rest (4). During the test, all data measured was labelled with the current state of the power cell.

Before developing the regression models, clustering algorithms were applied to the dataset. For this purpose, an analysis with MHLH and PCA was performed. The analysis showed that four different groups of samples could be appreciated, as Figures 6 and 7 show. Following the criteria suggested by PCA and MHLH, K-means algorithm was applied to create the four clusters. Each group represents the four different states of the battery during the test.

The number of samples assigned to each cluster is shown in Table 1. It is remarkable that the cluster distribution corresponds to the four steps of the capacity confirmation test mentioned in Section 2.2. Polynomial regression, MLP-ANN, and LS-SVR were the three techniques trained for the clusters.

4. Results

The real state of the cell labelled during the test was compared with the data distribution obtained as a result of the clustering. The dataset has the requisite characteristics to be applied by K-means and the clusters accomplished had 100% correct assignation. Due to the good performance achieved, the model approach is not required to know the state of the cycle.

The MLP-ANN algorithm was implemented with different configurations; the number of hidden layer neurons changed from 2 to 15, while only one hidden layer was used. All hidden layer neurons shared the same activation function (tan-sigmoid). A linear activation function was used as an output layer neuron. The Levenberg-Marquardt algorithm was applied on the training process. Learning algorithm was based on the gradient decrease, and MSE was set as the criterion.

The LS-SVR regression algorithm was trained with the autotuning implemented in the Matlab toolbox developed by KU Leuven-ESAT-SCD. The chosen model kernel was Radial Basis Function (RBF) and "Function Estimation" the kind selected to make the regression. The cost-criterion is "leaveoneoutlssvm," with an MSE performance function and an optimization function type "simplex."

The polynomial regression technique was applied with varying polynomial order from 1 to 10.

The Mean Square Error (MSE) value is used to compare the accuracy of the models. The lowest MSEs obtained for each technique are shown in Table 2.

In Table 3, the best regression results are shown, as well as the technique applied to achieve them.

The technique selection takes into consideration the computational cost when the MSEs achieved for two or more techniques are similar.

4.1. Statistical Analysis. Here, the results of the statistical analysis are shown with the aim of determining the accuracy of the selected models compared with that of those nonselected.

For each cluster, the best model selected is compared statistically with the others, in terms of MSE, to determine if there is a difference between their MSEs.

Figure 8 shows the boxplot for each cluster using ANN-MLP, polynomial regression, and LS-SVR, while the results of the statistical analysis using a t-test analysis with a 5% [alpha]-level are shown in Table 4.

Based on the results of Table 4, the following may be concluded:

(i) Cluster 1: there is no statistical difference between ANN-MLP, 10 neurons, and LS-SVR.

(ii) Cluster 2: there is no statistical difference between any of the three models.

(iii) Cluster 3: there is no statistical difference between ANN-MLP, 10 neurons, and LS-SVR.

(iv) Cluster 4: there is no statistical difference between any of the three models.

In any of the previous cases, the difference between the MSEs, in the best case, is lower than 0.0052 so it can be considered that the model selected does not significantly affect the results.

5. Conclusions and Future Works

The approach used in this study presents remarkable results. The MSE average is 0.1815, varying from 0.0865 to 0.4767, depending on the variable and the cycle state.

The State of Charge value for the confirmation of the battery capacity can be predicted in real time. The model obtained is capable of checking that the power cell test is functioning correctly when it offers wrong results.

The overall efficiency of this novel approach is increased if a hybrid model is implemented. This way, each local model was obtained by training one group of the initial dataset. Better results were achieved by employing MLPs and LS-SVR. However, the difference between the two, in terms of MSE, is not statistically great, so both models may be used in any of the clusters.

The temperature measured in the cell is highly important parameter relating to health of the device. Using the temperatures measured in the tests, a study and a modelling of these parameters will be undertaken to detect deviations in behavior. This new model would be used with the one-class classifier algorithm in order to detect any faults in this type of systems.

https://doi.org/10.1155/2017/9640546

Conflicts of Interest

The authors declare that there are no conflicts of interest regarding the publication of this paper.

References

[1] C. Chukwuka and K. A. Folly, "Batteries and super-capacitors," in Proceedings of the IEEE Power and Energy Society Conference and Exposition in Africa: Intelligent Grid Integration of Renewable Energy Resources, PowerAfrica 2012, pp. 1-6, July 2012.

[2] H. Qian, J. Zhang, and J.-S. Lai, "A grid-tie battery energy storage system," in Proceedings of the 2010 IEEE 12th Workshop on Control and Modeling for Power Electronics, COMPEL 2010, pp. 1-5, June 2010.

[3] N. Chaturvedi, R. Klein, J. Christensen, J. Ahmed, and A. Kojic, "Modeling, estimation, and control challenges for lithium-ion batteries," in Proceedings of the American Control Conference (ACC), pp. 1997-2002, 2010.

[4] S. Vukosavic, Electrical Machines, Power Electronics and Power Systems, Springer, 2012.

[5] O. Konig, S. Jakubek, and G. Prochart, "Battery impedance emulation for hybrid and electric powertrain testing," in Proceedings of the 2012 IEEE Vehicle Power and Propulsion Conference, VPPC 2012, pp. 627-632, October 2012.

[6] N. H. F. Ismail and S. F. Toha, "State of charge estimation of a Lithium-ion battery for electric vehicle based on particle swarm optimization," in Proceedings of the 2013 IEEE International Conference on Smart Instrumentation, Measurement and Applications, ICSIMA 2013, pp. 1-4, November 2013.

[7] M. Swierczynski, D.-I. Stroe, A.-I. Stan, R. Teodorescu, and S. K. Kaer, "Lifetime estimation of the nanophosphate lifepo4 battery chemistry used in fully electric vehicles," IEEE Transactions on Industry Applications, vol. 51, no. 4, pp. 3453-3461, 2015.

[8] J. C. Alvarez Anten, P. J. Garcia Nieto, F. J. de Cos Juez, F. S. Lasheras, C. B. Viejo, and N. Roqueni Gutierrez, "Battery state-of-charge estimator using the MARS technique," IEEE Transactions on Power Electronics, vol. 28, no. 8, pp. 3798-3805, 2013.

[9] C. C. Turrado, M. D. C. M. Lepez, F. S. Lasheras, B. A. R. Gemez, J. L. C. Rolle, and F. J. D. C. Juez, "Missing data imputation of solar radiation data under different atmospheric conditions," Sensors, vol. 14, no. 11, pp. 20382-20399, 2014.

[10] R. Jin, K. Cho, C. Hyun, and M. Son, "MRA-based revised CBR model for cost prediction in the early stage of construction projects," Expert Systems with Applications, vol. 39, no. 5, pp. 5214-5222, 2012.

[11] L.-H. Ho, S.-Y. Feng, Y.-C. Lee, and T.-M. Yen, "Using modified IPA to evaluate supplier's performance: multiple regression analysis and DEMATEL approach," Expert Systems with Applications, vol. 39, no. 8, pp. 7102-7109, 2012.

[12] J. Osborn, D. Guzman, F. J. D. C. Juez et al., "Open-loop tomography with artificial neural networks on CANARY on-sky results," Monthly Notices of the Royal Astronomical Society, vol. 441, no. 3, pp. 2508-2514, 2014.

[13] P. J. Garcia Nieto, J. R. Alonso Fernandez, F. Sanchez Lasheras, F. J. de Cos Juez, and C. Diaz Mumz, "A new improved study of cyanotoxins presence from experimental cyanobacteria concentrations in the Trasona reservoir (Northern Spain) using the MARS technique," Science of the Total Environment, vol. 430, pp. 88-92, 2012.

[14] G. Florez, P. A. Saiz, P. Garcia-Portilla et al., "Predictors of posttreatment drinking outcomes in patients with alcohol dependence," European Addiction Research, vol. 21, no. 1, pp. 19-30, 2015.

[15] Y. Cho and H. B. Awbi, "A study of the effect of heat source location in a ventilated room using multiple regression analysis," Building and Environment, vol. 42, no. 5, pp. 2072-2082, 2007.

[16] J. Nieves, I. Santos, and P. G. Bringas, "Enhancing the prediction stage of a model predictive control systems through meta-classifiers," Dyna, vol. 88, no. 3, pp. 290-298, 2013.

[17] A. S. Sonchez, P. R. Fernondez, F. S. Lasheras, F. J. de Cos Juez, and P. J. Nieto, "Prediction of work-related accidents according to working conditions using support vector machines," Applied Mathematics and Computation, vol. 218, no. 7, pp. 3539-3552, 2011.

[18] H. Jang and E. Topal, "Optimizing overbreak prediction based on geological parameters comparing multiple regression analysis and artificial neural network," Tunnelling and Underground Space Technology, vol. 38, pp. 161-169, 2013.

[19] F. Sonchez Lasheras, F. J. de Cos Juez, A. Suorez Sonchez, A. Krzemien, and P. Riesgo Fernandez, "Forecasting the COMEX copper spot price by means of neural networks and ARIMA models," Resources Policy, vol. 45, pp. 37-43, 2015.

[20] A. Alvarez-Huerta, R. Gonzalez-Miguelez, D. Garcia-Metola, and A. Noriega-Gonzalez, "Failure detection and prediction in wind turbines by using scada data," Dyna, vol. 86, no. 4, pp. 467-473, 2011.

[21] D. Martinez-Rego, O. Fontenla-Romero, and A. Alonso-Betanzos, "Efficiency of local models ensembles for time series prediction," Expert Systems with Applications, vol. 38, no. 6, pp. 6884-6894, 2011.

[22] A. Suorez Sonchez, F. J. Iglesias-Rodriguez, P. Riesgo Fernandez, and F. J. de Cos Juez, "Applying the K-nearest neighbor technique to the classification of workers according to their risk of suffering musculoskeletal disorders," International Journal of Industrial Ergonomics, vol. 52, pp. 92-99, 2014.

[23] A. Cherif, H. Cardot, and R. Bono, "SOM time series clustering and prediction with recurrent neural networks," Neurocomputing, vol. 74, no. 11, pp. 1936-1944, 2011.

[24] M. H. Ghaseminezhad and A. Karami, "A novel self-organizing map (SOM) neural network for discrete groups of data clustering," Applied Soft Computing Journal, vol. 11, no. 4, pp. 3771-3778, 2011.

[25] J.-L. Casteleiro-Roca, J. L. Calvo-Rolle, J. A. M. Perez, N. R. Gutiorrez, and F. J. de Cos Juez, "Hybrid intelligent system to perform fault detection on BIS sensor during surgeries," Sensors, vol. 17, no. 1, article 179, 2017.

[26] M. J. Crespo-Ramos, I. Machon-Gonzolez, H. Lopez-Garcia, and J. L. Calvo-Rolle, "Detection of locally relevant variables using SOM-NG algorithm," Engineering Applications of Artificial Intelligence, vol. 26, no. 8, pp. 1992-2000, 2013.

[27] J. L. Calvo-Rolle, J. L. Casteleiro-Roca, H. Quintion, and M. Del Carmen Meizoso-Lopez, "A hybrid intelligent system for PID controller using in a steel rolling process," Expert Systems with Applications, vol. 40, no. 13, pp. 5188-5196, 2013.

[28] I. Machon-Gonzolez, H. Lopez-Garcia, and J. L. Calvo-Rolle, "Neuro-robust controller for non-linear systems," Dyna, vol. 86, no. 3, pp. 308-317, 2011.

[29] J. R. Alonso Fernandez, C. Diaz Muniz, P. J. Garcia Nieto, F. J. de Cos Juez, F. Sonchez Lasheras, and M. N. Roquem, "Forecasting the cyanotoxins presence in fresh waters: a new model based on genetic algorithms combined with the MARS technique," Ecological Engineering, vol. 53, pp. 68-78, 2013.

[30] Wikipedia, Lithium-ion battery-wikipedia, the free encyclopedia, [Online; accessed 30-January-2014] (2014). http://en.wiki-pedia.org/w/index.php?tifie=Uthium-ion_batteryoldid=592970854.

[31] A. R. Sparacino, G. F. Reed, R. J. Kerestes, B. M. Grainger, and Z. T. Smith, "Survey of battery energy storage systems and modeling techniques," in Proceedings of the 2012 IEEE Power and Energy Society General Meeting, PES 2012, pp. 1-8, July 2012.

[32] PNGV Battery Test Manual (2 2001).

[33] LiFeBATT X-1P 8Ah 38123 Cell (3 2011).

[34] C. M. Bishop, Pattern Recognition and Machine Learning (Information Science and Statistics), Springer, New York, NY, USA, 2006.

[35] K. Pearson, "On lines and planes of closest fit to systems of points in space," Philosophical Magazine, vol. 2, no. 6, pp. 559-572, 1901.

[36] H. Hotelling, "Analysis of a complex of statistical variables into principal components," Journal of Educational Psychology, vol. 24, no. 6, pp. 417-441, 1933.

[37] A. K. Qin and P. N. Suganthan, "Enhanced neural gas network for prototype-based clustering," Pattern Recognition, vol. 38, no. 8, pp. 1275-1288, 2005.

[38] J. Ye and T. Xiong, "SVM versus Least Squares SVM," Journal of Machine Learning Research, vol. 2, pp. 644-651, 2007.

[39] S. Kaski, J. Sinkkonen, and A. Klami, "Discriminative clustering," Neurocomputing, vol. 69, no. 1-3, pp. 18-41, 2005.

[40] C. Fyfe, "A neural network for PCA and beyond," Neural Processing Letters, vol. 6, no. 1-2, pp. 33-41, 1997.

[41] E. Oja, "Neural networks, principal components, and subspaces," International Journal of Neural Systems, vol. 01, no. 01, pp. 61-68, 1989.

[42] M. Sagsan, I. T. Medeni, and T. D. Medeni, "Knowledge Management Paradigms: Implementation through individual fuzzy-based Education," in Proceedings of the 12th International Conference on Application of Fuzzy Systems and Soft Computing, ICAFS 2016, pp. 259-266, Vienna, Austria, August 2016.

[43] J. H. Friedman and J. W. Tukey, "A projection pursuit algorithm for exploratory data analysis," IEEE Transactions on Computers, vol. 23, no. 9, pp. 881-890, 1974.

[44] P. Wasserman, Advanced Methods in Neural Computing, John Wiley & Sons, New York, NY, USA, 1st edition, 1993.

[45] Z. Zeng and J. Wang, Advances in Neural Network Research And Applications, Springer Publishing Company, New York City, NY, USA, 1st edition, 2010.

[46] J. Rynkiewicz, "General bound of overfitting for MLP regression models," Neurocomputing, vol. 90, pp. 106-110, 2012.

[47] N. Cristiamm and J. Shawe-Taylor, An Introduction to Support Vector Machines, Cambridge University Press, New York, NY, USA, 2000.

[48] I. Steinwart and A. Christmann, Support Vector Machines, Springer Publishing Company, New York, NY, USA, 1st edition, 2008.

[49] J. A. K. Suykens and J. Vandewalle, "Least squares support vector machine classifiers," Neural Processing Letters, vol. 9, no. 3, pp. 293-300, 1999.

[50] R. Wang, A. Wang, and Q. Song, "Research on the Alkalinity of Sintering Process Based on LS-SVM Algorithms," in Advances in Computer Science and Information Engineering, vol. 168 of Advances in Intelligent and Soft Computing, pp. 449-454, Springer, Berlin, Germany, 2012.

[51] Y. Guo, X. Li, G. Bai, and J. Ma, "Time series prediction method based on LS-SVR with modified gaussian RBF," in Neural Information Processing, vol. 7664 of Lecture Notes in Computer Science, pp. 9-17, Springer, Berlin, Germany, 2012.

[52] L. Wang and J. Wu, "Neural network ensemble model using PPR and LS-SVR for stock market forecasting," in Advanced Intelligent Computing, vol. 6838 of Lecture Notes in Computer Science, pp. 1-8, Springer, Berlin, Germany, 2012.

[53] Y. Li, X. Shao, and W. Cai, "A consensus least squares support vector regression (LS-SVR) for analysis of near-infrared spectra of plant samples," Talanta, vol. 72, no. 1, pp. 217-222, 2007.

Jose-Luis Casteleiro-Roca, (1) Esteban Jove, (1) Fernando Sanchez-Lasheras, (2) Juan-Albino Mendez-Perez, (3) Jose-Luis Calvo-Rolle, (1) and Francisco Javier de Cos Juez (4)

(1) Department of Industrial Engineering, University of A Coruna, A Coruna, Spain

(2) Department of Construction and Manufacturing Engineering, University of Oviedo, Oviedo, Spain

(3) Department of Computer Science and System Engineering, University of La Laguna, Tenerife, Spain

(4) Department of Mining Exploitation and Prospecting, University of Oviedo, Oviedo, Spain

Correspondence should be addressed to Fernando Sanchez-Lasheras; sanchezfernando@uniovi.es

Received 8 February 2017; Revised 15 June 2017; Accepted 10 July 2017; Published 28 August 2017

Academic Editor: Eduard Llobet

Caption: FIGURE 1: Capacity confirmation test diagram.

Caption: FIGURE 2: Current and voltage measured during one cycle.

Caption: FIGURE 3: Energy and temperatures measured in one cycle.

Caption: FIGURE 4: Model approach.

Caption: FIGURE 5: Modelling process.

Caption: FIGURE 6: PCA analysis.

Caption: FIGURE 7: MLHL analysis.

Caption: FIGURE 8: Boxplot of MSE by cluster.

TABLE 1: Samples assigned to clusters. Cluster Samples (1) Charge 6619 (2) Rest 1089 (3) Discharge 7451 (4) Rest 1211 Total 16370 TABLE 2: Best MSE achieved for each technique. Cycle state MLP Poly LS-SVR (1) Charge 0.1094 23.9967 0.1392 (2) Rest 0.0921 0.0904 0.0902 (3) Discharge 0.4767 17.8144 0.5052 (4) Rest 0.0917 0.0930 0.0865 TABLE 3: MSE for the best methods. Cycle state Model MSE (1) Charge MLP, 10 neurons 0.1392 (2) Rest MLP, 8 neurons 0.4767 (3) Discharge LS-SVR 0.0902 (4) Rest LS-SVR 0.0865 TABLE 4: p value for each model in each cluster. Cluster Selected model Compared model 1 ANN-MLP, 10 neurons Polynomial 1 ANN-MLP, 10 neurons SVR 2 LS-SVR ANN-MLP, 10 neurons 2 LS-SVR Polynomial 3 ANN-MLP, 8 neurons Polynomial 3 ANN-MLP, 8 neurons SVR 4 LS-SVR ANN-MLP, 10 neurons 4 LS-SVR Polynomial Cluster Selected model p value 1 ANN-MLP, 10 neurons 9.44 x [10.sup.-5] 1 ANN-MLP, 10 neurons 0.1364 2 LS-SVR 0.0695 2 LS-SVR 0.0735 3 ANN-MLP, 8 neurons 2.876 x [10.sup.-9] 3 ANN-MLP, 8 neurons 0.4789 4 LS-SVR 0.4970 4 LS-SVR 0.8713

Printer friendly Cite/link Email Feedback | |

Title Annotation: | Research Article |
---|---|

Author: | Casteleiro-Roca, Jose-Luis; Jove, Esteban; Sanchez-Lasheras, Fernando; Mendez-Perez, Juan-Albino; Ca |

Publication: | Journal of Sensors |

Date: | Jan 1, 2017 |

Words: | 6011 |

Previous Article: | Structure-Dependent C[O.sub.2] Gas Sensitivity of [La.sub.2][O.sub.2]C[O.sub.3] Thin Films. |

Next Article: | A Model-Based Virtual Sensor for Condition Monitoring of Li-Ion Batteries in Cyber-Physical Vehicle Systems. |

Topics: |