中图分类号:O657.3 文献标识码:A
Development a Spectrophotometric of Fe(Ⅲ), Al(Ⅲ) and Cu(Ⅱ) Using Eriochrome Cyanine R Ligand and Assessment of the Obtained Data by Partial Least-Squares and Artificial Neural Network Method—Application to Natural Waters
A. Hakan AKTAŞ
Department of Chemistry, Faculty of Science & Art, Süleyman Demirel University, Isparta 32260, Turkey

e-mail: hakanaktas@sdu.edu.tr

Abstract

Simultaneous determination of heavy metal cations and accurate quantitative prediction of them are of great interest in analytical chemistry. This work has focused on a comprehensive comparison of partial least squares (PLS-1) and artificial neural networks (ANN) as two types of chemometric methods. For this purpose, aluminum, iron and copper were studied as three analytes whose UV-Vis absorption spectra highly overlap each other. Accordance with determined parameters (ligand concentration, pH, waiting times, the relationship between absorbance and concentration of metal ion effect and foreign ions) are provided and the optimum conditions. After establishing the optimum conditions for Fe3+, Al3+ and Cu2+ containing mixtures spectrophotometric determinations and the data calibration method of least squares (PLS-1) regression, and artificial neural network (ANN) methods were used. Chemometric methods are applied in a fast, simple, and the results are applicable.

Key words: UV-Vis spectrophotometry; Partial least squares; Artificial neural network; Aluminum; Iron; Copper
Introduction

Iron, aluminum and copper are common elements in everyday life. Iron is an essential trace element, but according to recent reports from the World Health Organization (WHO), an extraordinary large number of people (4~5 billion) may be suffering from some form of iron deficiency, and up to 2 billion of these, may also have anemia[1]. Indicating how they affect humans and other living organisms[2, 3, 4, 5, 6]. This study in natural waters in iron, aluminum and copper as spectrophotometric method for determining the development is intended to be assessed by multivariate calibration techniques. Of each component in the synthetic sample mixture comprises the importance of working pairs which have to be analyzed individually. The use of the chemometric methods used up to now in the mixed analysis of the UV-Vis spectrophotometer today. One or both of these metals were analyzed alone or in combination with, a plurality of methods are as in the literature. Some of these electro thermal atomic absorption method (ETAAS) and graphite furnace atomic absorption method (GFAAS)[7], inductively coupled plasma-optical emission spectrometry method (ICP-MS)[8, 9], atomic absorption spectrophotometer (AAS)[10] chromatography[11, 12] spectrophotometry[13, 14] and voltammetry[15]. In spite of the importance of simultaneous determination of Fe(Ⅲ ), Al(Ⅲ ) and Cu(Ⅱ ), only few works have been reported for simultaneous determination of these ions[16, 17]. Again according to the welding research, iron, aluminum showed the use of various ligand for copper spectrophotometric determinations. Examples include cresol yellow[9], alizarin red[16], 2-hydroxy-1-naphthaldehyde benzyl hydrazine[18] eriochrome black T[19], cetyl trimethyl ammonium bromide[20, 21], chrome azurol S[22], and cetyl pyridinium chloride[23]. This paper aims to develop simple, fast and sensitive methods for the simultaneous determination of Fe(Ⅲ ), Al(Ⅲ ) and Cu(Ⅱ ) in natural waters by UV-Vis spectroscopy with the help of partial least squares multivariate calibration techniques and neural network modeling.

1 Materials and Methods
1.1 Reagents and apparatus

All commercial reagents used were of analytical grade, Merck (Germany), or Fluka (Switzerland) and were used without further purification.

Recording of the absorption spectra were performed using a UV-Vis spectrophotometer (UV 1700 Pharmasec Shimadzu model) and pH adjustment was performed with Thermo Orion 5-Star pH meter. All measurement were carried out at 25 ℃ using a quartz cuvette of 1.0 cm optical path. Partial least squares and neural network modeling were implemented by MATLAB R2013a software (Math Works).

1.2 Method

An aliquot of the 0.4 mol· L-1 ammonium chloride-acetic acid buffer solution was micro pipetted into a 10-mm quartz cell. Finally, 0.1 mL 7.4× 10-4 mol· L-1 eriochrome cyanine R solution was added, and the mixture was briefly agitated (2~5 s). The addition of 0.5 mL sample solution of a metal ion followed, to give a total volume of 2.5 mL. The mixture was then stirred rapidly by a hand-controlled micro-stirrer, and the timer was started. The cell was then placed into the cell holder, and the absorbance was measured against a blank in the range of 400~600 nm. Thus, a total of 32 spectra was collected from a solution, and from such measurements, the spectral data could be arranged into a three-way data matrix. The temperature of the experiments was kept constant at 25 ℃.

A set of standards containing iron(Ⅲ ), aluminum(Ⅲ ) and copper(Ⅱ ) mixtures was prepared at different concentrations, and their spectrophotometric data were measured according to the above procedure. Different chemometrics models were established and were then used to predict the unknown samples.

By convenient dilution of stock solutions, a calibration set of 32 samples were built. Concentrations of the analyte used for the calibration set were in the range 1.0~20.0 ppm. The samples were prepared based on the ratio of water components in the range of each of the three analyte. Standard solutions were prepared in 25 mL volumetric flasks by addition of suitable amounts of each stock solution and diluted by hydrochloric acid to the mark. Also 16 additional samples were prepared similarly to constitute the test set. The UV-Vis spectra of the corresponding solutions were recorded in the same spectral conditions at ambient temperature and the obtained data were used for PLS and ANN modeling.

1.3 ANN modeling

In this study, a multilayer perception (MLP) feed-forward network was applied for modeling the concentrations Fe3+, Al3+ and Cu2+, schematically shown in Fig.1. The logistic function was used as the activation function in a neural network.

Fig.1 Schematic representation of multi-layer feed-forward neural network architecture

The MLP consists of three types of layers; an input, an output and a hidden layer, each layer contains a number of neurons which operate parallel. These neurons are connected by weights that are modified during the learning phase. Implementing ANN model normally requires a number of steps: data collection, data pre-processing, building the network, training, testing, validation and data post-processing, successively.

2 Results and Discussion
2.1 Experimental design of the calibration matrix

A concentration set of 32 mixture solutions consisting of Fe3+-ECR, Al3+-ECR and Cu2+-ECR in the concentration range of 0.25~1.25, 0.1~0.4 and 0.4~1.2 ppm for Fe3+, Al3+ and Cu2+ in the same solvent were symmetrically prepared from the prepared stock solutions respectively. A ternary diagram was utilized to generate six design points to incorporate different chemical compositions and remove any possibility of factor aliasing (Fig.2). The reason for this is to minimize errors in calibration that may occur during analysis. To check the proposed methods we used an independent validation set consisting of the synthetic mixture solutions of Fe3+-ECR, Al3+-ECR and Cu2+-ECR in the above working concentration ranges.

2.2 Effect of pH

Spectra from a set of sodium acetate-acetic acid buffer solutions with pH values in the range of 4.0~4.6 were sampled at the reaction time of 300 s (see Fig.3). It can be seen that the different pH conditions did not affect significantly the spectral absorbance of the Fe3+-ECR complexes. The absorbance versus pH of the Fe3+-ECR, Al3+-ECR and Cu2+-ECR reactions with the eriochrome cyanine R ligand were also studied at different times with the measurements being recorded at λ max(Fe(Ⅲ ))=526.8 nm, and λ max(Al(Ⅲ ))=532.6 nm and λ max(Cu(Ⅱ ))=519 nm. The kinetic curves for iron(Ⅲ )/ligand aluminum(Ⅲ )/ligand and copper(Ⅱ )/ligand reactions did not change with different pH. Thus, considering all the factors for the pH effects described above, a pH of 4.0 was selected for this work.

Fig.2 Ternary design diagram. Squares were the calibration data points, and circles were the prediction data points

Fig.3 Spectra of Fe3+-ECR complexes at different pH

2.3 Effect of ligand concentration

Fe3+-ECR, Al3+-ECR and Cu2+-ECR ligand complex formation the amount of ECR in order to have the most appropriate, Fe3+, Al3+ and Cu2+ solution pH by adding various volumes of metal solutions ECR ambient pH=4 was adjusted with buffer. Passing to the results obtained with the graphs ligand 12 times more highest absorbance values is reached. According to these results, the most appropriate amount of ligand in the formation of the complex was selected 12 times.

2.4 PLS-1 method

In the first step to developing the PLS-1 model, a calibration set was built with a dataset of 32 samples, in order to take into account the different concentration ratios of analytes and to cover the range usually present in natural water samples.

The full cross validation method suggested by Haaland and Thomas[24] was used for the accurate selection of the optimum number of factors. This consisted of removing one sample at a time from the calibration step and carrying out the calibration by the remaining samples. The concentration of the one sample removed is expected with the achieved model. This step was repeated for each considered sample. The procedure can be repeated after fixing a different number of factors. Prediction residual error sum of squares (PRESS) was obtained from Eq.(1) and best number of factors was selected in the minimum of PRESS values. In this equation n is the number of samples in the calibration set and Ci, added and Ci, found are the actual and predicted concentration of analyte in the ith sample, respectively[25].

PRESS=i=1n(Ciadded-Cifound)2(1)

Similarly, the standard error of cross-validation (SEC) can also be defined as

SEC=i=1n(Ciadded-Cifound)2n(2)

Where n is the total number of the synthetic mixtures. In order to determine the optimum number of factors, SEC was plotted as a function of different number of factors. The optimum number of factors was obtained to be 2 for Fe3+, 5 for Al3+ and 6 for Cu2+ in which SEC attained the least possible value. Afterwards, the developed PLS-1 model was applied to an independent test set including 16 artificial samples which were not used during calibration. The experimental and found (predicted) concentration of each analyte can be seen in Table 1.

2.5 Analytical figures of merit

Analytical figures of merit (AFM) are very important to quantify the quality of a given methodology or for method comparison. In multivariate calibration, several AFM have been reported, e.g. selectivity, sensitivity, and limit of detection[26, 27, 28].

The net analyte signal (NAS) is a new family of multivariate calibration methods that has been defined by Lorber[26] as the part of a mixture spectrum that is useful for model building. The net analyte signal (NAS) for analyte k ( rk* ) is defined as the part of the signal that is orthogonal to the signal of the interferences present in the sample. In general, for inverse calibration methods, it is given by the following equation

rk* =[I-R-k(R-k)+]r=PNAS, kr(3)

where r is the spectrum of a given sample (when r is the spectrum sk of pure k at unit concentration, Eq. (3) becomes sk* =PNAS, k sk), I is a J× J unitary matrix, R-k is a J× A column space spanned by the spectra of all other analytes except k (R-k+ is the pseudo inverse of R-k and A is the number of spectral factors used to build the model), and PNAS, k is a J× J projection matrix which projects a given vector onto the NAS space. In factor-based methods such as PCR, PLS and NAS-based methods this latter statement is true only to the extent that the selected factors are representative of the spectra of all other analytes.

Table 1 Composition of test set and predicted values for Fe3+, Al3+ and Cu2+ by PLS-1 and ANN regression. Concentration values are expressed as ppm

Selectivity (SEL) is an amount, ranging from 0 to 1, of how unique the spectrum of the analyte is, compared with the other species. It requires a part of the total signal that is not lost due to spectral overlap and can be defined by referring to the NAS calculation in a multivariate context:

SEL=sk* sk(4)

Details of the calculations of the NAS in PLS and other Details of the calculations of the NAS in PLS and other multivariate methods are given in Ref.[27]. On the other hand, the sensitivity (SEN) is given by[29]:

SEN=1bk(5)

where bk is the vector of final regression coefficients appropriate for component k, which can be obtained by any multivariate method.

The analytical sensitivity (γ ), which is defined in analogy with multivariate calibration, is the ratio between SEN and the instrumental noise (ε ), according to Eq.(6):

γ=SENε(6)

The value of ‖ ε ‖ may be estimated from the standard deviation in the NAS of several blanks. Concerning the limit of detection, the following simple equation has been proposed for its estimation[31]:

LOD=3εsk* (7)

Table 2 shows several estimated FOM for Fe3+, Al3+ and Cu2+ by PLS-1 model.

Table 2 Analytical figures of merit of the spectrophotometric method by applying PLS-1 for Al3+, Fe3+ and Cu2+
3.6 ANN method

In the second part of this work, the spectral data of each analyte was used to develop a neural network model. For this purpose, the network parameters were specified. Among these parameters are; the number of input layers, the number of hidden layers, number of neurons, n each layer, the output layer. In this study several hidden layer with different number of neurons were tested with logistic functions.

It is believed that neural network training can be made more efficient if certain pre-processing steps are performed on the raw input and target data. Randomizing and normalizing data are the most common practices of data pre-processing. Normalization procedure often makes the model more efficient as it prevents the learning algorithm from being confused by the unequal magnitude of different variables and consequently neglecting with the smaller magnitude. The training and testing data sets must be normalized into a range 0.1~0.9. The input and the output data sets were normalized by using following equation:

XN=0.1+0.8(X-Xmin)(Xmax-Xmin)(8)

where XN is normalized the value of a variable (the network input or the network output), X is a original value of a variable, and Xmax and Xmin are maximum and minimum original values of the variables, respectively. In order to produce sufficient data for training and testing of the model shown in Figure 1, 32 different standard solutions were prepared using different analyte concentrations and each standard solutions was subjected to spectrophotometric determination. Randomly chosen 800 data pairs from these 1 000 data pairs were used in the training of the neural network, and the rest of the data were used in the testing. The root mean square error values were calculated from following equation to prove quantitatively the accuracy of the testing results of neural network models:

RMS=0.5N-1i=1N(X'1-X1)2(9)

where N is the number of testing data and X'1 is target value.

The developed model which was trained in the previous stage was further simulated by a test set of 16 unknown data. The results obtained by ANN method is listed in Table 2 alongside those predicted by the other method investigated in this work and will be discussed subsequently.

2.6 Comparison of the predictive abilities of proposed methods

Once the optimal number of factors has been determined, the final calibration may be performed using all the calibration samples with the optimal number of factors. As mentioned earlier, a set of independent test dataset 16 samples which has not been used previously for calibration, was modeled by PLS-1 and ANN methods. The predicted values of each component together with reference values can be seen Table 2. Also, a residual analysis can be carried out by plotting residual vs. the concentration of each analyte. Towards this end, the absolute residual was defined between actual concentration and the predicted value from a model. Figure 4 demonstrates the residual values for the one analyte obtained by PLS-1 and ANN models.

As can be realized in these set of figures, the residual errors committed by ANN method are closely distributed around the zero-order line for all three components. Highest accuracies are obtained for prediction of Fe3+ and Cu2+ with ANN method with all estimated concentrations close to the reference value. Prediction of Al3+ seems to exhibit more deviations than the other components, however, this deviation between experimental and predicted values was mitigated by ANN method in comparison to the PLS-1 regression model.

Table 4 shows a number of important statistical parameters such as the determination coefficient of prediction ( Rpred2, Eq.10), relative error of prediction (REP, Eq. 11) and root mean square error of prediction (RMSEP, Eq.12).

Fig.4 Residual errors of different methods vs. concentration plots for the one component of test set
Rpred2=1-i=1n(ci, act-ci, pred)2i=1n(ci, act-c)2 (10)

REP%=100×i=1n(ci, act-ci, pred)2i=1n(ci, act-c)2(11)

RMSEP=i=1n(ci, act-ci, pred)2n(12)

where Ci, act and Ci, pred is the actual and predicted concentration of a component in the ith sample, respectively. C is the mean of actual concentrations in a particular set and n is the number of samples in the test set.

This table also shows the values of the optimal number of factors used for both PLS-1 method and ANN in the calibration set. As can be seen in Table 3, the predictive ability of ANN method was clearly improved after selecting the optimum region of wavelengths. The results also indicate that predictions of PLS-1 and ANN models for Fe3+ are in excellent agreement with experimental data, as high R2 values of 0.998 9 and 0.998 2 was obtained for these methods, respectively.

Table 3 Statistical parameters calibration and test set by PLS-1 and ANN regression for three analyte
2.7 Application to real samples

The data obtained from the methods developed in the light of natural waters Fe3+, Cu 2+ and Al3+’ s determination to UV spectrophotometric primarily as tap water is taken from Sü leyman Demirel University. Representative water behalf affect the results of the experiment natural water sample was filtered twice, then read on the UV spectrophotometer under optimum conditions obtained in the complex formation. Optimization and method development which has been implemented in natural water samples confirmed the results. The recovery result in the application are shown in Table 4. The results are the average of six experiments.

Table 4 Application of the natural water samples to PLS-1 and ANN validation procedure and the obtained recovery values

The World Health Organization (WHO) and standard values obtained from the literature by Turkish standards are given in Table 5. The test results appear to be obtained within the limits of the chart[31, 32].

Table 5 The World Health Organization and the value of TS 266

Chemometric methods as a result of Fe3+, Al3+ and Cu2+ which is suitable for the analysis of mix and high reproducibility of this method is sensitive and accurate results that showed.

3 Conclusion

The present work studied the simultaneous quantification of Al3+, Fe3+ and Cu2+ with the aid of UV-Vis spectroscopy combined with different chemometric techniques. In order to identify the most suitable chemometric method, various calibration models including PLS-1 and ANN were developed and validated by independent test set of natural water mixtures. The cross-validation leave-one-sample-out procedure was applied in the present work to obtain the optimum factors during calibration. Performance of ANN was also optimized through manipulating different network parameters and architectures. Furthermore, several figures of merit such as LOD and analytical sensitivity were calculated and their values found to be significantly better for ANN than PLS-1 method. The results indicated that ANN is capable of capturing the interactions between analytes with highly-overlapped absorption spectra with a good degree of accuracy.

The authors have declared that no competing interests exist.

参考文献
[1] Yongnian N, Chunfang H, Kokot S. Anal. Chim. Acta, 2007, 599: 209. [本文引用:1]
[2] Ekinci D, Beydemir Ş, Küfrevioğlu O İ. Journal of Enzyme Inhibition and Medicinal Chemistry, 2008, 22(6): 745. [本文引用:1]
[3] Çoban A T, Şentürk M, Çiftçi M, et al. Protein & Peptide Letters, 2007, 14(10): 1027. [本文引用:1]
[4] Tekman B, Özdemir H, Şentürk, et al. Comparative Biochemistry and Physiology C- Toxicology & Pharmacology, 2008, 148(2): 117. [本文引用:1]
[5] Ceyhun S B, Şentürk M, Yerlikaya E, et al. Environmental Toxicology and Physiology C-Toxicology & Pharmacology, 2008, 148(2): 117. [本文引用:1]
[6] Demirdağ R, Yerlikaya E, Şentürk M, et al. Enzyme Inhibition and Medicinal Chemistry, 2013, 28(2): 278. [本文引用:1]
[7] Scancar J, Benedik R M M, Kr?zaj I. Talanta, 2003, 59: 355. [本文引用:1]
[8] Tahan J E, Granadilla V A, Romero R A. Anal. Chim. Acta, 2004, 295: 187. [本文引用:1]
[9] Nagaoku M H, Maitani T. Analyst, 2000, 125: 1962. [本文引用:2]
[10] Coscaine A, R, Andrade J, C, Poppi R J. Analyst, 2002, 127: 135. [本文引用:1]
[11] Kendüzler E, Türker A R. Anal. Chim. Acta, 2003, 480: 259. [本文引用:1]
[12] Ryan E, Meaney M. Analyst, 1992, 117: 1435. [本文引用:1]
[13] Coscine A R, Andrade J C, Poppi R J, et al. Anal. Chim. Acta, 2000, 423: 31. [本文引用:1]
[14] Wang N, Liang W. Talanta, 1993, 40(6): 897. [本文引用:1]
[15] Limson J, Nyokong T, Daya S. Journal of Pineal Research, 1998, 24(1): 15. [本文引用:1]
[16] Safavi A, Abdollahi H, Mirzajani R. Spectrochimica Acta Part A, 2006, 63: 196. [本文引用:2]
[17] Peng J, Liu S, Deng C. Anal. Sci. , 2005, 21(3): 259. [本文引用:1]
[18] Reddy V K, Reddy S M, et al. Journal of Analytical Chemistry, 2000, 55(5): 435. [本文引用:1]
[19] Honorato R S, Carneiro J M T, Zagatto E A G. Anal. Chim. Acta, 2001, 441: 309. [本文引用:1]
[20] Rodrigues J L, Magalhaes S C, Luccas P O. J. of Phar. and Bio. Ana. , 2005, 36: 1119. [本文引用:1]
[21] Royset O. Anal. Chim. Acta, 1985, 178: 223. [本文引用:1]
[22] Marczenko Z, Kaowska H. Anal. Chim. Acta, 1981, 123: 279. [本文引用:1]
[23] Gao H, W, Chen L, Yang Q Z. Journal of AOAC International, 2005, 88: 1231. [本文引用:1]
[24] Thomas E V, Haaland D M. Anal. Chem. , 1990, 62: 1091. [本文引用:1]
[25] Aktaş A H, Kitiş F. Croatica Chim. Acta, 2014, 87(1): 69. [本文引用:1]
[26] Lorber A. Anal. Chem. , 1986, 58: 1167. [本文引用:2]
[27] Booksh K S, Kowalski B R. Anal. Chem. , 1994, 66: 782A. [本文引用:1]
[28] Karim A Z, Sara M K. Journal of Chinese Chemical Society, 2015, 62: 772. [本文引用:1]
[29] Goicoechea H C, Olivieri A C. Anal. Chem. , 1999, 71: 4361. [本文引用:1]
[30] Boque’ R, Rius F X. Chemom. Intell. Lab. Syst. , 1996, 32: 11. [本文引用:1]
[31] Turkish Stand ards. >Water-Drinking and Potable Water TS, Ankara, 1997, 266: 32. [本文引用:2]
[32] WHO. World Health Organization, 2nd ed. , Genova, 1993, 1: 188. [本文引用:1]