ABOUT PARAMETRIC IDENTIFICATION ALGORITHMS OF DISCRETE-CONTINUOUS PROCESSES


Cite item

Full Text

Abstract

Researches presented in the paper are devoted to parametric modelling of multidimensional processes of discrete- continuous type in the condition of priori information lack. Similar processes occur in the space industry, for example, in the manufacture of products based on electronic components. The article considers multidimensional processes with unknown mathematical description. Using parametric approach, we choose the structure of investigated process with the accuracy to within parameters, and the next step is to estimate the model parameters from the available sample of observations of the process input and output variables. The paper examines the case when due to the lack of priori knowledge about the object an error is allowed at the stage of parametric structure choosing. The relative approxima- tion error is used to estimate the model accuracy, which shows the difference between model and object outputs. A comparative analysis of several parametric models for one investigated object is carried out is. Using the method of least squares we obtain estimates of the parameters. The paper presents the results of a series of computational experiments illustrating the dependence of the modelling error on the object noise level, as well as on the sample size of observations of the input and output variables. One of the obvious parametric models advantages is the ease of its applying. However, if the dimension of the input variables vector is high, the process has a complex structure, and there is no priori information about the object structure, then it is difficult to use parametric methods. In this case, it is advisable to use nonparametric identification methods. In this paper we use a nonparametric estimation of the regression function on observations of Nadaraya-Watson as an estimate of the process output variable. However, such estimates require a large number of initial data, also they are sensitive to various kinds of defects in the initial samples of observations. Besides that, the paper compares nonpara- metric model with parametric one for the investigated process.

Full Text

Introduction. When researching different processes or phenomena, it is necessary to build models, since we are not always able to conduct experiments with real ob- jects. Herewith various methods of mathematical model- ling and identification theory are used. These methods are relevant for studying processes in different branches of human activities (technological, socio-economic proc- esses, etc.). For instance, designing of space vehicles or liquid rocket engines [1; 2]. It is really important to qualitatively determine the mathematical model of the process at the initial stages of the system investigation. These days higher standards of system control are being set. In order to correspond to them, it is necessary to conduct a lot of research and experiments to ensure the construction of an adequate mathematical model of the considering system and to find a more accurate model structure of the object. Each stage of identification requires different methods due to increasing of the amount of incoming information obtained from the research results. Depending on the amount of information available about the investigated object, the most appropriate method is chosen. Methods for mathematical models determining from the results of experimental researches are the subject of identification theory [3]. Based on the foregoing, it should be added that the construction of a mathematical model depends on the amount of priori information. In this regard, there are two approaches to identification: in a “narrow” and a “broad” sense. For problems in a “narrow” sense there is a vast amount of priori information, whether it is the structure of the system or the class of models to which it refers. This problem statement is close to real conditions and is more applicable in engineering practice [3]. In the “broad” sense of the identification problem there is clear deficit of priori information, or we have access to information only about a qualitative nature object. These methods are time-consuming as they require detailed consideration in each individual case because of the individual features of the objects under study [3]. In conditions of parametric uncertainty parametric identification methods are basically used. Such methods at the first stage assume the determination of the model structure from the available priori information about the object accurate to a parameter vector, which are later es- timated with known methods [4]. With parametric model- ling, there are cases when priori information is not enough to accurately determine the structure of the model, which can lead to significant errors. In the worst case this can lead to large inaccuracies at the stage of formulating the identification task, which results in an absolutely wrong solution. This problem led to the development of non- parametric identification methods, which have proliferated in recent years [5]. Surely, non-parametric methods allow analysis without using the object model equation. They are more universal and work with greater uncer- tainty in priori information, this can not be said about parametric methods [6]. However, under conditions of sufficient information parametric methods give better results. A large number of scientists, both in Russia and abroad, starting from the middle of the twentieth century conducted their studies related to the problem of system identification, since at those particular years it was really significant to broaden knowledge in the field of control [7]. It is commonly believed that Professor Eykhoff P. (Holland) was the first person who systematized the knowledge of the whole variety of identification algorithms and methods. In his monograph [7] he describes the basic concepts of the model, identification problem setting and gives key methods for solving problems for various classes of objects. His book may be useful for those who are working on construction or analysis of processes or phenomena models nowadays. A bit earlier two Profes- sors Andrew P. Sage and James L. Melsa from the USA wrote their book [8], which considers the most well- known methods of identification back then and also con- ducts their in-depth analysis. Not less important person among foreign scientists who contributed to the develop- ment of identification theory is Professor D. Grop (USA). His book [3] complements the knowledge and informa- tion which is presented in the book of two Professors Andrew P. Sage and James L. Melsa bringing his new results in the field of identification, and also considering a wider class of different methods that allow us to deter- mine the parameters and structure of the mathematical model. In Russia the knowledge in the identification sphere was systemized a bit later than abroad, closer to the end of twentieth century. One of the first scientists who wrote a book about constructing mathematical models of com- plex control objects was Professor N. S. Raibman. In his pamphlet [9], as the author himself called it, N. S. Raib- man describes some of the basic concepts and methods of identification which are intended basically for non- specialists in the control sphere. Thus, the professor tries to popularize the understanding of the methodology for constructing mathematical models. We would also like to mention proceedings of Professor Sh. E. Shteynberg, who tried to summarize the overwhelming majority of identifi- cation methods, possible to be used in engineering prac- tice. His book [10] is aimed at considering the methods which are implemented in industrial enterprises, where there are some missed measurements, noise in the object, the correlation of this noise in time with each other. Therefore, it is applicable to specialists developing control systems. However, the professor’s proceedings can be also useful for an ordinary student. Among other things we should say about the Russian scientist, who made a great contribution to the development of identifi- cation and whose work would be wrong not to mention. It is Professor Ya. Z. Tsypkin. His book [4] can be called fundamental as it describes a large range of nuances that appear during the identification algorithms formation. Also, the question of the most optimal identification algo- rithms selection is considered from various sides. In addi- tion, the “Information Theory of Identification” is a more modern book that was published at the very end of the twentieth century, which emphasizes its relevance to the present days. Nowadays, there is an active development of the theory of identification in all directions. Thus, in his work [11] L. P. Myshlyaev and coauthors suggest presenting the identification system as a closed-loop dynamic system where the controlled object is the structure of the object model. This approach will help to avoid using adaptive control methods which are effective only for certain classes of objects. A wide application of parametric iden- The Problem Statement. The main goal of the paper is to obtain a parametric estimate of the investigated dis- crete-continuous process. The case when an error was made at the stage of choosing the model structure was studied. In addition, a nonparametric estimation of the investigated process was constructed. The obtained results are compared with the parametric identification method. The generally accepted flowchart of the investigated dis- crete-continuous process is shown in fig. 1, where the following notation is used: А is an unknown object func- tional; y(t)ÎΩ(y)ÌR1 is an output process variable; u(t) = = (ui(t), i = 1, …, m)ÎΩ(u)ÌRm is a vector input stimulus; ζ(t) is a vector random stimulus; t is a continuous time; Hu, Hy are communication channels; hu(t), hy(t) are ran- dom measurements noise; {ui, yi, i = 1, …, s} is a training sample, where s is a sample size. Computational experiment. Suppose that the exam- ined object in the computational experiment framework is described by the following equation: y(t) = a1 × u1(t) + a2 ×[u2 (t)]2 + tification is found in electromechanical systems. For + a3 ×sin[u3 (t)] + a4 × u4 (t) + x(t), (1) example, in the paper [12], a parametric identification algorithm of an induction motor with a cage rotor is given, where in some cases it is difficult to obtain information about the values of parameters on the basis of where (α1, α2, α3, α4) = α - are the coefficients of the stud- ied object; ζ(t) - is noise with mean value equals zero which was generated in the following way: catalogued data or to determine them experimentally with the use of special devices. Therefore, the question of conx(t) = y(t) × c(t) × k, (2) structing a parametric model is quite relevant in this case. For particularly complex technical and humantechnical systems, intelligent control systems are used that combine a large number of different identification meth- ods what is very important in today’s rapidly developing conditions and a huge flow of information. V. B. Trofi- mov and S. M. Kulakov write about such systems in their where c(t) - is a normal distributed random variable in the interval of [-1;1]; k - is the noise level. Let us assume the model equation has the following form: yˆ1(t) = aˆ1 × u1(t) + aˆ 2 ×[u2 (t)]2 + + aˆ 3 ×sin[u3 (t)] + aˆ 4 × u4 (t). work [13]. The authors present the theoretical and applied fundamental basis of intelligent control systems and also offer to familiarize themselves with algorithms that allow solving the actual tasks of various complex systems moni- toring and control. Similar studies have already been conducted earlier in the field of parametric identification. The results are presented in [14]. Next, we suppose such situation when in the modelling process an error was made in the structure with u2 and then the following equation now describes the model: yˆ2 (t) = aˆ1 × u1(t) + aˆ 2 ×[u2 (t)]3 + + aˆ 3 ×sin[u3 (t)] + aˆ 4 × u4 (t). Fig. 1. Flowchart of investigated process Рис. 1. Схема исследуемого процесса The next step is to estimate the parameters aˆ1 , aˆ 2 , aˆ 3 and aˆ 4 based on the available sample {yi, u1i, u2i, u3i, u4i, i = 1, …, s}. There is a large number of methods for obtaining parameter estimates, for example, the least- squares criterion (hereinafter LS criterion). We will use this criterion. The least-squares criterion is presented below: The computational experiment demonstrates the influence of the error, which was made at the stage of selecting the model parametric structure on the final < odeling results. As the first experiment, we generate the output of the ob- ject (1), in which the input variables uj, j = 1, …, 4, are set by uniform-distributed law pseudorandom numbers in the interval of [0, 3]. The coefficients of the investigated object will be taken as: α1 = 7, α2 = 3, α3 = 10, α4 = 11. s i i F (aˆ ) = 1 ×ås ( y - yˆ )2 ¾¾® min, i=1 aˆ (5) Using LS criterion (5) for (3) and (4) we get the values of the object estimated coefficients. Substituting these values in (3) and (4) we obtain the values of the model outputs. where yˆi - are model output values; aˆ = (aˆ1, aˆ 2 , aˆ 3 , aˆ 4 ) - are unknown coefficients. Let us assume the relative approximation error, according to which the deviation of the output values of the models from the output values of the object will be calculated, in the following form: å 1 2 s ( yi - yˆi ) s In fig. 2 the results are represented, where each time- step corresponds to the values of the object and model outputs obtained from the condition that the sample size is s = 100, and the noise level is 5 %. The relative approximation error for the model (3) calculated using (6) equals: W1 = 0.09, but for model (4) equals: W2 = 0.14. Despite the fact that there is an error in the model structure (4), the results which are shown in fig. 2, as well W = i=1 , (6) as the value of the error W2, indicate that the model de- å 1 s s -1 (mˆ y - yi )2 scribes this object well. This can be explained by present- ing the values of the coefficient estimates. For the model i=1 where mˆ - is the expectation estimation of the object (3): aˆ1 = 6.95 , aˆ 2 = 3.038 , aˆ 3 = 9.986 , aˆ 4 = 11.013 , y output. The form of the nonparametric model constructed for the examined process is presented below [15]: s 4 æ u j - u ji ö for the model (4): aˆ1 = 7.37 , aˆ 2 = 1.06 , aˆ 3 = 11.37 , aˆ 4 = 11.19 . As it is evident, the coefficients values at the parameter u2 differ the most. We construct a graph that visualizes the dependence of the outputs (1), (3) and (4) on the input variable u . For the examined equations we å yi ÕF ç ÷ 2 yˆ(u) = i=1 j =1 è cs ø , (7) assume u1(t) = u3(t) = u4(t) = 1.5; u2 will generate using ås 4 F æ u j - u ji ö Õ ç c ÷ i =1 j =1 è s ø where cs - a blur parameter and Ф(•) - a kernel function satisfy convergence conditions [15]. Kernel Ф(•) has the parabolic form: uniform distribution by pseudorandom numbers in the interval of [0;3]. In fig. 3 the results of this experiment are shown. All above conclusions indicate a feature of the LS criterion which is as follows: even if the model structure is chosen incorrectly at first, the criterion will try to cor- rect it by decreasing or increasing one or another coeffi- where z = ì(0.75 × 1- z 2 , ( )ï F(z) = í ïî0, u - ui . cs z £ 1, z > 1, (8) cient to make the final model output correspond to the object output in the best way. The error value (6) can change due to a different sam- ple size. In fig. 4 we consider the dependence of these parameters on each other for models of the form (3) and (4) at a 5 % noise level. Fig. 2. Graphical dependence of the object (1) and the models (3), (4) outputs in each time-step Рис. 2. График зависимости выходов объекта (1) и моделей (3), (4) в каждый такт времени The above given results (fig. 4) show that with the sample size increase, the modelling error tends to one average value. Among other things, the error (6) changes when the noise level is changed. We investigate this dependence for a sample size s = 100 and summarize the results in tab. 1. According to the results given in tab. 1, we can as- sume that for the model (3) and model (4) the error in- creases with the noise level increase, but for the second case it is more. As the second experiment, we generate an object with input variables uj, j = 1, …, 4 which are set using uniform distribution by pseudorandom numbers in the interval of [-3;3]. Similar to the first experiment, we graph the object output (1) dependence on the models outputs (3) and (4) at a 5 % noise level and the sample size s = 100, which is shown in fig. 5. Relative approximation error (6), for the model (3) equals: W1 = 0.03 and for the model (4) equals: W2 = 0.42. The values of the coefficient estimates for the second experiment are as follows. Model (3): aˆ1 = 7.012 , aˆ 2 = 2.99 , aˆ 3 = 10.029 , aˆ 4 = 10.984 ; model (4): aˆ1 = 6.45 , aˆ 2 = 0.12 , aˆ 3 = 9.84 , aˆ 4 = 9.82 . In this case, the interval of input variables generation had a more significant effect on the final output results of the model (4), which is evident in the error value W2 and fig. 5. Similar to the above experiment in fig. 6, the graph that shows the dependence of the outputs (1), (3) and (4) on the input of the variable u2 is given. For the examined equations we assume u1(t) = u3(t) = u4(t) = 1.5, however, now u2 will be generated by pseudorandom numbers in the interval of [-3;3]. Fig. 3. Dependence of the models (3), (4) and object (1) on u2 outputs Рис. 3. Зависимость значений выходов моделей (3), (4) и объекта (1) от u2 Fig. 4. Relative approximation error graph for different sample sizes Рис. 4. График относительной ошибки восстановления при различных значениях объема выборки Table 1 Dependence of the relative approximation error on different noise levels W1 error of the model with correct structure (3) W2 error of the model with incorrect structure (4) Noise level 3 % 0.0611 0.1057 Noise level 5 % 0.0851 0.1356 Noise level 10 % 0.1583 0.1883 The results which are shown in fig. 6, once again indi- cate that the estimation value of the model (4) coefficient aˆ2 calculated with the criterion (5), is practically equal to zero. Besides that, on the basis of fig. 6 it can be seen that the most qualitative description of the object (1) by the model (4) is only in the interval [-1;1]. Going beyond the limits of this interval, the residual error increases. Similar to the first experiment, we consider the influ- ence of the sample size on the error value (6) at a 5 % noise level, which is shown in fig. 7. The results in fig. 7 are similar to the results of the first experiment. The only difference is that for the model (4) the error value (6) has increased. Now we consider the error value (6) for the different noise levels using sample size s = 100. The obtained re- sults are summarized in tab. 2. It can be seen from the results presented in tab. 2, the changed interval of generating input object variables af- fected the error values. For the model (3) a growth in the level noise from 3 % to 5 % as well as from 5 % to 10 % almost twice increases the error value. For the model (4) the error value also increases with noise level increasing, however, because of its large values, this becomes not essential. As the third experiment, we implement nonparametric modelling of the object (1). The values of the input variables uj, j = 1, …, 4, as in the previous experiments, we will generate pseudorandom numbers using uniform distribution in the intervals of [0;3] and [-3;3]. In the first case (interval [0, 3]), we make nonparamet- ric estimate (7) for which the value of the optimal blur parameter cs = 1.1 was calculated. For the second case interval [-3;3] cs = 2.4. Below, fig. 8 and fig. 9 show the results of nonparametric modelling on the assumption that the sample size is s = 100, the noise level is 5 %. In addition, the error value (6) for different noise levels was calculated. For convenience, we summarize the results in the tab. 3. Based on the results which are given in tab. 3, we can conclude that the error value (6) for parametric model (4) where the input variables are generated in the interval of [-3,3] is higher than for the model (7). In other cases, the error value (6) for the nonparametric model is higher. Fig. 5. Graphical dependence of the object (1) and the models (3), (4) outputs in each time-step Рис. 5. График зависимости выходов (1), (3) и (4) в каждый такт времени Fig. 6. Dependence of the models (3), (4) and object (1) outputs values on values u2 Рис. 6. Зависимость значений выходов (1), (3) и (4) от значений u2 Fig. 7. Relative approximation error graph for different sample sizes Рис. 7. График относительной ошибки восстановления при различных значениях объема выборки Table 2 Dependence of the relative approximation error on different noise levels W1 error of the model with correct structure (3) W1 error of the model with incorrect structure (4) Noise level 3 % 0.0151 0.4086 Noise level 5 % 0.0265 0.415 Noise level 10 % 0.0587 0.4651 Fig. 8. Graphical dependence of the object output (1) and the model output (7) in each time-step for the interval of generating the input variables [0;3] Рис. 8. График зависимости выхода объекта (1) и выхода модели (7) от времени для интервала генерирования входных переменных [0;3] Fig. 9. Graphical dependence of the object output (1) and the model output (7) in each time-step for the interval of generating the input variables [-3;3] Рис. 9. График зависимости выхода объекта (1) и выхода модели (7) 733 от времени для интервала генерирования входных переменных [-3;3] Dependence of the relative approximation error on different noise levels for the nonparametric model Table 3 The error value (6) of the nonparametric model for the generation interval of input variables [0;3] The error value (6) of the nonparametric model for the generation interval of input variables [-3;3] Noise level 3 % 0.268 0.279 Noise level 5 % 0.283 0.314 Noise level 10 % 0.328 0.359 Conclusion. Based on the results of the research, it can be concluded that the minor error made in the process of the model structure definition for one variable does not greatly affect the final results of the model output, since the least-squares criterion neutralizes the error by select- ing the required coefficients. The second experiment results showed us that under certain modelling conditions (for example, the modelling range of input variables values), a model with an error in the structure will describe the object inadequately. The increase in the sample size affects the error value to a less extent, and with a further increase in the number of observations, this value tends to a specific, average value. The noise levels considered in this work have some effect on the error value. The third experiment results showed us that the non- parametric model estimates the investigated object accu- rately enough, but it cannot compete with the correct structure parametric model. Despite this, it is more appro- priate to use the nonparametric model in conditions of priori information lack. This fact is confirmed by this experiment, the results of which showed that the value of the modelling error for a parametric model with incorrect structure is higher than for a nonparametric model.
×

About the authors

M. A. Denisov

Siberian Federal University

Email: max_denisov00@mail.ru
9, Svobodny Av., Krasnoyarsk, 660041, Russian Federation

E. A. Chzhan

Siberian Federal University

9, Svobodny Av., Krasnoyarsk, 660041, Russian Federation

References

  1. Ваганов М. А., Москалец О. Д., Кулаков С. В. Многоканальный спектральный прибор для диагно- стики жидкостного ракетного двигателя // Информа- ционно-управляющие системы. 2013. № 1 (62). С. 2-6.
  2. Мухин С. В., Ребенков А. В. Перспективы раз- вития информационно-измерительных и управляю- щих систем для испытания жидкостного ракетного двигателя на стенде химзавода - филиала ОАО «Красмаш» // Решетневские чтения : материалы XIV Междунар. науч. конф. (10-12 нояб. 2010, г. Красно- ярск) / под общ. ред. Ю. Ю. Логинова ; Сиб. гос. аэро- космич. ун-т. Красноярск, 2010. Ч. 1. С. 261-266.
  3. Гроп Д. Методы идентификации систем. М. : Мир, 1979. 304 с.
  4. Цыпкин Я. З. Информационная теория иденти- фикации. М. : Наука, 1995. 336 с.
  5. Медведев А. В. Непараметрические системы адаптации. Новосибирск : Наука. Сиб. отд-ние, 1983. 176 с.
  6. Comparing three global parametric and local non- parametric models to simulate land use change in diverse areas of the world / A. Tayyebi [et al.] // Environmental Modelling & Software. 2014. No. 59. URL: www. sciencedirect.com/science/journal/13648152.
  7. Эйкхофф П. Основы идентификации систем управления. М. : Мир, 1975. 686 с.
  8. Сейдж Э. П., Мелса Дж. Л. Идентификация сис- тем управления. М. : Наука, 1974. 248 с.
  9. Райбман Н. С. Что такое идентификация? М. : Наука, 1970. 118 с.
  10. Штейнберг Ш. Е. Идентификация в системах управления. М. : Энергоатомиздат, 1987. 80 с.
  11. Синтез и исследование алгоритмов идентифи- кации на базе замкнутых динамических систем / Л. П. Мышляев [и др.] // Идентификация систем и задачи управления : материалы X Междунар. науч. конф. (26-29 янв. 2015, г. Москва) / Институт про- блем управления им. В. А. Трапезникова. М., 2015. С. 397-418.
  12. Однолько Д. С. Параметрическая идентифика- ция асинхронного двигателя в составе частотно- регулируемого электропривода при неподвижном роторе // Вестник Гомельского государственного технического университета им. П. О. Сухого. 2014. № 2 (57). С. 51-59.
  13. Трофимов В. Б., Кулаков С. М. Интеллектуаль- ные автоматизированные системы управления техно- логическими объектами. М. : Инфра-Инженерия, 2016. 232 с.
  14. Корнеева А. А., Чжан Е. А. О параметрическом моделировании стохастических объектов // Вестник СибГАУ. 2013. №. 2 (48). С. 39-42.
  15. Надарая Э. А. Непараметрические оценки плотности вероятности и кривой регрессии. Тбилиси : Изд-во Тбил. ун-та, 1983. 194 c.

Supplementary files

Supplementary Files
Action
1. JATS XML

Copyright (c) 2017 Denisov M.A., Chzhan E.A.

Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 International License.

This website uses cookies

You consent to our cookies if you continue to use our website.

About Cookies