ON NONPARAMETRIC IDENTIFICATION AND DUAL CONTROL OF QUASILINEAR DYNAMIC PROCESSES


Дәйексөз келтіру

Толық мәтін

Аннотация

The problems of identification and control of stochastic objects with a discrete - continuous technological process nature under conditions of nonparametric indeterminacy are considered. This term means a situation when the process structure with the accuracy to within parameters remains unknown. A more general class of dynamical nonlinear proc- esses, later of quasilinear processes, is investigated. The processes in this category are characterized by a low degree of nonlinearity, that is the superposition principle for this type of object is insolvent. Such processes often occur in vari- ous control loops for aerospace objects and systems. Nonparametric models where the dynamic process memory depth is specified on the basis of the selection of essential variables rule are given. According to this rule, the only variables included in the nonparametric model, at which the optimum blurring factor of the kernel is minimal. Nonparametric algorithms of quasilinear objects dual control are given. Control devices built on the basis of these algorithms not only perform the object control function directly, but also its study. The case when the control device corresponding to its inverse model “turns on” at the object input is considered. The process of dual control system training with active information accumulation is analyzed. The results of a numerical study of nonparametric models for quasilinear proc- esses with memory are presented in detail, as well as the results of a computational experiment using the algorithm of nonparametric adaptive dual control. In the simulation, objects characteristics were described by equations with different degrees of nonlinearity, the form of which was unknown, and which, during active information accumulation, were automatically restituted on the basis of the input-output variables process measurement. Also, the influence of various noise levels affecting an object and in measurement channels was investigated. The given computational ex- periments confirmed the possibility of using nonparametric algorithms for identifying and controlling of quasilinear systems.

Толық мәтін

Introduction. Traditionally nonlinear dynamic proc- esses identification and control theories developed for particular classes of systems and models. In the first works on these problems the methods based on Voltaire number [1; 2] - the convolution linear integral, with its use it was possible to describe nonlinear systems, gener- ally prevailed. It is necessary to consider that for early stages of identification for nonlinear systems methods development the noise was most often assumed by the additive white Gaussian noise [3] that was serious restric- tion for many relevant processes. Further other types of models were actively developed, especially important among them are block structure models, for example, a Wiener model and a Gammerstein’s model [4; 5]. This type of models consists of static nonlinear and dynamic linear elements. Also, methods based on correlation and separable functions able to define which block and strucothers. It is necessary to consider that essential difficulties of identification and control of nonlinear objects problem solving are caused by object structure complexity and incompleteness of mathematical description and informa- tion about an object. In this case, if a priori information on the studied process is insufficient for the reasonable choice of parametrical structure, the use of nonparametric identification theory is expedient [6]. The nonparametric theory, unlike previous, assumes that only qualitative characteristics of system, such as, for example, linearity or nonlinearities, stability or instability and others are known. It means that the use of nonparametric theory allows to deviate, completely in many cases, from a ques- tion of an object parametrical structure determination. Identification problem definition. Let us consider the following modeling scheme of one-dimensional non- linear dynamic process (fig. 1). tural models would be suitable for system description In fig. 1 the designations are accepted: x(t) - an output were developed. The basic feature of the above listed re- search directions is that they include a stage of a model parametrical structure choice therefore these methods are applicable only for identification and control of rather well studied processes for which the studied process equa- tion type is in some way or other defined, for example, in the form of differential or difference equations, transfer functions, a set of standard links of a dynamic system and process variable, u(t) - the control action, xs (t) - an object model output; (t) - the continuous time; the index t - discrete time; hu , x - random noise of measuret h t t t t ments of process corresponding variables; the index h at object variables for the simplicity reason is omitted: ut = uh , x = xh ; x(t) - vector random noise. Fig. 1. A flowchart of quasilinear dynamic processes modeling Рис. 1. Блок-схема моделирования квазилинейных динамических процессов Let us enter the following designations: Nonparametric identification. As we said before, zt = (z1 , K, z then k +1 ) = ( xt -1 , K, xt-k , ut ), (1) under conditions of nonparametric indeterminacy the description of an object to within parameters is unknown. In this case, a conditional mathematical expectation of a form can be accepted as an object model: xt = f ( zt ). (2) x s ( t ) = M { x ( t ) / u ( t )} . (4) In the conditions of nonparametric indeterminacy the structure of nonlinear dynamic process model can be described by the equation in a discrete form: As the studied object model the following nonpara- metric estimation of regression function on observations {xi , ui , i = 1, s} in a discrete form can be accepted [8]: xt = f (xt -1, xt -2 , ..., xt -k , ut ), (3) 1 s æ us - ui ö k 1 æ xs- j - xi- j ö where f(.) is an unknown functional, k - “the depth” of a xs = å xi × H çç u ÷÷Õ x j H ç ÷, (5) x j dynamic object memory (on A. A. Feldbaum’s termis i=1 è cs ø j =1 cs ç cs ÷ nology) [7]. If we draw analogy to the studied process description in the continuous time in the form of differential equations, then k - an order of the senior derivative in where H (×) x1 xk è ø s - a nuclear bell-shaped function, а cu , cs , ..., cs - nuclear function blur coefficients which the corresponding equation. It is essential here that the satisfy convergence conditions [8]. functional type is not defined to within parameters. With regard to redesignations the process model (3) can be The only output variables xs-1 , ..., xs-k are included shown on the following scheme (fig. 2) which illustrates the dynamic system model in a discrete time reduced to a static system model when not only variables ut , but also in the model (5), which were defined with the use of dynamic process memory depth rule determination [9; 10]. The essence of this rule is as follows: if we put into corrext -1, xt -2 , ..., xt -k and others are delivered to an entry spondence a nuclear function blur coefficient с* x1 , с* x2 , s s of the last. ..., с*xk for each output variable xs-1 ,..., xs-k of nonparas The considered process refers to a class of discrete- continuous, that is the process is continuous by nature, however, the process “input-output” variables are controlled through discrete time points Dt , forming an “input- output” variables {xi , ui , i = 1, s} sample of observations, where s - is a sample size, the index h at object variables for the simplicity reason is omitted. metric estimation (5), which delays at some values, at which the difference between the obtained model and the actual object output will be minimum, then that variable with the smallest coefficient contributes a lot to generation of total estimation and thereby has a greater influence on an output variable. These particular variables have to be considered in model (5). It should be noted that the model (5) can be used only at equal intervals of measurement Dt . Fig. 2. A quasilinear dynamic process control flowchart Рис. 2. Блок-схема управления квазилинейным динамическим процессом Dual control algorithms. Dual control was opened by A. A. Feldbaum in 1960 and developed on the basis of the statistical decisions theory [7]. It should be noted that the studying control systems are systems “with memory”, that means they are not only able to study characteristics of an object, but also, keeping them in memory, to develop the operating action us+1, the first augend, that is u* starts obtaining an escalating role. Thus, in the process of an object dual control both the stage of object studying, and the stage of its reduction to the purpose appear. s For control estimation the relative error was used: rational control actions. The theory of dual management gained its further development in researches of various W = Rs / Ds , 1 s 2 (10) authors [11-14], in particular B. Vittenmarka [15]. Let us note that when we set a problem, most often the noise where Rs = å(xi - x* ) i i s i=1 , xi - an output variable measprobability distribution and also an object parametrical ured value, x* - the setting action, Ds - estimation of the model are assumed to be known. That is the developed regulation algorithms are applicable in conditions when the model structure of an object or a regulator is set to within parameters. In case when the dynamic process structure is not certain, one of possible solutions of a con- trol task is the application of nonparametric adaptive con- trol systems keeping dualism components. Let us consider an idealized case at first. We enter an object operator A, which describes the process, that is output variable xt dispersion. Computing experiment. Let us consider the linear case for a start. Let an object be described by the follow- ing simple equation: xt = 0.2 × xt -1 - 0.3 × xt - 2 + 0.4 × xt -3 + 1.5ut . (11) In the described experiment the training sample is formed in the following way. Let a series of various input actions u1(t) - u4 (t) be described by the equations (12), their charts are in fig. 3. x (t ) = A < u (t ) > , where u (t ) - a control action, x(t ) - an object output. If there is an operator, inverse to u1(t) = 0.6 + 0.3×sin (0.15 × t); A, that is А - 1 , А -1 А = 1 - a unity operator, then u2 (t) = 0.5 + 0.2 ×cos (0.05 × t); А - 1 x ( t ) = А - 1 A < u ( t ) > , u ( t ) = A - 1 x ( t ). (6) u3 (t) = 0.23 + 0.1×sin (0.2 × t); u4 (t) = 0.1+ 0.4 ×sin (0.2 × t); (12) Assuming now x(t) = x*(t), where x*(t) - the setting trajectory, we find from (6) the ideal value u(t) = u*(t). Thus, the type of the ideal regulator -regulator) [6] can be presented in the form (6). The main problem is that in most cases operators A and A-1 are unknown, and, there- fore, the given method of the -regulator construction becomes very problematic. Therefore, there is a question of dual control scheme construction which is close to actual. Under conditions of nonparametric indeterminacy the description of an object to within parameters, as it was already mentioned, is unknown. In this case, as direct operator А estimation the conditional expectation (4) can be accepted, and estimates of the inverse operator А-1: u5 (t) = 0.3 + 0.8 ×sin (0.05 × t). The received values form the common input variables u(t) sample defined in the range of u(t)Î[-0.5,1.1] . The values of the input variable u(t) are substituted in the equation (12), thus we receive an input-output variables {xi ,ui ,i = 1,s} sample, where s - the training sample size. In this case s = 500. For the offered identification algorithm operability check we will create a new sample of input actions u6 (t) = 1+ 0.5×sin (0.1× t) . The identification algorithm is carried out according to the following scheme. We set initial value k = 6. Further we develop a model in pursuu * ( t ) = M {u (t ) / x (t ) = x * ( t )} . (7) ance of formula (5). The relative modeling error is It is equivalent to “inclusion” of an “anti-object” to the W0 = 0.017. Further we find optimum coefficients with the coordinate descent algorithm use. The found coefficients entry of the operated object 4 1 If we accept expression (5) as a direct operator А estimaare equal to с* = 0.38 - at ut , с* = 0.231 - at xt -1 , tion, then the inverse operator А-1 estimation will be (8): с* = 0.378 - at x , с* = 0.456 - at x , с* = 1.876 - 2 t -2 3 t -3 5 s * k æ x ö at xt-4 , с* = 1.999 - at xt-5 , с* = 1.983 - at xt-6 . After u* = 1 u × H æ xs+1 - xi ö 1 H ç s- j - xi- j ÷. 6 7 (8) s s å i çç с ÷÷Õ x j ç x j ÷ realization of all algorithm stages, variables xt -4 , xt -5 , i=1 è s ø j =1 cs è cs ø xt -6 were excluded. Thus the developed dynamic object The nonparametric dual control algorithm explicitly described in [6] is: final structure is: x = a x u = u * + D u , (9) s 1 t -1 2 t -2 3 t -3 t s + 1 s s + 1 what corresponds to a difference object equation structure s where u* - an augend in charge of an object information (11), and a nonparametric model is: accumulation, and Dus+1 = e(x* 1 - xs ) - the “studying” 1 s æ u - u ö 1 æ x - x ö c с c è ø s+ xs = å xi × Н ç s i ÷ × Фç s-1 i-1 ÷´ search steps. Let us analyze the character of algorithm dualism (9). On the initial control step the main role s i=1 ç u ÷ x1 ç s s è x1 ÷ s ø belongs to the second addend Dus+1 from formula (9). It ´ 1 æ xs-2 - xi-2 ö 1 æ xs-3 - xi-3 ö (13) Н ç ÷ × Н ç ÷. с c с is a case of active information accumulation in the system of dual control which begins with the first observation of x2 ç s è x2 ÷ x3 ç s ø s è x3 ÷ c s ø input and output object variables emergence. In the proc- ess of training (information accumulation) when forming The forecast of the output x6(t) variable with non- parametric model use (13) is shown in fig. 4. t - 3 The relative identification error for the first experi- ment is equal to W = 0.12, that means satisfactory quality of the model. Thus, it is possible to draw a conclusion that the nonparametric algorithm (5) can be applicable for the linear dynamic processes identification with the known depth of memory k. We enter power functions into the process input difference equation. Let the object equation be described as: model quality, even at rather high level of the external noise affecting the process. From the analysis of fig. 5 and relative error values, it is possible to draw a conclusion that in spite of the fact that the process (14) does not refer to the linear processes class, the nonparametric algorithm (5) copes with an identification task, and, thus, it can be applicable also for identification of nonlinear dynamic processes at partial nonparametric indeterminacy. t -1 xt = 0.2 × x1.1 - 0 .3 × x t - 2 + 0.4 × x1.2 + 1 .5ut . (14) Let us consider the possibility of nonparametric algo- rithms application to control different types of quasilinear Let the process (14) be affected by the additive noise level x(t) = 5 %, distributed according to the normal law with a zero expectation and dispersions 1. To model a random value distributed according to the normal law the method of polar coordinates was used. The coefficient of k was found in the same way as earlier. The results of object identification (14) when delivering to the entry of input action u6 (t) are shown in fig. 5. The relative identification error for the second ex- periment is equal to W = 0.19, that also means satisfactory dynamic objects. To begin with, in the first experiment we will consider the linear dynamic object which is described by the following simple equation: xt = 0.2 × xt -1 - 0.3 × xt - 2 + 0.4 × xt -3 + 1.5ut . (15) The results of nonparametric object control (15) with the use of control law (9) are presented in fig. 6. In fig. 6 the case of active information accumulation is presented, that is the initial training sample before performing an experiment was absent. Fig. 3. Series of input actions u1(t) - u4 (t) Рис. 3. Серия входных воздействий u1(t) - u4 (t) Fig. 4. Object output (11) x6(t) and nonparametric model output (13) xs (t) , at an input variable u6 (t) delivered to the entry Рис. 4. Выход объекта (11) x6 (t) и выход непараметрической модели (13) xs (t) при подаче на вход входной переменной u6(t) Fig. 5. Object output (14) and nonparametric model output x6 (t) (14) xs (t) , when delivering an input variable u6(t) to the entry, with the noise x(t) = 5 % Рис. 5. Выход объекта (14) x6 (t) и выход непараметрической модели (14) xs (t) при подаче на вход входной переменной u6 (t) , при помехе x(t) = 5 % Fig. 6. Linear dynamic object control (15) Рис. 6. Управление линейным динамическим объектом (15) The relative control error for the first experiment is equal to W = 0,076, this suggests high quality of non- parametric control. In the analysis of figure 6 it is possible to observe how the dual control algorithm at active infor- mation accumulation behaves. On the initial steps of con- trol, from the 1st to the 20th steps, the algorithm needs more time for reduction of an object to the control action The results of object control (16) at passive informa- tion accumulation, that is in case when there is a training sample with the size equal to s = 300, are presented in fig. 7. The relative object control error (16) is equal to W = 0,085. We increase a sample size to s = 500 (fig. 8). For the experiment presented in fig. 8 the relative conas at this stage a sample training and accumulation take trol object error (16) is W = 0.045. We enter a x = 5 % place. After the training level, starting from the 20th step, at the control action change an output object variable practically at once becomes equal to the task. We consider the possibility of nonparametric control algorithm appli- cation (9) for quasilinear dynamic systems. In the follow- ing experiments dynamic objects with different degree of nonlinearity will perform as the studied objects. Let the dynamic object difference equation contain power functions, the object equation is described as fol- lows: noise level (the noise is added to an output variable meas- urement, distributed by normal distribution), the control results for this experiment are presented in fig. 9. Thus, from the analysis of fig. 7-9 and values of the relative control errors, it is possible to draw a conclusion that the nonparametric control algorithm (9) successfully copes with the task of quasilinear dynamic systems regu- lation in case of passive information accumulation, even at external noise influence on the object under study, at the same time the quality of control practically does not t -1 xt = 0, 2 × x1.2 - 0, 3 × x t - 2 + 0, 4 × x t - 3 + 1.5ut . (16) differ from the linear object control at active information accumulation. Fig. 7. Dynamic object control (16) at a sample size s = 300 Рис. 7. Управление динамическим объектом (16) при объеме выборки s = 300 Fig. 8. Quasilinear dynamic object control (16) at a sample size s = 500 Рис. 8. Управление квазилинейным динамическим объектом (16) при объеме выборки s = 500 Fig. 9. Quasilinear dynamic object control (16) at a sample size s = 300, x = 5 % noise level Рис. 9. Управление квазилинейным динамическим объектом (16) при объеме выборки s = 300, помеха x = 5 % We consider the dependence of the relative control error W on a power function index p (fig. 10). A variable function index will change xt -3 : xt = 0.2 × xt -1 - 0.3 × xt - 2 + t -3 + 0.4 × x p +1.5ut . Fig. 10 shows that if a power function index is less than 1.5, a relative control error is sufficiently low. On the contrary, at power function index increase more than 1.6, the sharp increase of the relative control error is observed. Information on dependence of the relative modeling error on the studied function type is provided in table. Fig. 10. Dependence between the relative regulation error on a power function index Рис. 10. Зависимость между относительной ошибкой регулирования и показателем степенной функции Dependence of the relative control error on an object equation type Type of the equation The relative control error I. The linear equation: xt = 0.2 × xt -1 - 0.3 × xt - 2 + 0.4 × xt - 3 + 1.5ut 0.076 II. Non-linear equations (algebraic) xt = 0.2 × x1.2 - 0 .3 × x + 0 .4 × x + 1 .5u t -1 t - 2 t - 3 t 0.085 xt = 0.2 × xt -1 - 0 .3 × x1.2 + 0 .4 × xt - 3 + 1 .5ut t - 2 0.088 xt = 0.2 × xt -1 - 0.3 × xt - 2 + 0.4 × x1.2 + 1.5ut t - 3 0.113 xt = 0.2 × x1.2 - 0.3 × x1.2 + 0.4 × x1.2 + 1.5u t -1 t - 2 t - 3 t 0.124 xt = 0.2 × x1.5 - 0.3 × x + 0.4 × x + 1.5u t -1 t - 2 t - 3 t 0.115 xt = 0.2 × x 2 - 0.3 × x + 0.4 × x + 1.5u t -1 t - 2 t - 3 t 0.66 xt = 0.2 × xt -1 - 0.3 × x 2 2 + 0.4 × xt - 3 + 1.5ut t - 0.78 xt = 0.2 × xt -1 - 0 .3 × xt - 2 + 0.4 × xt - 3 + 1 .5ut 0.082 xt = 0.2 × xt - 1.5 - 0.3 × x + 0 .4 × x + 1.5u 1 t - 2 t - 3 t 0.23 x = 0.2 × 1 - 0.3 × x + 0.4 × x + 1.5u t x t - 2 t - 3 t t -1 0.456 III. Non-linear equations (transcendental) xt = 0.2 × sin ( xt -1) - 0.3 × xt-2 + 0.4 × xt -3 + 1.5ut 0.105 xt = 0.2 × xt -1 - 0.3 × log10 (xt -2 ) + 0.4 × xt -3 + 1.5ut 0.128 x = 0.2 × ext -1 - 0.3 × x + 0.4 × x + 1.5u t t -2 t -3 t 0.89 xt = 0.2 × cos(xt -1) - 0.3 × xt -2 + 0.4 × sin(xt -3) + 1.5ut 0.117 From the table analysis it is possible to draw the fol- lowing conclusions: if the dynamic object difference equation contains degrees up to the second order, the con- trol algorithm (9) successfully copes with the control task. The relative control error increases if the power function is entered for the variable at which the coefficient has a greater influence on an output value. At increase in degree the quality of control considerably worsens. If the degree at some variable is equal to 2 or more in a difference equation, then a nonlinear dynamic object control with a control algorithm (9) use is impossible. If an object difference equation contains fractional degrees, the control algorithm (9) also copes with the con- trol task. The results of this computing experiment show that even at each variable involution, the control algo- rithm (9) copes with the task of an object reduction to the setting action. It is also possible to notice that if a dy- namic object difference equation contains trigonometric or logarithmic functions, then a control algorithm applica- tion (9) is possible. The considerable deterioration of con- trol is observed in the presence of exponential function in the object equation. At the negative degrees the control quality considerably decreases. Conclusion. The following important conclusion can be considered as the result of the present article. Non- parametric algorithms, as it was previously known, can be efficiently applicable for identification problem solving and also are rather qualitatively able to conduct the oper- ated process of the linear class or category. It was natu- rally interesting, what if the process is nonlinear, in a varying degree. The computing experiments given above showed that identification and control algorithms can be also used for nonlinear dynamic objects control or objects for which the principle of superposition is not performed any more. Thus, the main conclusion which we can make is that the class of dynamic discrete continuous processes slightly extends for the nonparametric identification and control theory of the linear dynamic processes.
×

Авторлар туралы

A. Medvedev

Reshetnev Siberian State University of Science and Technology

31, Krasnoyarsky Rabochy Av., Krasnoyarsk, 660037, Russian Federation

A. Raskina

Siberian Federal University

Email: raskina.1012@gmail.com
79, Svobodny Av., Krasnoyarsk, 660041, Russian Federation

Әдебиет тізімі

  1. Павленко В. Д. Идентификация нелинейных ди- намических систем в виде ядер Вольтерра на основе данных измерений импульсных откликов // Электрон- ное моделирование. 2010. Т. 32, № 3. С. 3-18.
  2. Гельднер K., Кубик С. Нелинейные системы управления : пер. с нем. М. : Мир, 1987. 368 с.
  3. Андриевский Б. Р., Матвеев А. С., Фрадков А. Л. Управление и оценивание при информационных ог- раничениях: к единой теории управления, вычислений и связи // Автоматика и телемеханика. 2010. № 4. С. 34-99.
  4. Коплярова Н. В., Медведев А. В. Непараметри- ческие алгоритмы управления системами класса Гаммерштейна // Вестник СибГАУ. 2015. Т. 16, № 1. С. 62-73.
  5. Коплярова Н. В. О непараметрической иденти- фикации стохастических объектов класса Винера // Системы автоматизации в образовании, науке и про- изводстве : Тр. IX Всерос. науч.-практ. конф. / под ред. С. М. Кулакова, Л. П. Мышляева ; Сиб. гос. индустр. ун-т. Новокузнецк : Изд. центр СибГИУ, 2013. С. 445-451.
  6. Медведев А. В. Основы теории адаптивных сис- тем / СибГАУ. Красноярск. 2015. 525 c.
  7. Фельдбаум А. А. Основы теории оптимальных автоматических систем. М. : Физматгиз, 1963. 552 с.
  8. Надарая Э. А. Непараметрические оценки плот- ности вероятности и кривой регрессии. Тбилиси : Изд. Тбил. ун-та, 1983. 194 с.
  9. Раскина А. В. Определение структуры линейно- го динамического объекта в задачах непараметриче- ской идентификации // Вестник СибГАУ. 2016. Т. 17, № 4. С. 891-898.
  10. Медведев А. В. Адаптация и обучение в усло- виях непараметрической неопределенности // Фунда- ментальные исследования (физико-математические и технические науки). Новосибирск : Наука. Сиб. отд- ние, 1977. С. 92-97.
  11. Wenk C. J., Bar-Shalom Y. A multiple model adaptive dual control algorithm for stochastic systems with unknown parameters // Automatic Control: IEEE Transactions on. 2003. Vol. 25, iss. 4. Pp. 703-710.
  12. Duan Lia, Fucai Qianb, Peilin Fuc. Optimal nomi- nal dual control for discrete-time linear-quadratic Gaussian problems with unknown parameters // Automatica. 2008. Vol. 44, iss. 1. Pp. 119-127
  13. Tse E., Bar-Shalom Y. An actively adaptive con- trol for linear systems with random parameters via the dual control approach // Automatic Control: IEEE Trans- actions on. 2003. Vol. 18, iss. 2. Pp. 109-117.
  14. Filatov N. M., Keuchel U., Unbehauen H. Dual control for an unstable mechanical plant // Control Systems: IEEE. 1996, Vol. 16, iss. 4. P. 31-37.
  15. Wittenmark B. Adaptive dual control methods: An overview // 5th IFAC Symposium on Adaptive Sys- tems in Control and Signal Processing. 1995. Pp. 67-72.

Қосымша файлдар

Қосымша файлдар
Әрекет
1. JATS XML

© Medvedev A.V., Raskina A.V., 2017

Creative Commons License
Бұл мақала лицензия бойынша қолжетімді Creative Commons Attribution 4.0 International License.

Осы сайт cookie-файлдарды пайдаланады

Біздің сайтты пайдалануды жалғастыра отырып, сіз сайттың дұрыс жұмыс істеуін қамтамасыз ететін cookie файлдарын өңдеуге келісім бересіз.< / br>< / br>cookie файлдары туралы< / a>