## Forecasting of the specialities demand in the sector of mineral resources of the labor market

**Authors:**Ilyushina A.N.^{1}, Afanasyev P.M.^{2}, Novozhilov I.M.^{3}, Voznesensky A.S.^{3}**Affiliations:**- Saint-Petersburg College of Management and Commerce
- Admiral Makarov State University of Maritime and Inland Shipping
- Saint-Petersburg Electrotechnical University “LETI”

**Issue:**2019: Proceedings of 2019 XVIII Russian Scientific and Practical Conference on Planning and Teaching Engineering Staff for the Industrial and Economic Complex of the Region (PTES)**Pages:**111-113**Section:**Presentations**URL:**https://journals.eco-vector.com/PTES/article/view/26326**DOI:**https://doi.org/10.17816/PTES26326- Cite item

# Abstract

The article observes the issue of forecasting and creating a predictive model based on data obtained as a result of expert evaluation. Authors introduce a calculation technique and implement an experimental calculation of the demand for one factor (specialty or group of specialty) within the framework of a general request of the labor market.

# Full Text

## I. Field of the research

We often face in practice with the task of forecasting random variables, whether it is a sharp drop in the demand for specialists or the opening of a new educational institution. This fact is a prerequisite for the application of probabilistic models. Probabilistic models allow us to calculate the probability that the future value of the parameter of the predicted process will be less than a certain number, for example, the probability that y<y_{i}

${P}_{i}=P\left(y<{y}_{i}\right)$.

The value ** y** can lie within the limits ${y}_{0}\le y\le {y}_{n}$ because $P\left(y<{y}_{0}\right)=0$ and $P\left(y<{y}_{n}\right)=1$

The distribution curve of the continuous random variable *y* shown in the figure is a graph of the distribution function $P\left(y\right)$. The distribution function exists for both continuous and discrete random variables and is a universal characteristic of random variables.

Knowing the distribution function, we can find the probability of a random variable falling on a given section by the equation ${y}_{0}\le y\le {y}_{n}$:

$P\left({y}_{i}\le y\le {y}_{n}\right)=1-{P}_{i}$.

For continuous random variables, is often considered the derivative of the distribution function of the form

$f\left(y\right)=\frac{dP\left(y\right)}{dy}$,

or the distribution density of a continuous random variable *y*. The probability of a random variable *y* entering a certain section is shown by the equation

$P\left({y}_{1}\le y\le {y}_{2}\right)=\underset{{y}_{1}}{\overset{{y}_{2}}{\int}}f\left(y\right)dy$.

Thus, we can carry out prediction of the probability of an event when predicting the distribution functions considered. Moreover, in many practical cases, there is no need to characterize the random variable completely, and it is enough to predict only some distribution parameters (for example, mathematical expectation and variance).

In some cases, we can describe the data obtained during the process of observations of the predicted process by well-known distributions of continuous and discrete random variables, including normal distribution, uniform distribution, exponential distribution, Poisson distribution, and some others.

If the type and parameters of the named distributions do not change in time and a sufficient number of observations is available, then solving the forecasting problem does not cause any particular difficulties. We can construct an empirical distribution, solve the task of the theoretical distribution curve for a given empirical distribution choice, and carry out forecasting using the required accuracy. However, in practice, as a rule, the researcher has limited information about the process and, in addition, it is not always possible to guarantee the invariance of the form and distribution parameters. These conditions predetermine the use of more complex probabilistic models based on the latest achievements of probability theory. These most intensively developed areas of probability theory include, in particular, the theory of small samples and the theory of summing a random number of independent random variables.

## II. Formulation of the problem

The analysis of the forecasting object and the forecast background at the retrospection period (forecast base period) established that the system development process can be represented by a stepwise process (a sequence of jumps made at random times). The magnitude ${y}_{i}$ is a random variable which behavior is described by the distribution law $f\left(y\right)$. The number of jumps *n* at the forecast lead-time is random, distributed according to the law ${P}_{n}$. It is required to determine the distribution function of the output parameter of the system *y*.

## III. A numerical example

The traditional (main) analytical apparatus of probability theory and mathematical statistics is the apparatus of characteristic functions. It is known that if $Y$ is a real random variable, then there exists a complex random variable $X=\mathrm{exp}\left\{itY\right\}$ (where $i=\sqrt{-1}$ – is an imaginary unit, and **t** is a real number).

The function of a view

$\varphi \left(t\right)=EX=E{e}^{itY}$,

Where ** E** is the symbol of mathematical expectation, and is called the characteristic function of a random variable $Y$. That means that the characteristic function of a random variable $Y$ is the mathematical expectation of a complex random variable $X$.

The characteristic function is dimensionless, and the parameter *t* has a dimension inverse to the dimension of a random variable $Y$.

We use the main properties of characteristic functions to solve the problem, from the conditions of the solution of which we know that the output parameter of the system *y* depends both on the random number of jumps *n* in the lead-time period and on the random value ${y}_{i}$ of each jump. Moreover, the random variables ${y}_{1},{y}_{2}\mathrm{,...,}{y}_{r}\mathrm{,...,}{y}_{n}$ are independent, equally distributed and do not depend on the random variable *n*.

We assume that the Poisson's law can determine the number of jumps in the forecast lead-time

${P}_{n}\left(r=n\right)=\frac{{\lambda}^{n}{e}^{-\lambda}}{n!}\text{\hspace{0.33em}}\to \text{\hspace{0.33em}}n=\mathrm{0,1,2...}$,

With a parameter $\lambda $, the relation $\lambda =\overline{n}$ is valid for the Poisson distribution.

The random variable *y* (the value of the jump) has a standard normal distribution $N\left(\mathrm{0,1}\right)$ with parameters ${m}_{y}=\mathrm{0,}\text{\hspace{0.33em}}{\sigma}^{2}=1$ and probability density

$f\left(y\right)=\frac{1}{\sqrt{2\pi}}{e}^{-\frac{{y}^{2}}{2}}$.

Thus, in order to obtain the distribution law of the output parameter, it is necessary to consider the distribution of the sum of the Poisson number of standard normal quantities.

Based on the multiplicative property of the characteristic function, the characteristic function of the sum of independent random variables is equal to the product of the characteristic functions of random variables, what means that if $y=\sum _{i=1}^{n}{y}_{i}$, then

$\varphi \left(t\right)=E{e}^{ity}=\prod _{i=1}^{n}{\varphi}_{i}\left(t\right)$,

We can write that the cumulative distribution function $F\left(y\right)$ of the sum of a random number *n* of random variables ${y}_{i}$ is determined by the characteristic function

$\varphi \left(t\right)=\sum _{n=0\left(1\right)}^{\infty}{P}_{n}{\varphi}_{i}^{n}\left(t\right)$,

Where ${\varphi}_{i}\left(t\right)$ is the characteristic function of a random variable ${y}_{i}$.

Below we consider the characteristic function of a standard normal distribution:

$\begin{array}{c}{\varphi}_{i}\left(t\right)=\underset{-\infty}{\overset{\infty}{\int}}{e}^{ity}\frac{1}{\sqrt{2\pi}}{e}^{-\frac{{y}^{2}}{2}}dy=\frac{1}{\sqrt{2\pi}}\underset{-\infty}{\overset{\infty}{\int}}{e}^{-\frac{{\left(y-it\right)}^{2}}{2}-\frac{{t}^{2}}{2}}dy=\\ ={e}^{-\frac{{t}^{2}}{2}}\frac{1}{\sqrt{2\pi}}\underset{-\infty}{\overset{\infty}{\int}}{e}^{-\frac{{\left(y-it\right)}^{2}}{2}}dy.\end{array}$

Since the integral looks like $\underset{-\infty}{\overset{\infty}{\int}}{e}^{-\frac{{y}^{2}}{2}}dy=\sqrt{2\pi}$, then $\varphi \left(t\right)={e}^{-\frac{{t}^{2}}{2}}$.

Hence, the characteristic function of the sum of the Poisson number of standard normal quantities has the form of

$\varphi \left(t\right)=\sum _{n=0\left(1\right)}^{\infty}\frac{{\lambda}^{n}{e}^{-\lambda}}{n!}{e}^{-\frac{t{}_{2}}{2}n}=\sum _{n=0\left(1\right)}^{\infty}\frac{{\lambda}^{n}}{n!}{e}^{-\lambda -\frac{{t}^{2}}{2}n}$.

For definiteness, we exclude the case when $n=0$ from our consideration. Then

$\varphi \left(t\right)=\sum _{n=1}^{\infty}\frac{{\lambda}^{n}}{n!\left(1-{e}^{-\lambda}\right)}{e}^{-\lambda -\frac{{t}^{2}}{2}n}$.

Based on the inversion formula

$f\left(y\right)=\frac{dF\left(y\right)}{dy}$;

$f\left(y\right)=\frac{1}{2\pi}\underset{-\infty}{\overset{\infty}{\int}}{e}^{-ity}\sum _{n=1}^{\infty}{P}_{n}{\varphi}_{i}^{n}\left(t\right)$,

then

$f\left(y\right)=\frac{1}{2\pi}\sum _{n=1}^{\infty}\frac{{\lambda}^{n}{e}^{-\lambda}}{n!\left(1-{e}^{-\lambda}\right)}\underset{-\infty}{\overset{\infty}{\int}}{e}^{-ity-\frac{{t}^{2}}{2}n}dt$.

After the integration, we obtain the desired distribution density:

$f\left(y\right)=\sum _{n=1}^{\infty}\frac{{\lambda}^{n}{e}^{-\lambda}}{n!\left(1-{e}^{-\lambda}\right)}\frac{1}{\sqrt{2\pi n}}{e}^{-\frac{{y}^{2}}{2n}}$

## IV. Conclusion

As we have already noted, the methods of mathematical modeling are one the most general and at the same time quite strict forecasting methods. However, when we use them, we cannot predict changes in the nature of technology over time. The characteristics are defined as the result of optimization. However, we can predict the input data necessary for the functioning of the general mathematical model. We mean the prediction of exogenous variables, that is, those that are determined by dependencies, which are not a part of the main model, in contrast to endogenous variables (in particular, performance indicators of technology), which are the desired (output) variables of the main mathematical model.

We can make a prediction of input data to optimize the characteristics of the equipment, and sometimes the characteristics themselves, by one of the following methods, which differ in the scientific basis and the reliability of the forecasting results:

- compilation, according to well-known laws of nature of theoretical cause-and-effect mathematical models, which determine the predicted development processes and evaluate the parameters of these processes according to the history list and current state;
- manufacturing and testing of prototypes and experimental samples, as well as physical modeling;
- compilation of empirical dependencies according to the statistics of the history list and the present data, what is called the regression analysis and extrapolation.

Sometimes researchers also distinguish the expert assessment methods, but in most cases, experts themselves should use the above mentioned methods.

# About the authors

### A. N. Ilyushina

Saint-Petersburg College of Management and Commerce
**Author for correspondence.**

Email: bdbyu@mail.ru

Russian Federation, St. Petersburg

### P. M. Afanasyev

Admiral Makarov State University of Maritime and Inland Shipping
Email: paulafanasiev@gmail.com

Russian Federation, St. Petersburg

### I. M. Novozhilov

Saint-Petersburg Electrotechnical University “LETI”
Email: novozhilovim@list.ru

Russian Federation, St. Petersburg

### A. S. Voznesensky

Saint-Petersburg Electrotechnical University “LETI”
Email: A-Voznesensky@yandex.ru

Russian Federation, St. Petersburg

# References

- Kukharova T.V., Pershin I.M. Conditions of Application of Distributed Systems Synthesis Methods to Multidimensional Object. 2018 International Multi-Conference on Industrial Engineering and Modern Technologies, FarEastCon 2018, 2018, October 2-4, Vladivostok, Russia. No. 8602749 doi: 10.1109/FarEastCon.2018.8602749
- Kukharova T.V., Utkin V.A., Boev I.V. Observation and Prediction Systems Modeling for Human Mental State. 2018 International Multi-Conference on Industrial Engineering and Modern Technologies, FarEastCon 2018, 2018, October 2- 4, Vladivostok, Russia. No. 8602831 doi: 10.1109/FarEastCon.2018.8602831
- Martirosyan A.V., Martirosyan, K.V., Chernyshev A.B. Application of Fourier Series in Distributed Control Systems Simulation. 2019 IEEE Conference of Russian Young Researchers in Electrical and Electronic Engineering (EIConRus) January 28-31, St.Petersburg, Russia. 2019. Pp. 609–613. doi: 10.1109/EIConRus.2019.8656865
- Martirosyan A.V., Martirosyan K.V., Chernyshev A.B. Methods of distributed systems’ structured modeling. 2016 IEEE NW Russia Young Researchers in Electrical and Electronic Engineering Conference February 2-3. St.Petersburg, Russia. 2016. P. 283–289. doi: 10.1109/EIConRusNW.2016.7448175
- Martirosyan A.V., Martirosyan K.V. Uality improvement information technology for mineral water field’s control. 2016 IEEE Conference on Quality Management, Transport and Information Security, Information Technologies October 4-11. Nalchik, Russia. 2016. P. 147-151. doi: 10.1109/ITMQIS.2016.7751925
- Ilyushin Y., Mokeev A. Distribution of temperature in a spatially onedimensional object as a result of the active point source. ARPN Journal of Engineering and Applied Sciences 14(6).2019, pp. 1238-1243
- Afanaseva O., Ilyushin Y. Analysis and synthesis of distributed icedrill heating control system of mountain reconnaissance drilling rig. International Multidisciplinary Scientific GeoConference Surveying Geology and Mining Ecology Management: 18th International Multidisciplinary Scientific Geoconference, SGEM 2018, Bulgaria, Albena 2–8 July 2018. Stef92 Technology Ltd., 2018, Volume 18, Issue 2.2, pp. 41-48. doi: 10.5593/sgem2018/2.2/S08.006
- Afanaseva O., Ilyushin Y. Analysis and processing of the hydrolitospheric plast information remote sensing through the theory of systems with distributed. International Multidisciplinary Scientific GeoConference Surveying Geology and Mining Ecology Management. 18th International Multidisciplinary Scientific Geoconference, SGEM 2018, Bulgaria, Albena, 2–8 July 2018. Stef92 Technology Ltd., 2018, Volume 18, Issue 2.2, pp. 35-40. doi: 10.5593/sgem2018/2.2/S08.005
- Ilyushin Y.V., Novozhilov I.M. Software implementation of a pulse regulator of a distributed distributed control object. 2017. Proceedings of 2017 IEEE 2nd International Conference on Control in Technical Systems, CTS 2017, No. 8109555, pp. 315-317. doi: 10.1109/CTSYS.2017.8109555
- Ilyushin Y.V., Novozhilov I.M. Development of a technique for the synthesis of a pulsed regulator of a distributed control system. 2017. Proceedings of 2017 IEEE 2nd International Conference on Control in Technical Systems, CTS 2017, No. 8109517, pp. 168-171. doi: 10.1109/CTSYS.2017.8109517