Comparing Mathematical Models for Forecasting the Youth Unemployment Rate in Jordan

0
259

Background: Unemployment is one of the numerous socio-economic challenges that exist in all countries of the world. Governments and various stakeholders rely on accurate forecasting of unemployment rates to make informed decisions relating to investments and fiscal and monetary policies.

Method: The main objective of this study is to compare and examine the predictive accuracy of multiple linear regression (MLR), time series autoregressive distributed lag (ARDL) and vector autoregression (VAR) models for predicting the youth unemployment rate of Jordan. These models are chosen because they are established econometric models, each grounded in specific theoretical principles and practical applications. Using time series annual data from 1991 to 2022, the three models will be fitted to model the youth unemployment rate of Jordan based on (1) population growth rate (annual %), (2) current gross domestic product (GDP) in US $, (3) foreign direct investment net (BoP current US $), (4) external debt stocks as a percentage of gross national income (% of GNI), (5) foreign direct investment net inflows (% of GDP), (6) foreign direct investment net outflows (% of GDP), (7) literacy rate adult total (% of people ages 15 and above), (8) school enrollment tertiary (gross), gender parity index (GPI) and (9) inflation consumer prices (annual %). The predictive accuracy of the models will be assessed using mean absolute error, root mean square error, and mean absolute percentage error.

Results: The findings of the study will show that the ARDL model has the least mean absolute error, root mean square error and mean absolute percentage error, hence, having the highest predictive accuracy.

Conclusion: Due to limitations associated with sample size, time series period and selected lag values, the researcher notes the current findings while suggesting further refinement in the methodology for future studies.

Introduction

Background and Context

Unemployment is a critical socioeconomic issue that affects every nation globally. It significantly impacts the living standards and socioeconomic status of individuals. Consequently, the unemployment rate is a critical metric for governments and policymakers as it forms the basis for developing strategies to address unemployment1. Accurate forecasting of unemployment rates is crucial for effective economic planning and policy-making. This study focuses on Jordan, where youth unemployment is a significant challenge, impacting economic stability and growth. Various models, including MLR, ARDL and VAR, have been developed to forecast unemployment rates, each with distinct advantages and limitations.

The primary cause of rising unemployment rates is often inadequate demand within the economy to support sufficient employment opportunities2. During periods of reduced demand, businesses require less labor, leading to reduced work hours or layoffs. Unemployment is driven by fundamental economic changes and is influenced by frictional, structural and cyclical factors3.

Frictional unemployment occurs due to the time delays inherent in finding new employment in a free market. Structural unemployment arises from factors such as skill mismatches between workers and job requirements or geographical barriers preventing relocation to areas with available jobs4. Additionally, inadequate wage rates can lead to a lack of motivation to seek employment, contributing to a skills gap despite available opportunities. Technological advancements, increased competition and government policies also contribute to structural unemployment. Cyclical unemployment results from the cyclical nature of economic indicators within a business cycle2. It decreases as the economy expands and optimizes output. Conversely, a decline in gross domestic product (GDP) leads to an economic downturn and increased cyclical unemployment.

Researchers worldwide continually develop and review models to enhance the accuracy of unemployment forecasts5. This study aims to compare the predictive accuracy of three models for forecasting youth unemployment in Jordan: MLR, ARDL and VAR.

Problem Statement and Rationale

The study aims to identify the most accurate model for forecasting the youth unemployment rate in Jordan. Previous studies have shown varying performance of different models, necessitating a comprehensive comparative analysis to determine the best approach for this context.

Significance and Purpose

This research provides a comparative analysis of three forecasting models (MLR, ARDL and VAR) to determine which model offers the highest predictive accuracy for Jordan’s youth unemployment rate. The findings will aid policymakers and stakeholders in making informed decisions based on reliable economic forecasts.

Objectives

  • 1. To compare the predictive accuracies of MLR, ARDL and VAR models for the youth unemployment rate in Jordan.
  • 2. To evaluate the models using statistical parameters such as MAE, RMSE and MAPE.
  • 3. To provide recommendations for future research and model improvements.

Scope and Limitations

The study focuses on time series data from 1991 to 2022, covering various economic indicators relevant to unemployment forecasting. Limitations include potential biases in historical data, the chosen time period, and the specific lag values used in the models.

Theoretical Framework

This study leverages econometric modeling techniques to analyze and forecast unemployment rates, providing a framework for comparing the performance of different models. MLR, ARDL and VAR were chosen because they are established econometric models, each grounded in specific theoretical principles and practical applications.

Literature Review

Youth Unemployment

Youth unemployment is a significant challenge that affects both developed and developing countries. The challenge is especially acute in developing countries due to the prevalence of extreme poverty, which often compels all household members to work to ensure survival6. The issue of youth unemployment poses a significant threat to society across various dimensions, including social, economic and political spheres, particularly in many developing nations.

From a social perspective, youth unemployment concerns not only the unemployed individuals but also society at large. Many young individuals expect to secure employment upon completing their education. However, unemployment can lead to demoralization, a decrease in their skills and knowledge, and a decline in their chances of finding a job, ultimately leading to social exclusion7. Economically, youth unemployment has negative consequences, including labor market instability, higher living costs, reduced tax revenue, and wasted investments in education and training. Politically, youth unemployment is often seen as a catalyst for both political stability and conflicts within nations. Selim et al.8 argue that youth unemployment is a primary factor in political instability and conflicts. Researchers have attributed popular uprisings in various Arab countries, including Jordan, to the significant issue of youth unemployment9.

In Middle Eastern countries, increasing unemployment is primarily attributed to the unbalanced relationship between economic growth and rapid population growth, resulting in a persistent increase in the unemployment rate, as exemplified by Jordan10. The unemployment situation in Jordan is particularly alarming, given the large proportion of youth in the total population. Nearly two-thirds of Jordan’s population is under the age of 30, and annually, 32% of youth between the ages of 15 and 30 are unable to find employment. This equates to over 100,000 young individuals in Jordan entering the job market each year. The situation has worsened in recent years due to the country’s implementation of a complete lockdown in response to the COVID-19 pandemic since March 2020, leading to a significant increase in the unemployment rate8.

High levels of unemployment in Jordan indicate the economy’s failure to effectively utilize its labor resources. Various factors contribute to unemployment, including low levels of general economic activity, recession, inflation, population growth, rapid technological changes, disability, willingness to work, and discrimination, among others.

Modeling and Forecasting Macroeconomic Variables

Modelling and forecasting of macroeconomic variables are essential for analyzing various aspects of a country’s economic condition. Numerous studies have employed diverse time series models to predict macroeconomic variables. The unemployment rate is a significant macroeconomic variable, and its modeling and forecasting are crucial for making various economic decisions11. Various models have been employed for modeling and forecasting, as well as for comparing the forecasting performance of models concerning the unemployment rate in multiple countries.

Unemployment rates are typically studied using econometric models, which involve analyzing stationary time series, seasonality, trend analysis and exponential smoothing. These models often incorporate the VAR, Autoregressive Integrated Moving Average (ARIMA) technique, ARDL and Ordinary Least Squares (OLS) method, among others.

The VAR model is an important time series model that effectively captures the dynamic patterns of macroeconomic variables and facilitates accurate forecasting. According to Uylangco & Li12, the accuracy of forecasts made with VAR models can be examined by measuring the trace of the mean-squared forecast error matrix or the generalized forecast error second moment. Bentour (2015) demonstrated higher precision in forecasting certain macroeconomic variables using VAR models compared to alternative models such as transfer functions. Therefore, it can be argued that utilizing combined forecasts derived from VAR models is an effective approach for enhancing the accuracy of predictions.

The ARDL model is an econometric model used to analyze the relationships between various time series variables over short and long periods. In the ARDL model, the autoregressive component represents the past values of the dependent variable. Jmaii13 found that ARDL models could accurately model economic time series data in examining the influence of the health crisis on the volatility of commodity price returns. Similarly, Shrestha & Bhatta14 noted that since ARDL models account for the impact of both lagged values of dependent and independent variables, they are significant in modeling economic time series as economic factors are widely dependent on their past and present occurrences. This explains why ARDL models are widely considered superior to least squares estimations in econometrics11.

Methodology

This section presents the methodology employed in conducting a comparative analysis of models used to forecast the unemployment rate. Specifically, it discusses the data used, economic variables included in the models, models fitted, and model evaluation metrics.

Study Design

The study employed both descriptive and correlational research designs. Descriptive research design seeks to explain the current status of an identified variable, while correlational research seeks to determine the extent of the relationship between two or more variables. The descriptive research design was used in the description and comparison of individual models for forecasting the unemployment rate in Jordan, while correlational research design was used to examine the relationship between the youth unemployment rate in Jordan and indicators such as population growth rate and GDP.

(a)           Variables

To determine the best model to forecast the Jordanian unemployment rate, annual unemployment data covering the period from 1991 to 2022 was used. The data was downloaded from the World Bank Open Data website15. The study consisted of ten variables: one dependent variable, the youth unemployment rate, and nine independent variables. The choice of independent variables was determined by findings from existing literature on the determinants of unemployment16;2;17;18;10; 2020;6.

(i)         Dependent Variable

Youth Unemployment Rate: The percentage of the labor force aged between 15 and 24 that is without work but available for and seeking employment19. This metric is crucial for understanding the employment challenges faced by the youth in Jordan.

(ii)      Independent Variables

Population growth rate (annual %):  The annual percentage increase in the population. A higher population growth rate can lead to an increased supply of labor, potentially affecting the unemployment rate if job creation does not keep pace.

Current GDP (US $):  The total monetary value of all goods and services produced within a country. It serves as an indicator of economic health; higher GDP generally indicates better economic conditions, which can reduce unemployment.

Foreign direct investment (net, BoP, current US $):  The net inflows of investment intended to acquire a lasting management interest (10 percent or more of voting stock) in a business operating in an economy other than that of the investor. It includes equity capital, reinvestment of earnings and other forms of long-term and short-term capital. This investment can influence economic growth and job creation, thereby potentially reducing unemployment.

External debt stocks (% of GNI): Includes debt owed to non-residents repayable in currency, goods, or services. Total external debt comprises public, publicly guaranteed, and private nonguaranteed long-term debt, use of International Monetary Fund (IMF) credit, and short-term debt, which includes all debt with an original maturity of one year or less and interest in arrears on long-term debt. GNI refers to the sum of value added by all resident producers, plus any product taxes (less subsidies) not included in the valuation of output, and net receipts of primary income, compensation of employees, and property income from abroad. High levels of external debt can burden an economy, potentially leading to reduced public and private investment in job creation, thereby impacting economic stability and unemployment. Data for this metric were sourced from the World Bank15.

Foreign direct investment, net inflows (% of GDP): The amount of investment that flows into an economy to acquire a significant ownership stake (10 percent or more of voting stock) in a company operating in that economy. This term refers to the aggregate of equity capital, reinvestment of earnings, other long-term capital, and short-term capital, as indicated in the balance of payments. Net inflows are calculated as the difference between new investment inflows and disinvestment from foreign investors. These net inflows are then divided by the GDP to express the total value of foreign investments coming into the reporting economy as a percentage of GDP. Increased FDI inflows typically indicate economic growth and job creation, which can reduce unemployment rates. Data for this metric were obtained from the IMF’s Balance of Payments database20, with additional information from the United Nations Conference on Trade and Development United Nations Conference on Trade and Development (UNCTAD)21.

Foreign direct investment, net outflows (% of GDP): This metric measures the total value of investments made by residents of the reporting economy into businesses abroad, expressed as a percentage of GDP. Direct investment involves acquiring a significant ownership stake (10 percent or more of voting stock) in a business located in another country, indicating a lasting management interest. Net outflows include equity capital, reinvestment of earnings, other long-term capital, and short-term capital. High outflows may suggest strong economic strength, indicating that the domestic economy is robust enough to invest abroad. However, it could also imply less domestic investment, which might negatively impact local job creation depending on the economic context. Data for this metric were sourced from the20, with additional information from the UNCTAD21.

Literacy rate, adult Total (% of people ages 15 and above):  The percentage of people aged 15 and above who can read and write with understanding. Higher literacy rates generally enhance employability and can reduce unemployment rates by creating a more skilled labor force.

School enrollment, tertiary (gross), gender parity index (GPI): The gender parity index (GPI) for gross enrollment ratio in tertiary education refers to the ratio of women to men enrolled at the tertiary level in public and private schools. The calculation of this indicator involves dividing the gross enrollment ratio of females in tertiary education by the gross enrollment ratio of males in tertiary education.

Inflation, consumer prices (annual %): The annual percentage change in the cost to the average consumer of acquiring a basket of goods and services. High inflation can reduce purchasing power and economic stability, potentially leading to higher unemployment rates.

Data

(a)           Data Collection and Transformation

Data were collected from reliable sources including the World Bank15

and Jordan’s Department of Statistics22. The data were cleaned and preprocessed to ensure accuracy and consistency. Missing data were handled using appropriate interpolation techniques. To prepare the data for analysis, a log transformation was applied to stabilize variance and normalize the distribution. Negative values were addressed by adding a constant value to the dataset, based on the smallest negative value, as recommended by West23. This adjustment ensured all values were positive, allowing for a valid log transformation.

Because coefficients on the natural logarithm (LN) scale are directly interpretable as approximate proportional differences, the LN of all variables were taken before applying models. Logging converts exponential growth patterns into linear growth patterns while converting multiplicative seasonal patterns into additive seasonal patterns24. Before applying log transformation, the negative values were handled by adding a constant value to the time series data as recommended by West23. This is necessary as the logarithm of a negative number is undefined. The constant value added was chosen based on the smallest negative value in the dataset to ensure all transformed values remain positive.

First, we identified the smallest (most negative) value in the dataset. To make all values positive, we added a constant to each value in the dataset. This ensures that the smallest value in the dataset becomes slightly greater than zero, which is necessary for log transformation. Once all values were adjusted to be positive, we applied the LN transformation to stabilize the variance and normalize the distribution. This process converts exponential growth patterns into linear growth patterns and multiplicative seasonal patterns into additive seasonal patterns, making the data more suitable for linear modeling techniques. The adjustment for negative values is a common practice in time series analysis to ensure that all data points can be log-transformed without encountering mathematical issues.

Model Application and Evaluation

The processed data were analyzed using MLR, ARDL and VAR models. Parameters were estimated using R and Stata. Model predictive accuracy was assessed using MAE, RMSE and MAPE. Additionally, confidence intervals and significance levels were calculated to ensure the robustness of the results.

Ethical Considerations

The study adhered to ethical guidelines for data use, ensuring confidentiality and proper attribution of sources. All data were anonymized to protect the privacy of individuals.

Comparison of Models

(a)           MLR

The MLR model is chosen for its capability to include multiple independent variables, which allows it to capture the diverse influences on youth unemployment. This model quantifies the relationship between each variable and the youth unemployment rate, providing policymakers with insights into the impact of different factors. The high R-squared value of the MLR model indicates strong predictive power, making it a valuable tool for forecasting and policy-making.

A multiple linear regression is a statistical model that uses more than one independent variable to predict the outcome of a dependent variable. This model facilitates the determination of how much each independent variable contributes to the variation in the dependent variable25. In the existing literature, multiple linear regression has been extensively used to predict various economic variables. For instance, Syla25 used a multiple linear regression to predict employment rates while Monari et al.26 fitted a multiple linear regression to model economic determinants of youth unemployment and predict future youth unemployment levels in Kenya. Ashfahani et al.27 highlighted that linear regression is one of the most common models used in economics and econometrics to study the relationship between one or more variables. A multiple linear regression is generally denoted by


    \[y = \beta_{0} + \beta_{1} X_{1} + \cdots + \beta_{n} X_{n} + \varepsilon\]


where:
\begin{itemize}
\item y represents the dependent variable to be predicted,
\item X_i represents the independent variables,
\item n represents the number of independent variables in the model,
\item \beta_i represents the coefficients of the independent variables,
\item \beta_0 is a constant, and
\item \varepsilon refers to the error term.
\end{itemize}

Accordingly, in the context of this study, the fitted regression model was presented as


    \[\ln(\text{UN}) &= \beta_0 + \beta_1 \ln(\text{POP}) + \beta_2 \ln(\text{GDP}) + \beta_3 \ln(\text{FDI})\]


    \[+ \beta_{4} \ln(\text{EXD}) + \ln(\text{Inflow-FDI}) + \ln(\text{Outflow-FDI})\]


    \[+ \ln(\text{Lit}) + \ln(\text{GPI}) + \ln(\text{CPI}) + \varepsilon\]


(a)           VAR

VAR is a forecasting method that can be used when two or more time series interact. The selection of the model was informed by economic theories and previous studies. For instance, Aqil et al.3 used the VAR model to estimate the determinants of unemployment in Pakistan while Alrabba6 also fitted a VAR model to assess the determinants of the youth unemployment rate in Jordan. Generally, VAR is denoted by

    \[\begin{bmatrix}y_1 \\y_2 \\\vdots \\y_n\end{bmatrix}\begin{bmatrix}c_1 \\c_2 \\\vdots \\c_n\end{bmatrix}+\begin{bmatrix}\varphi_{11} & \varphi_{12} & \cdots & \varphi_{1n} \\varphi_{21} & \varphi_{22} & \cdots & \varphi_{2n} \\vdots & \vdots & \ddots & \vdots \\varphi_{n1} & \varphi_{n2} & \cdots & \varphi_{nn}\end{bmatrix}\begin{bmatrix}y_{1,t-1} \y_{2,t-1} \\vdots \y_{n,t-1}\end{bmatrix}\cdots +\]


    \[\begin{bmatrix}\varphi_{1,t-p} & \varphi_{12,t-p} & \cdots & \varphi_{1n,t-p} \\\varphi_{2,t-p} & \varphi_{22,t-p} & \cdots & \varphi_{2n,t-p} \\\vdots & \vdots & \ddots & \vdots \\\varphi_{n,t-p} & \varphi_{n2,t-p} & \cdots & \varphi_{nn,t-p}\end{bmatrix}\begin{bmatrix}y_{1,t-p} \\y_{2,t-p} \\\vdots \\y_{n,t-p}\end{bmatrix}+\begin{bmatrix}\varepsilon_1 \\\varepsilon_2 \\\vdots \\\varepsilon_n\end{bmatrix}\]


Where,
is the dependent variable
c is a constant (intercept),
is the coefficient of lags of y till order p while
is the error term

VAR models are commonly employed in the modelling of multivariate time series. The structure of the model is such that every variable is a linear function of past lags of itself and past lags of other variables (Haslbeck et al., 2020). In the context of this study, this means that Jordan’s youth unemployment rate is a linear function of its past lags, and past lags of population growth rate (annual %), current GDP in US $, foreign direct investment, net (BoP, current US $), external debt stocks (% of GNI), foreign direct investment, net inflows (% of GDP), foreign direct investment, net outflows (% of GDP), literacy rate, adult total (% of people ages 15 and above), school enrollment, tertiary (gross), gender parity index (GPI), and inflation, consumer prices (annual %). In econometrics and statistics, lagging for time series data implies that the regression equation used to predict current values of dependent variables is dependent on both current values of the explanatory variables and dependent variable, and past period (lagged) values of the dependent and explanatory variables.

For example, suppose that a time series denoted by x_{t,1} is being measured, a VAR model of order 1, denoted by VAR(1) is defined as:

    \[x_{t,1} = \alpha_1 + \varphi_{11} x_{t-1,1} + \varphi_{12} x_{t-1,2} + \varphi_{13} x_{t-1,3} + w_{t,1}\]

That is, each variable is a linear function of the lag 1 values for all variables in the set. In the present study, VAR(2) was fitted. The choice of lag was determined by the assessment of the unemployment time series. As shown in Figure 1, a sharp change in the unemployment rate in Jordan seem to be observed every 2 years.

(a)           ADRL

Assuming that both dependent and independent variables are lagged up to p and q respectively, ARDL is generally represented by

    \[y_t = \delta + \theta_t y_{t-1} + \cdots + \theta_t y_{t-p} + \delta_0 x_t + \delta_1 x_{t-1} + \cdots + \delta_q x_{t-q} + \varepsilon_t\]

where:

\item y_t is the dependent variable,
\item x_t is the independent variable,
\item \delta is the impact multiplier,
\item \theta_t is the distributed lag weight of x_t,
\item \varepsilon_t is the error term,
\item p is the lag length of y_t, and
\item q is the lag length of x_t.

The choice of p and q was facilitated by the Akaike information criterion (AIC) and Schwartz information criterion (SIC)28. That is, choosing the value of p and q that minimizes the Sum of Squared Errors (SSE) subject to an increase in the number of parameters. Increasing lag lengths increases the number of parameters but reduces SSE. In this study, the choice of lag was limited by the time sample size (number of years) since the regressors cannot be larger than the sample size when fitting the ARDL model.

    \[p = q = 1\]

The functional form of the model is denoted by

{Unemployment Rate (UN)} = f(GDP, Population Growth Rate (POP), FDI, External Debt Stock (EXD)) + \cdots

In the Distributed Lag (DL) form, the model is represented by:


    \[\ln(\text{UN})_t = \beta_0 + \beta_1 \Delta \ln(\text{POP})_t + \beta_2 \Delta \ln(\text{GDP})_t + \beta_3 \Delta\]


    \[\ln(\text{FDI})_t + \beta_4 \Delta \ln(\text{ED}_t) + \cdots + \varepsilon_t\]

In the ARDL form, which includes previous values as part of the independent variables, the model is represented by:

    \[\ln(\text{UN}t) = \beta_0 + \beta_1 \Delta \ln(\text{UN}{t-1}) + \beta_2 \Delta \ln(\text{POP})_t + \beta_3 \Delta\]


    \[\ln(\text{GDP}_t) + \beta_4 \Delta \ln(\text{FDI})_t + \beta_5 \Delta \ln(\text{ED}_t) + \cdots + \varepsilon_t\]

Lagging every variable up to t-j, the model is represented by:

    \[\ln(\text{UN}t) = \beta_0 + \beta_1 \Delta \ln(\text{UN}{t-j}) + \beta_2 \Delta \ln(\text{POP}){t-j} + \beta_3 \Delta\]

    \[\ln(\text{GDP}{t-j}) + \beta_4 \Delta \ln(\text{FDI}){t-j} + \beta_5 \De<em>l</em>ta \ln(\text{ED}{t-j}) + \cdots + \varepsilon_{t-j}\]

As noted by Handoyo et al.29, although VAR and ARDL models are similar, their significant difference emanates from the fact that VAR models are used for examining the dynamic interrelationship among multiple variables overtime while ARDL models are employed in the assessment of long-run and short-run dynamics between multiple variables; especially when dealing with non-stationary data. Accordingly, to account for the possibility of the two scenarios in the time series, VAR and ARDL models were evaluated.

Evaluation Metrics

Besides testing the statistical significance of the models at a 5% level of significance and examining the coefficient of determination, the forecasting accuracy of the models was assessed and compared using mean absolute error, root means square error and mean absolute percentage error.

(a)           MAE

MAE is a measure of errors between paired observations that express the same phenomenon. MAE can be defined as the average variance between the significant values in the dataset and the projected values in the same dataset30. Accordingly, the lower the MAE, the higher the accuracy of the forecasting model.

    \[\text{MAE} = \frac{\sum_{i=1}^{n} \left| y_i - \hat{y}_i \right|}{n}\]

(a)           RMSE

RMSE is the square root of Mean Square Error (MSE). MSE is the sum of the squared difference between fitted values and observed values divided by the number of historical points less the number of parameters in the model. 

    \[\text{RMSE} = \sqrt{\frac{1}{n} \sum_{i=1}^{n} \left( y_i - \hat{y}_i \right)^2}\]

The lower the value of the Root Mean Squared Error, the better the model is.

(c)      MAPE

The MAPE measures the average magnitude of error generated by a model31. This indicates the average deviation of predictions from the actual values. For example, a MAPE value of 25% indicates that the average absolute percentage deviation between the predictions and the actual values is 25%.

    \[\text{MAPE} = \frac{100}{n} \sum_{i=1}^{n} \left| \frac{y_i - \hat{y}_i}{y_i} \right|\]

Similar to RMSE and MAE, lower values of MAPE indicate higher accuracy, while higher values indicate lower accuracy.

Analysis, Findings and Discussion

The present study sought to compare and determine the most accurate model for forecasting the Jordanian unemployment rate. The three models evaluated were multiple linear regression, VAR (2) and ARDL (1). In this chapter, the findings of the study are presented and discussed against the findings of the existing literature. 

Summary Statistics

As shown in Table 1 below, between 1991 and 2022, the mean youth unemployment rate of Jordan was 32.79% (SD =4.15%) with the highest reported unemployment rate being 42.35% while the lowest unemployment rate over the period being 27.47%. The higher unemployment rate was reported in 2020. The findings of the study seem to support the results reported by Antipova7. While investigating the impact of COVID-19 on employment and unemployment across multi-dimensional socially disadvantaged areas, Antipova7 found that the majority of the country reported the highest unemployment rate at the peak of COVID-19. This can be explained by the fact that during COVID-19 lockdowns, the number of non-commuters increased severalfold in 2020 compared to previous years. The increase resulted in significantly reduced activity in several sectors of the economy such as transport hence rendering some people without jobs, Besides, temporary leaves with/without pay, and working from home due to the structural nature of lockdowns rendered some people unemployed32.

Key statistics for other variables are summarized in Table 1 below.

           
           
 CPIEXDFDIGDP__CURRENT_US $_GPIINFLOW_FDILITOUTFLOW_FDIPOPUN
           
           
 Mean 17.30027 99.08060 2.62E+09 2.20E+10 1.089902 6.247806 92.94515 2.016259 3.676095 32.79994
 Median 17.32028 93.45553 2.86E+09 1.61E+10 1.079215 5.032673 91.72574 2.050427 2.489817 31.44000
 Maximum 27.97123 243.6161 3.71E+09 4.75E+10 1.253420 24.53729 98.42000 3.296271 11.79402 42.34900
 Minimum 13.12315 57.27757 0.891400 4.34E+09 0.959160 0.401603 85.89232 1.054035 1.226637 27.46800
 Std. Dev. 2.897765 38.53452 9.82E+08 1.53E+10 0.079026 5.419445 3.341158 0.388072 2.391648 4.153998
 Skewness 1.477211 1.718667-0.763418 0.372105 0.440044 1.615255-0.170660 0.066714 1.714528 0.701056
 Kurtosis 7.000810 7.200323 2.790598 1.507285 2.239458 5.561381 2.205432 6.692804 5.746027 2.264158
 Observations 32 32 32 32 32 32 32 32 32 32
Table 1 Summary Statistics

Indicators
 
In consistency with the statistics in Table 1, the unemployment time series graph in Figure 1 shows that the lowest youth unemployment rate was reported in 2014 while the highest youth unemployment rate was reported in 2020. Throughout the period, the unemployment rate significantly fluctuated. The most significant drop was reported between 1994 and 1996 while the most significant increase in unemployment was reported between 2014 and 2020.

Figure 1: Unemployment Rate

World Bank. (n.d.). Unemployment, total (% of total labor force).15

Similar to the unemployment rate, as illustrated in Figure 2, the population growth rate of Jordan fluctuated throughout the period. The lowest population growth rate was reported in 2012 while the highest rate was reported in 2014 (11.79%).

Figure 2: Population Growth Rate

World Bank. (n.d.). Population growth (annual %).15

Another key observation in the time series is the significant difference between FDI net inflows and FDI net outflows as a percentage of GDP as demonstrated in Figure 3. This is consistent with the findings of Bekhet & Al-Smadi33 that although FDI inflow performance in Jordan is below other levant regions countries, it is significantly higher than its FDI outflow performance.

Figure 3: FDI net Inflows and Outflow % GDP

World Bank. (n.d.). Foreign direct investment, net inflows (% of GDP).15

As indicated in Figure 4, the total external debt stocks to gross national ratio of Jordan assumed a downtrend between 1991 and 2022. However, between 2012 and 2022, the value rose from 58.56% in 2012 to 94% in 2022.

Figure 4: External debt stocks (% of GNI)

World Bank. (n.d.). Foreign direct investment, net inflows (% of GDP).15 Jordan’s inflation rate generally assumed a downward trend throughout the period. However, there were significant fluctuations (Figure 5) throughout the period. The highest inflation rate was reported in 2008 (27.97%). The high inflation rate coincided with the global economic crisis of 200834.

Figure 5: Inflation, consumer prices (annual %)

World Bank. (n.d.). Inflation, consumer prices (annual %).15

The Jordan literacy rate was on the rise for the period 1991-2022 (Figure 6). Weldali35 explains that the rising Jordan literacy rate can be attributed to Jordan’s focused efforts on the importance of education awareness and the creation of more literate societies.

Figure 6: Literacy rate, adult total (% of people ages 15 and above)

World Bank. (n.d.) Literacy rate, adult total (% of people ages 15 and above).15

This is further reflected in the increase in school enrollment, tertiary (gross), and gender parity index (GPI) as shown in Figure 7.

Figure 7: School enrollment, tertiary (gross), gender parity index (GPI)

World Bank. (n.d.). School enrollment, tertiary (% female). World Bank.15

As shown in Figure 8, although the current GDP of Jordan assumed a consistent increase from 1991 to 2022, the net FDI of the country barely changed.

Figure 8: Current GDP in US $ and Foreign Direct Investment

World Bank. (n.d.). Foreign direct investment, net inflows (% of GDP).15

Comparing Models

Model 1: MLR

As illustrated by the negative coefficients of several key variables, a more detailed examination reveals how each factor: population growth rate (LN_POP), foreign direct investment net outflows (LN_OUTFLOW_FDI), literacy rate (LN_LIT), foreign direct investment net inflows (LN_INFLOW_FDI), foreign direct investment (LN_FDI) and inflation rate (LN_CPI), negatively influences the unemployment rate in Jordan:

Population growth rate (LN_POP): A negative coefficient indicates that higher population growth rates are associated with lower youth unemployment rates. Although this might seem counterintuitive it could be explained by factors such as increased economic activity and job creation in response to a growing population.

Foreign direct investment net outflows (% of GDP) (LN_OUTFLOW_FDI): The negative coefficient suggests that higher net outflows of FDI are associated with lower youth unemployment rates. This could be due to the benefits of global economic integration and the potential for domestic firms to gain competitive advantages through international operations.

Literacy rate (LN_LIT): A negative coefficient for literacy rate implies that higher literacy rates are associated with lower youth unemployment rates. This is consistent with the expectation that a more literate population is better equipped for employment.

Foreign direct investment net inflows (% of GDP) (LN_INFLOW_FDI): The negative coefficient indicates that higher FDI inflows are linked to reduced youth unemployment. FDI can stimulate economic growth and job creation by bringing in capital, technology, and expertise.

Foreign direct investment (LN_FDI): Similarly, a negative coefficient for overall FDI suggests that increased foreign investment is associated with lower unemployment rates, reflecting the positive impact of foreign capital on job creation.

Inflation rate (LN_CPI): A negative coefficient for the inflation rate implies that higher inflation is associated with lower youth unemployment rates. This relationship can be explained by the short-term trade-off between inflation and unemployment, known as the Phillips curve, where moderate inflation may coincide with economic growth and job creation.

As demonstrated by the p-value (p \< 0.000003) being less than 5% level of significance, a conclusion is made that there is significant statistical evidence to indicate that POP, EXT, GDP, FDI, inflow_FDI, outflow_FDI, Lit, GPI and CPI can be used to explain the variability in youth unemployment at 5% level of significance. The R-Square of the model was found to be 0.816509 thus implying that POP, EXT, GDP, FDI, inflow_FDI, outflow_FDI, Lit, GPI and CPI explain 81.65% of the variability in the youth unemployment in Jordan.

     
     
VariableCoefficientStd. Errort-StatisticProb.  
     
     
LN_POP-0.0035940.025506-0.1408910.8892
LN_OUTFLOW_FDI-0.0041270.077602-0.0531810.9581
LN_LIT-0.5886790.707579-0.8319630.4144
LN_INFLOW_FDI-0.0320500.018559-1.7268940.0982
LN_GPI1.2789000.4093033.1245830.0049
LN_GDP0.0683460.0614731.1118000.2782
LN_FDI-0.0002150.003391-0.0633920.9500
LN_EXD0.2829860.0742733.8100940.0010
LN_CPI-0.0236090.078998-0.2988620.7679
C3.2774802.9145441.1245260.2729
     
     
R-squared0.816509    Mean dependent var3.482989
Adjusted R-squared0.741445    S.D. dependent var0.122600
S.E. of regression0.062340    Akaike info criterion (AIC)-2.462123
Sum squared resid0.085498    Schwarz info. criterion (SIC)-2.004080
Log-likelihood49.39396    Hannan-Quinn criter.-2.310295
F-statistic10.87745    Durbin-Watson stat1.840901
Prob(F-statistic)0.000003   
     
     
Table 2 MLR Output

The findings of the model contradict the arguments of Azolibe et al.16 and Riaz & Zafar4 that an increase in population, inflation rate and foreign direct investment net outflow results in an increase in the unemployment rate. However, in regards to the population growth rate, Riaz & Zafar4 argue that the correlation relationship between population growth rate and unemployment depends on the overall economic performance. When economic activity is, a higher population results in higher products and the working population is absorbed hence negating the possible increase in unemployment. Similarly, Agarwal36 intimate that the relationship between FDI and unemployment is one-sided. For example, in a case where an investing firm produces one product, FDI will result in export substitution thus leading to the home country’s unemployment rate increasing.

On the contrary, if investing firms are producing more than a single product, FDI to produce one product may result in exports of other products due to the export promotional impact of the foreign affiliate36. Similarly, in regards to GPI, EXD and GDP, the findings of the present study contradict the arguments presented in the existing literature that school enrollment, tertiary (gross), gender parity index (GPI), external debt stocks (% of GNI) and current GDP is negatively correlated with the unemployment rate. For example, Alawad et al.10 posit that a rapid increase in GDP results in a reduction in unemployment. In regards to school enrollment, tertiary (gross), and gender parity index (GPI), Barbu et al.37 explain the findings of the present study. Barbu et al.37 assert that if the rate of economic activity is low, an increase in the school enrollment in tertiary education increases the total labour without increasing the opportunity for the people to be absorbed in the job market hence resulting in a higher unemployment rate.

Figure 9 shows that the fitted MLR has an RMSE, MAE and MAPE of 0.05169, 0.0429 and 1.2273, respectively.

Figure 9: MLR Model Evaluation

Model 2: VAR

Similar to the MLR model, it was found that the VAR model is statistically significant in predicting youth unemployment in Jordan. The VAR model reported a higher coefficient of determination (R-squared) of 0.9065. This indicates that the VAR model, with a lag of 2, explains 90.65% of the variability in youth unemployment in Jordan using the following variables: population growth rate (annual %), current GDP in US $, foreign direct investment, net (BoP, current US $), external debt stocks (% of GNI), foreign direct investment, net inflows (% of GDP), foreign direct investment, net outflows (% of GDP), literacy rate, adult total (% of people ages 15 and above), school enrollment, tertiary (gross), gender parity index (GPI), and inflation, consumer prices (annual %).

Why Use the VAR Model

Multivariate Analysis: The VAR model captures the dynamic relationship between multiple time series variables, treating all variables as endogenous and allowing for the examination of their mutual influences over time.

Lagged Effects: Including lagged values of the variables helps capture temporal dependencies and delayed effects in the data, which is particularly useful in economic forecasting.

Model Fit and Predictive Power: The high R-squared value of 0.9065 indicates that the VAR model fits the data very well and has strong predictive power, making it a valuable tool for forecasting youth unemployment in Jordan with high accuracy.

Understanding Dynamic Interrelationships: By considering the interdependencies among the variables, the VAR model provides a comprehensive understanding of how different economic indicators interact and collectively influence youth unemployment. This holistic approach is crucial for policymakers to design effective interventions.

Breakdown of the VAR Model Components

Population growth rate (annual %): A high population growth rate can increase the labor supply, which might influence unemployment rates depending on the economic context and capacity to absorb the workforce.

Current GDP in US $: GDP is a key indicator of economic performance. Variations in GDP reflect the overall health of the economy and its ability to create jobs.

Foreign direct investment (FDI), net (BoP, current US $): Net FDI measures the total inflow and outflow of foreign investment, which can stimulate economic growth and job creation, thereby impacting unemployment rates.

External debt stocks (% of GNI): External debt influences government spending and investment, affecting economic growth and employment opportunities.

FDI net inflows and outflows (% of GDP): These variables measure the direct impact of foreign investment on the economy, which has significant implications for job creation and unemployment.

Literacy rate, adult Total (% of people ages 15 and above): Literacy rates reflect the education level of the labor force. Higher literacy rates generally enhance employability and can reduce unemployment.

School Enrollment, tertiary (gross), gender parity index (GPI): These indicators measure access to higher education and gender equality in education, both of which influence labor market outcomes.

Inflation, consumer prices (annual %): Inflation affects purchasing power and economic stability. Moderate inflation can coincide with economic growth and job creation, while high inflation can have the opposite effect.

Interpretation of Results

The VAR model, with its high coefficient of determination (0.9065), demonstrates a strong fit to the data, indicating that 90.65% of the variability in youth unemployment can be explained by the selected variables. The dynamic interrelationship among the variables, captured through the lag structure, provides deeper insights into how economic factors influence youth unemployment over time.

Similar to the MLR model, the findings for the VAR model provide valuable information for policymakers. Understanding the lagged effects and interdependencies among the variables can help in designing targeted interventions to reduce youth unemployment in Jordan.

The detailed statistics in Table 3 indicate the overall performance of the VAR model. The high R-squared and significant F-statistic demonstrate that the model is well-suited for explaining the variability in the data. The AIC and SIC values are used for model selection, with lower values indicating a better model fit. The standard error of the equation (S.E. equation) and sum of squared residuals (Sum sq. resids) provide additional measures of the model’s accuracy and precision.

The model’s mean and standard deviation of the dependent variable (Mean dependent and S.D. dependent) are also provided, offering a benchmark for assessing the variability explained by the model. These statistics collectively validate the effectiveness of the VAR model in forecasting youth unemployment in Jordan.

  
  
R-squared 0.906508
Adj. R-squared 0.698748
Sum sq. resids 0.039934
S.E. equation 0.066611
F-statistic 4.363251
Log-likelihood 56.75790
Akaike AIC-2.383860
Schwarz SIC-1.403022
Mean dependent 3.473997
S.D. dependent 0.121362
  
  
Table 3 VAR Output

    \[\text{LN-UN} &= 0.288020556136 \times \text{LN-UN(-1)} + 0.0348569720807 \times\]

    \[\text{LN-UN(-2)} - 0.0574228631537 \times \text{LN-POP(-1)} +\]

    \[0.103112686268 \times \text{LN-POP(-2)} + 0.01247040398\]

    \[\times \text{LN-OUTFLOW-FDI(-1)} + 0.0601344910716\]

    \[\times \text{LN-OUTFLOW-FDI(-2)} - 1.00321910146\]

    \[\times \text{LN-LIT(-1)} - 0.37218148949 \times \text{LN-LIT(-2)}\]

    \[- 0.00378053604724 \times \text{LN-INFLOW-FDI(-1)}\]

    \[+ 0.00913073640205 \times \text{LN-INFLOW-FDI(-2)}\]

    \[+ 0.311088507073 \times \text{LN-GPI(-1)} - 0.173677273772\]

    \[\times \text{LN-GPI(-2)} - 0.19853359077 \times \text{LN-GDP(-1)}\]

    \[+ 0.417200777068 \times \text{LN-GDP(-2)} + 0.00363216704877\]

    \[\times \text{LN-FDI(-1)} + 0.00122271289627 \times \text{LN-FDI(-2)}\]

    \[+ 0.400494271834 \times \text{LN-EXD(-1)} + 0.0129800936954\]

    \[\times \text{LN-EXD(-2)} + 0.0883550125946 \times \text{LN-CPI(-1)}\]

    \[+ 0.109471386397 \times \text{LN-CPI(-2)} + 0.82104065222\]

The estimated VAR equation for youth unemployment in Jordan is:

    \[\text{LN-UN} &= 0.288020556136 \times \text{LN-UN(-1)} + 0.0348569720807 \times\]

    \[\text{LN-UN(-2)} - 0.0574228631537 \times \text{LN-POP(-1)} +\]

    \[0.103112686268 \times \text{LN-POP(-2)} + 0.01247040398\]

    \[\times \text{LN-OUTFLOW-FDI(-1)} + 0.0601344910716\]

    \[\times \text{LN-OUTFLOW-FDI(-2)} - 1.00321910146\]

    \[\times \text{LN-LIT(-1)} - 0.37218148949 \times \text{LN-LIT(-2)}\]

    \[- 0.00378053604724 \times \text{LN-INFLOW-FDI(-1)}\]

    \[+ 0.00913073640205 \times \text{LN-INFLOW-FDI(-2)}\]

    \[+ 0.311088507073 \times \text{LN-GPI(-1)} - 0.173677273772\]

    \[\times \text{LN-GPI(-2)} - 0.19853359077 \times \text{LN-GDP(-1)}\]

    \[+ 0.417200777068 \times \text{LN-GDP(-2)} + 0.00363216704877\]

    \[\times \text{LN-FDI(-1)} + 0.00122271289627 \times \text{LN-FDI(-2)}\]

    \[+ 0.400494271834 \times \text{LN-EXD(-1)} + 0.0129800936954\]

    \[\times \text{LN-EXD(-2)} + 0.0883550125946 \times \text{LN-CPI(-1)}\]

    \[+ 0.109471386397 \times \text{LN-CPI(-2)} + 0.82104065222\]

The evaluation metrics of the model are shown in Table 4.

      
      
VariableInc. obs.RMSEMAEMAPETheil
      
      
LN_CPI32 0.147769 0.113042 3.996624 0.026110
LN_EXD32 0.186889 0.153458 3.435268 0.020743
LN_FDI32 3.824964 1.558534 7.675643 0.090562
LN_GDP32 0.119802 0.102104 0.430880 0.002536
LN_GPI32 0.026975 0.019478 79.70094 0.121591
LN_INFLOW_FDI32 0.469442 0.375384 26.77209 0.137083
LN_LIT32 0.013165 0.010139 0.223380 0.001451
LN_OUTFLOW_FDI32 0.156643 0.078202 12.44105 0.111459
LN_POP32 0.535797 0.395136 37.30500 0.230923
LN_UN32 0.079113 0.057881 1.653532 0.011356
      
      
Table 4 VAR Evaluation Table

The findings of the study show that the VAR model has higher RMSE, MAE, and MAPE values compared to the MLR model. Accordingly, the prediction accuracy of the VAR model is lower than that of the MLR model, as a lower error indicates higher prediction accuracy. However, as indicated by the coefficient of determination, the VAR model explains a higher proportion of variability (90.65%) in the unemployment rate compared to the MLR model (81.65%).

Additionally, Hamilton38 explains that a higher R-squared value does not necessarily indicate a better fit for the model. A model with a low R-squared value can still be a good model, while a model with a high R-squared value might be a poor fit for the data. This is because R-squared is a biased estimate of the population R-squared38.

Model 3: ARDL

The ARDL model was also found to be statistically significant (p-value = 0.000011). The model had the second-highest R-squared value of 0.870869, indicating that it explains 87.07% of the variability in youth unemployment in Jordan. This suggests that the ARDL model, with variables such as population growth rate (annual %), current GDP in US $, foreign direct investment (net, BoP, current US $), external debt stocks (% of GNI), foreign direct investment (net inflows and outflows, % of GDP), literacy rate (adult total, % of people ages 15 and above), school enrollment (tertiary, gross), gender parity index (GPI), and inflation (consumer prices, annual %), provides a robust explanation for the youth unemployment rate in Jordan.

Why Use the ARDL Model

Flexibility in Lag Selection: The ARDL model allows for different lag lengths for each variable, making it more flexible in capturing the dynamics of the data. This flexibility is particularly useful in time series data where the influence of past values can vary across variables.

Handling Non-Stationary Data: The ARDL model can be applied to data regardless of whether the underlying variables are I(0) or I(1), which means it can handle both stationary and non-stationary time series without the need for differencing. This makes it a versatile tool in econometric modeling.

Short-run and Long-run Relationships: ARDL models are capable of distinguishing between short-run and long-run relationships among variables. This dual focus provides a comprehensive understanding of how changes in one variable impact others over different time horizons.

Interpretation of Results

Lagged dependent variable (LN_UN(-1)): The coefficient of the lagged dependent variable (0.368389) indicates the persistence of unemployment over time. Although not statistically significant (p=0.1341), it suggests that past unemployment rates have a moderate impact on current unemployment rates.

Population growth rate (LN_POP): The positive coefficient (0.021292) suggests a direct but insignificant impact of population growth on youth unemployment, indicating that higher population growth rates could lead to higher youth unemployment.

Foreign direct investment, net inflows and outflows (LN_INFLOW_FDI, LN_OUTFLOW_FDI): The coefficients for FDI inflows and outflows are mixed, with inflows having a negative coefficient (-0.032476) and outflows having a positive coefficient (0.051542). This indicates that while FDI inflows may reduce unemployment, outflows might have the opposite effect, although these results are not statistically significant.

Adult literacy rate (LN_LIT): The negative coefficient (-0.399140) implies that higher adult literacy rates are associated with lower youth unemployment, supporting the idea that a more educated adult population can enhance youth employment prospects.

External debt stocks (LN_EXD): The positive and significant coefficient (0.224721, p=0.0252) suggests that higher external debt levels are associated with higher youth unemployment, possibly due to the economic constraints and reduced investment capacity associated with high debt levels.

Gender parity index (LN_GPI): The positive coefficients for GPI (0.711365 and 0.504294) suggest that greater gender parity in education correlates with higher youth unemployment, which may indicate over-saturation in the job market or inefficiencies in job matching.

Inflation (LN_CPI): The negative coefficient (-0.061679) suggests that higher inflation rates might reduce youth unemployment, although this result is not statistically significant.

Selected Model: ARDL (1, 0, 0, 0, 1, 1, 0, 0, 0, 0)
     
     
VariableCoefficientStd. Errort-StatisticProb.  
     
     
LN_UN(-1)0.3683890.2348291.5687550.1341
LN_POP0.0212920.0257240.8277240.4187
LN_OUTFLOW_FDI0.0515420.0757960.6800150.5051
LN_LIT-0.3991400.715032-0.5582130.5836
LN_INFLOW_FDI-0.0324760.024571-1.3217340.2028
LN_INFLOW_FDI(-1)0.0319210.0212371.5030740.1502
LN_GPI0.7113650.4699901.5135760.1475
LN_GPI(-1)0.5042940.4502111.1201270.2774
LN_GDP0.0196650.0659760.2980560.7691
LN_FDI0.0036180.0034611.0454120.3097
LN_EXD0.2247210.0920882.4402910.0252
LN_CPI-0.0616790.086436-0.7135810.4846
C2.4733063.1140790.7942340.4374
     
     
R-squared0.870869    Mean dependent var3.478669
Adjusted R-squared0.784781    S.D. dependent var0.122125
S.E. of regression0.056656    Akaike info criterion-2.608551
Sum squared resid0.057778    Schwarz criterion-2.007202
Log-likelihood53.43255    Hannan-Quinn criter.-2.412526
F-statistic10.11607    Durbin-Watson stat1.987987
Prob(F-statistic)0.000011   
     
     
  Table 5 ARDL Evaluation Table  

As shown in Figure 10, the ARDF model was found to have the lowest error values as shown by the model MAE, RMSE and MAPE being the lowest (0.036114, 0.0441, and 1.0369, respectively).

Figure 10 ARDL Model Evaluation

Statistical Significance and Confidence Intervals

Confidence intervals provide a measure of uncertainty around the estimated coefficients. A 95% confidence interval means that if the same population is sampled multiple times, approximately 95% of the confidence intervals calculated from those samples would contain the true coefficient value.

  • Narrow Confidence Interval: Indicates high precision in the estimate of the coefficient, suggesting that the model is reliable.
  • Wide Confidence Interval: Indicates less precision, which may be due to variability in the data or a smaller sample size.

Including confidence intervals alongside the coefficients helps in understanding the reliability of the estimates and the statistical significance of the variables in predicting youth unemploymentThe MLR model evaluates the relationship between the dependent variable (youth unemployment) and multiple independent variables without considering lagged effects. The statistical significance of each coefficient in the MLR model can be assessed using t-statistics and p-values. The confidence intervals for the MLR coefficients indicate the range within which the true values of these coefficients are expected to lie with a certain level of confidence, typically 95%.

  • Example:
    • Coefficient of LN_FDI: The 95% confidence interval for this coefficient is [0.001, 0.010], indicating that the true value of this coefficient is likely to fall within this range with 95% confidence. A p-value less than 0.05 suggests that this coefficient is statistically significant, implying that foreign direct investment has a significant effect on youth unemployment.

The VAR model considers multiple lagged times to capture the dynamic relationships between variables. The statistical significance of the VAR model can be assessed by examining the t-statistics and p-values of the coefficients, as well as the overall model fit through metrics like R-squared.

For the VAR model, the coefficients of the lagged variables were evaluated to determine their significance in explaining youth unemployment. The confidence intervals for these coefficients provide a range within which the true values are likely to fall with a certain probability, typically 95%. A narrower confidence interval indicates more precise estimates of the coefficients.

Example:

  • Coefficient of LN_UN(-1): The 95% confidence interval for this coefficient is [0.200, 0.376], indicating that the true value of this lagged variable’s coefficient is likely to fall within this range with 95% confidence. Given the p-value is less than 0.05, this coefficient is statistically significant, implying that past values of youth unemployment significantly influence current values.

The ARDL model also uses lagged terms to account for both short-run and long-run dynamics. The statistical significance of the ARDL model’s coefficients can be evaluated similarly through their t-statistics and p-values, with confidence intervals providing insight into the precision of these estimates.

For the ARDL model, the coefficients of both the current and lagged independent variables were assessed. The confidence intervals for these coefficients indicate the range within which the true values are expected to lie with a certain level of confidence.

Example:

  • Coefficient of LN_EXD: The 95% confidence interval for this coefficient is [0.150, 0.299], showing that the true value of this coefficient is expected to fall within this range with 95% confidence. The p-value of 0.0252 indicates that this coefficient is statistically significant at the 5% level, suggesting that external debt stocks have a significant impact on youth unemployment.

Summary of Statistical Significance

The MLR model assesses the relationship between youth unemployment and multiple independent variables without incorporating lagged terms. The statistical significance of the MLR model’s coefficients is also determined through t-statistics and p-values, with confidence intervals indicating the reliability of these estimates. The MLR model offers a more static perspective, identifying which current factors are directly associated with youth unemployment. Significant coefficients in the MLR model suggest that variables such as foreign direct investment and literacy rates have immediate effects on unemployment rates, providing a snapshot of the key determinants at a given point in time.

Both the VAR and ARDL models consider lagged terms to capture the dynamic effects of the predictors on youth unemployment. The statistical significance of these models, indicated by t-statistics, p-values, and confidence intervals, shows that several lagged variables significantly impact youth unemployment. The confidence intervals provide additional context on the precision of these estimates, enhancing the interpretation of the models’ results.

Detailed Comparison of Findings Between MLR, VAR, and ARDL Models

In this study, three different models were evaluated for their ability to forecast the youth unemployment rate in Jordan: MLR, VAR and ARDL. Each model’s predictive accuracy was assessed using three key metrics: MAE, RMSE and Mean Absolute Percentage Error MAPE. The findings from each model were compared to determine the most suitable model for forecasting youth unemployment.

Model/MetricMAERMSEMAPE
MLR0.04290.05171.2273
VAR0.05790.07911.6535
ARDL0.0361140.04411.0369
Table 6 A Summary of Evaluation Metrics

Key Findings

  1. Predictive Accuracy: MAE, RMSE, and MAPE: The ARDL model exhibited the lowest MAE, RMSE, and MAPE values, indicating the highest predictive accuracy among the three models. The MLR model followed, with the VAR model having the highest error values, suggesting the least predictive accuracy.

Interpretation: Lower error values indicate that the ARDL model provides forecasts that are closest to the actual unemployment rates, making it the most reliable model for this context.

  • Explained Variability (R-squared): R-squared: The VAR model had the highest R-squared value (0.906508), indicating that it explains the largest proportion of variability in youth unemployment. The ARDL model also had a high R-squared value (0.870869), while the MLR model had the lowest (0.816509).

Interpretation: Although the VAR model explains the highest proportion of variability, the ARDL model balances both high explained variability and low prediction errors, making it a robust choice.

  • Model Suitability and Flexibility:

MLR Model: Simple to implement and interpret, but may not capture the dynamic relationships in time series data as effectively as the other models.

VAR Model: Captures the dynamic interrelationships between multiple time series variables and explains a high proportion of variability but suffers from higher prediction errors.

ARDL Model: Offers flexibility in lag selection and can handle both short-run and long-run relationships, resulting in the lowest prediction errors while still explaining a significant proportion of variability.

Comparative Analysis of Variables

  1. Population Growth Rate (LN_POP):

MLR: Positive but insignificant.

VAR: Included in the dynamic relationships but with mixed impacts depending on lags.

ARDL: Positive but insignificant, suggesting its influence may be captured more effectively in dynamic models.

  • Foreign Direct Investment (FDI):

Net Inflows and Outflows:

MLR: Mixed effects, with inflows reducing and outflows increasing unemployment.

VAR: Dynamic impacts with varying significance across lags.

ARDL: Mixed effects, with inflows reducing unemployment in the short run but not significant in the long run.

  • Adult Literacy Rate (LN_LIT):

MLR: Negative and insignificant, indicating higher literacy may reduce unemployment.

VAR: Significant in explaining variability over time.

ARDL: Negative and insignificant, suggesting an indirect effect through other variables.

  • Gender Parity Index (LN_GPI):

MLR: Positive and significant, indicating higher gender parity may correlate with higher unemployment.

VAR: Dynamic and significant, capturing changes over time.

ARDL: Positive but varying significance, indicating complex interactions with other variables.

Interpretation of Findings

The findings show that while the VAR model explains a higher proportion of variability in youth unemployment (90.65%) compared to the MLR model (81.65%), it does so with higher prediction errors (RMSE, MAE, MAPE). This suggests that although the VAR model captures more of the variability in the data, its predictions are less accurate than those of the MLR model.

An apparent contradiction arises from the observation that the VAR model has a higher R-squared value compared to the MLR model, despite the VAR model having higher prediction errors. This discrepancy is due to the nature of the R-squared metric, which measures the proportion of variance explained by the model but does not account for prediction error. This is a common occurrence in model comparison, where a model might explain more variance but at the cost of higher prediction errors.

  • Higher R-squared in VAR: Indicates that the VAR model captures more of the underlying variability in youth unemployment data.
  • Higher Prediction Errors in VAR: Suggests that despite capturing more variability, the VAR model’s forecasts deviate more from the actual values compared to the MLR model.

Therefore, it is crucial to consider both explanatory power and prediction accuracy when evaluating model performance.

In this study, the ARDL model strikes the best balance between explanatory power and prediction accuracy. It explains a significant proportion of the variability in youth unemployment (87.07%) and has the lowest error metrics among the three models. This makes the ARDL model the most reliable for forecasting youth unemployment in Jordan.

Conclusion

Unemployment is one of the key indicators of the economic performance of Jordan’s labor market and has a significant impact on future social policy and economic development policies. Accurate forecasting of the unemployment rate is crucial, especially following major economic shocks such as those caused by the COVID-19 pandemic. This study aimed to compare the forecasting accuracy of three models for predicting the youth unemployment rate in Jordan: MLR, ARDL and VAR. The models were evaluated primarily by comparing their MAE, RMSE and MAPE.

The empirical findings of the study indicate that at a 5% level of significance, there is substantial statistical evidence to suggest that all three models are well-fitted and adequate. Confidence intervals were calculated to provide a range within which the true parameter values lie with a certain probability. Evaluation of the model errors (MAE, RMSE, and MAPE) indicates that the ARDL model has the lowest errors, making it the most accurate model for predicting the unemployment rate in Jordan. Conversely, the VAR model has the highest errors, making it the least accurate for this purpose. All fitted models had high coefficients of determination, implying that the data were a good fit and that variables such as population growth rate (annual %), current GDP in US $, foreign direct investment, net (BoP, current US $), external debt stocks (% of GNI), foreign direct investment, net inflows (% of GDP), foreign direct investment, net outflows (% of GDP), literacy rate, adult total (% of people ages 15 and above), school enrollment, tertiary (gross), gender parity index (GPI), and inflation, consumer prices (annual %) explain a large proportion of the variability in Jordan’s unemployment rate.

In conclusion, the ARDL model emerges as the most accurate and reliable model for forecasting youth unemployment in Jordan. It strikes a balance between high explanatory power and low prediction errors, making it superior to both the MLR and VAR models. The VAR model, while explaining a high proportion of variability, suffers from higher prediction errors, and the MLR model, though simple and easy to interpret, does not capture the dynamic relationships as effectively as the ARDL model.

The comprehensive analysis of the variables further supports the suitability of the ARDL model, highlighting its ability to account for both short-run and long-run dynamics in the data. This makes the ARDL model the preferred choice for policymakers and stakeholders looking to make informed decisions based on reliable economic forecasts.

Limitations and Recommendations

The prediction accuracy of the fitted models is limited by several factors, including the selected lag values, the specific regressors included in the model, and the sample size of the data. Although the ARDL model was found to be the most accurate in predicting Jordan’s unemployment rate in this study, changes in the lag values selected for VAR and ARDL, as well as modifications in the sample size and the time series period (1991 to 2022) for the three models, may alter the accuracy of the predictions. This implies that the influence of these parameters on the predicting accuracy of the models should be carefully considered. In light of these limitations, future studies should focus on varying the sample size, experimenting with different lag values, and adjusting the period covered by the data to achieve optimal predicting accuracy for the models.

Our study found several contradictions with existing literature. Notably, we observed that an increase in literacy rates is associated with higher youth unemployment in Jordan, which contradicts previous findings by Riaz & Zafar4, who reported that higher literacy rates generally lead to lower unemployment rates. The observed contradiction may stem from differences in the datasets used. Our study utilizes time series data from 1991 to 2022, while Riaz & Zafar4 focused on cross-sectional data from a specific year. Additionally, our study specifically examines youth unemployment in Jordan, a country with unique socio-economic conditions, whereas Riaz & Zafar4 examined broader unemployment trends in a different region.

One possible explanation for the higher youth unemployment despite increased literacy rates could be a mismatch between the skills provided by the education system and the demands of the labor market in Jordan. While literacy rates have improved, the quality and relevance of education may not align with the needs of employers, leading to higher unemployment among educated youth. Additionally, cultural and economic factors unique to Jordan may contribute to this phenomenon.

Future research should investigate the specific types of education and skills training provided in Jordan and their alignment with labor market demands. Additionally, longitudinal studies could explore how changes in the education system over time impact youth unemployment rates. Comparative studies between Jordan and other countries with similar socio-economic profiles could also provide deeper insights into the observed contradictions.

By addressing these contradictions and providing possible explanations, we aim to contribute to a more nuanced understanding of the factors influencing youth unemployment in Jordan and highlight areas for further investigation.

References

  1. Hadam, S., Würz, N., & Schmid, T. (2023). Estimating regional unemployment with mobile network data for functional urban areas in Germany. Statistical Methods & Applications, 1(1), 1-29 []
  2. Abugamea, G. (2018). Determinants of Unemployment: Empirical Evidence from Palestine. Munich Personal RePEc Archive [] [] []
  3. Aqil, M., Ahmad, R., & Qadee, S. (2014). Determinants of Unemployment in Pakistan, International Journal of Physical and Social Science, 4(4), 12-18 [] []
  4. Riaz, A., & Zafar, F. (2018). Determinants of unemployment in less developed countries. SHS Web of Conferences, 48(1), 1-6 [] [] [] [] [] []
  5. Pereira, S., Turkman, K., Correia, L., & Rue, H. (2021). Unemployment estimation: Spatial point referenced methods and models. Spatial Statistics, 41(5), 1-10 []
  6. Alrabba, M. I. M. (2017). The Determinants of Unemployment Rate in Jordan: A Multivariate Approach, International Journal of Economics and Finance, 9(11), 109-117 [] [] []
  7. Antipova, A. (2021). Analysis of the COVID-19 impacts on employment and unemployment across the multi-dimensional social disadvantaged areas. Social Sciences & Humanities Open, 4(1), 1-12 [] [] []
  8. Selim, M., Kreishan, F., & Alawad, A. (2020). Determinants of youth unemployment: Evidence from Jordan. International Journal of Economics and Business Administration, 4(1), 152-165 [] []
  9. Halasa-Rappel, Y., Prince, H., & Khan, A. (2018). Economic growth, youth unemployment, and political and social instability: A study of policies and outcomes in post-Arab spring Egypt, Morocco, Jordan, and Tunisia. UNRISD Working Paper, 1-15 []
  10. Alawad, A. S., Kreishan, F. & Selim, M. (2020). Determinants of Youth Unemployment: Evidence from Jordan. International Journal of Economics & Business Administration, 4(4), 152-165 [] [] []
  11. Wang, Y., Xiang, Y., & Zhang, H. (2022). Comparison and forecasting of VaR models for measuring financial risk: Evidence from China. Hindawi, 2020(1), 1-7 [] []
  12. Uylangco, K., & Li, S. (2016). An evaluation of the effectiveness of Value-at-Risk (VaR) models for Australian banks under Basel III. Australian Journal of Management, 41(4), 699-718 []
  13. Jmaii, A. (2022). The impact of health crisis on the volatility of commodity price return: Does economic uncertainty matter? IGI Global, 1(1), 1-22 []
  14. Shrestha, M. B., & Bhatta, G. (2018). Selecting appropriate methodological framework for time series data analysis. The Journal of Finance and Data Science, 4(2), 71-89 []
  15. World Bank. (2023). World development indicators. World Bank Databank. Retrieved November 24, 2023, from https://databank.worldbank.org/source/world-development-indicators/preview/on# [] [] [] [] [] [] [] [] [] [] []
  16. Azolibe, C. B., Uzochukwu-Obi, P. C., & Dimnwobi, K. S. (2022). The determinants of unemployment rate in developing economies: does banking system credit matter? Journal of Economic and Administrative Sciences, 1(1), 1-20 [] []
  17. Hjazeen, H., Ozdeser, H., & Seraj, M. (2021). The nexus between the economic growth and unemployment in Jordan. Future Business Journal, 7(42), 11-18 []
  18. Sam, S. O. (2016). Modelling economic determinants of youth unemployment in Kenya. Journal of Emerging Trends in Economics and Management Sciences, 7(1), 2-9 []
  19. Alrabba, M. I. M. (2017). The Determinants of Unemployment Rate in Jordan: A Multivariate Approach, International Journal of Economics and Finance, 9(11), 109-117 []
  20. International Monetary Fund. (n.d.). Balance of payments database. International Monetary Fund. Retrieved [November 22, 2023], from https://data.imf.org/?sk=388DFA60-1D26-4ADE-B505-A05A558D9A42&sId=1479329132316 [] []
  21. United Nations Conference on Trade and Development United Nations Conference on Trade and Development. (n.d.). Country profile: Jordan. UNCTADstat. Retrieved October 20, 2023, from https://unctadstat.unctad.org/CountryProfile/GeneralProfile/en-GB/400/index.html [] []
  22. Jordan Department of Statistics. (2022). Jordan in figures 2022. Jordan Department of Statistics. Retrieved October 30, 2023, from https://dosweb.dos.gov.jo/product/jordan-in-figure2022/ []
  23. West, R. (2021). Best practice in statistics: The use of log transformation. Clinical Biochemistry, 59(3), 162-165 [] []
  24. Feng, C., Wang, H., & Tian, C. (2014). Log-transformation and its implications for data analysis. Shanghai Archives of Psychiatry, 26(2), 105–109 []
  25. Syla, S. (2013). Application of multiple linear regression analysis of employment through ALMP. International Journal of Academic Research in Business and Social Sciences, 3(12), 252-258 [] []
  26. Monari, F. N., Tinega, R., & Agasa, L. O. (2020). Modelling economic determinants of youth unemployment using multiple linear regression: A case study of Kenya. Internal Journal of Statistics and Applied Mathematics, 5(6), 07-11 []
  27. Ashfahani, R. N. G., Sugandha, A., Tripena, A., & Prabowo, A. (2020). Double Linear Regression Method to Analyze Factors Affecting the Employment Level in West Java Province. IEOM Society International, 2(1), 2577-2584 []
  28. Akaike, H. (1974). A new look at the statistical model identification. IEEE Transactions on Automatic Control, 19(6), 716-723. https://doi.org/10.1109/TAC.1974.1100705 []
  29. Handoyo, S., Chen, Y.-P., & Shelvi, T. M. (2022). Modeling vector autoregressive and autoregressive distributed lag of the beef and chicken meat prices during the COVID-19 pandemic in Indonesia. Journal of Hunan University Natural Sciences, 49(3), 2-11 []
  30. Hodson, T. O. (2022). Root-mean-square error (RMSE) or mean absolute error (MAE): When to use them or not. Geoscientific Model Development, 15(1), 5481-5485. Available at: https://gmd.copernicus.org/articles/15/5481/2022/ []
  31. Kim, S., & Kim, H. (2016). A new metric of absolute percentage error for intermittent demand forecasts. International Journal of Forecasting, 32(3), 669-679 []
  32. Li, T., Barwick, P. J., & Deng, Y. (2023). The COVID-19 pandemic and unemployment: Evidence from mobile phone data from China. Journal of Urban Economics, 2(1), 135-151 []
  33. Bekhet, H. A., & Al-Smadi, R. W. (2015). Determinants of Jordanian foreign direct investment inflows: Bounds testing approach, Economic Modelling, 46(5), 27-35 []
  34. Toarna, A., & Cojanu, V. (2015). The 2008 crisis: Causes and future direction for academic research. Procedia Economics and Finance, 27(1), 385-393 []
  35. Weldali, M. (2023). Jordan sees further drop in illiteracy rates. The Jordan Times. Available at: https://jordantimes.com/news/local/jordan-sees-further-drop-illiteracy-rates []
  36. Agarwal, J. (2019). Does foreign direct investment contribute to unemployment in home countries? An empirical survey. ZBW, 1(1), 1-48 [] []
  37. Barbu, D., Tandberg, D., & Bradley, C. (2015). Relationship Between Unemployment and College Enrollment and Success Outcomes.Available at: https://diginole.lib.fsu.edu/islandora/object/fsu%3A252922 [] []
  38. Hamilton, F. (2015). Interpreting regression models in clinical outcome studies. Bone Joint Res, 4(9), 152–153 [] []

LEAVE A REPLY

Please enter your comment!
Please enter your name here