Forecasting of Covid-19 Trends Using Arima Model and Tableau
Main Article Content
Article Sidebar
Abstract
COVID-19 is a plague that has wreaked havoc on over 175 countries throughout the world and could pose a serious threat to all of their executive branches. India is also affected by the wave, making it a difficult challenge to control the viral epidemic, which it has done so by enacting severe safeguards. This research examines the spread prognosis in India as well as the impact of various countermeasures. In most of the countries, and the number of infected, and killed the patients, is growing at an alarming rate. Forecasting techniques are frequently used, contributing in the development of improved approaches and the making of productive decisions. These methods analyse past events in order to make more accurate predictions about what will happen in the future. These forecasts could aid in preparing for potential dangers and consequences. In order to make accurate forecasts, forecasting techniques are quite necessary. COVID-19 has become a lethal disease as a result of its spread. The pandemic has caused severe damage to our day-to-day life, either directly or indirectly. We want to use the Machine Learning model to try to predict the disease's trend in India by looking at the approximation of when normalcy would return. This study investigated the effectiveness and appropriateness of the Facebook pages Prophet's predictions of the model and the ARIMA forecasting model to a dataset with the confirmed cases, deaths, and recovered from, the digits are collected, the covid19india site. The predictions of the models are compared to the specific data in order to see how well they perform. The result reveals that ARIMA outperforms prophet in general, despite the fact that it is farther away from the specific data the more days it forecasts. ARIMA proves to be a more accurate forecasting method than Facebook's Prophet by validating the performance of the applied models. The ARIMA model predicts that the historical data and anticipated data would have a high correlation, indicating that there will be minimal errors.
The SARS-COV-2 virus has created a new infection known as Coronavirus (COVID-19). In December 2019, the virus made its first appearance in Wuhan, Hubei province [19]. Wuhan began as a simple case of pneumonia, but it quickly escalated into a global disaster within a month. On March 11, 2020, the World Health Organization (WHO) classified COVID to be an outbreak. The disease is difficult to prevent because the infected person does not display symptoms for a long time or the disease does not exist. COVID-19 is classified into three groups based on its distribution.
- Local outbreaks: because the virus's chain of transmission among humans can be traced at this stage, the source of infection is frequently identified. This category includes cases involving relatives or friends, as well as local disclosure.
- Public transmission: at this time, the source of the infected people's network is unknown. Clusters of infected cases spread across communities, resulting in an increase in the number of infected people.
- Large-scale transmission: at this point, the virus has spread swiftly to other parts of the world due to largescale uncontrolled human migration.
FIG1. Data forecast Using FB-Prophet Model
PROPHET is a process of prediction time series data developed by the Facebook team of Data Science. Its main motive is to be able to predict ‘standard’, which means that the PROPHET [27] wants to be an automated predictive tool in the environment, which provides the most convenient use of time management methods and empowers analysts in any domain or people with little (possible) forecasting knowledge to predict successfully. According to Facebook[5], THE PROPHET “is doing very well with a series of time series with strong seasonal results and several periods of historical and strong data for marketers and changes in this trend.” In this case, our data is
Flawless but efficient. And this is what the PROPHET is not ready for. Its default environment provides time-series data flexibility with significant changes and that is
Why analysts have not to worry that their data is not ready to predict the PROPHET.
A PROPHET [12] is aware of its application. As a result, analysts must make changes to the database and make a data frame with two columns: a 'ds' column for the date stamp (in data time format), and a 'y' column for the prediction rating (in numerical values). Following that, analysts must create an object from the Prophet () section, into which a data frame will be inserted and saved. Analysts can then choose the appropriate time frame for their predictions and proceed with the forecasting process.
This is likewise one of the classification [20] algorithms which is directed and is anything but difficult to utilize. A dimensional space is used to depict each point that represents a piece of information in this computation. The dimensional space is sometimes referred to as an n-dimensional plane, where the number "n" refers to the number of points that make up the information piece. In every time series analysis, the forecast [1] is completely dependent on the values of the series that have occurred in the past, which are referred to as lags. As shown in the equation, a straightforward overview of a model that is dependent on just one lag or one variable in the series is provided. Because of the prior prediction Yt1, the predicted value Yt2 is dependent on the error, which is computed as the difference between the expected and actual outcomes. The slope coefficient is denoted by w, and the nonzero mean is denoted by x. While the ARIMA model [2] does not directly depend on the lags or variables of the time series, it does rely on error lags, which are calculated by subtracting the actual outcome from the projected outcome in order to calculate the error lags. A linear connection between time series values is assumed by ARIMA[23] models, and they make an attempt to leverage the linear correlations between observations in order to identify local patterns from the data while simultaneously eliminating high-frequency noise from the data. A typical notation for ARIMA models is ARIMA with p, d, and q, where integer values substitute for the parameters to indicate which type of ARIMA model is employed.
- p:It is the number of lag observations in the model
- d:It is the number of times that the raw observations are differenced .
- Q: It is the size of the moving average window. The forecasting equation is constructed as given. First, let y denote dth difference of Y, means:
If d=2: yt = (Yt - Yt-1) - (Yt-1 -Yt-2) = Yt -2Yt-1 + Yt-2
.
In terms of y, the general forecasting equation is:
ŷt = μ + ϕ1 yt-1 +………+ ϕp yt-p — θ1et-1 -………- θqet-q,
here:
μ=constant
ϕ1 yt-1 +…+ ϕp yt-p == ar terms (lagged values of y)
-θ1et-1 -………- θqet-q == ma terms (lagged errors)
How to Cite
Article Details
Aarima, Mathematical modeling, COVID19
2. Petrevska B. Predicting tourism demand by A.R.I.M.A. models. Economic Research-Ekonomska Istraživanja. 2017 Jan; 30: p. 939–950.