Forecasting of Covid-19 Trends Using Arima Model and Tableau

Main Article Content

Article Sidebar

Published Sep 15, 2021
Ayushi Dwivedi

Abstract

COVID-19 is a plague that has wreaked havoc on over 175 countries throughout the world and could pose a serious threat to all of their executive branches. India is also affected by the wave, making it a difficult challenge to control the viral epidemic, which it has done so by enacting severe safeguards. This research examines the spread prognosis in India as well as the impact of various countermeasures. In most of the countries, and the number of infected, and killed the patients, is growing at an alarming rate. Forecasting techniques are frequently used, contributing in the development of improved approaches and the making of productive decisions. These methods analyse past events in order to make more accurate predictions about what will happen in the future. These forecasts could aid in preparing for potential dangers and consequences. In order to make accurate forecasts, forecasting techniques are quite necessary. COVID-19 has become a lethal disease as a result of its spread. The pandemic has caused severe damage to our day-to-day life, either directly or indirectly. We want to use the Machine Learning model to try to predict the disease's trend in India by looking at the approximation of when normalcy would return. This study investigated the effectiveness and appropriateness of the Facebook pages Prophet's predictions of the model and the ARIMA forecasting model to a dataset with the confirmed cases, deaths, and recovered from, the digits are collected, the covid19india site. The predictions of the models are compared to the specific data in order to see how well they perform. The result reveals that ARIMA outperforms prophet in general, despite the fact that it is farther away from the specific data the more days it forecasts. ARIMA proves to be a more accurate forecasting method than Facebook's Prophet by validating the performance of the applied models. The ARIMA model predicts that the historical data and anticipated data would have a high correlation, indicating that there will be minimal errors.

The SARS-COV-2 virus has created a new infection known as Coronavirus (COVID-19). In December 2019, the virus made its first appearance in Wuhan, Hubei province [19]. Wuhan began as a simple case of pneumonia, but it quickly escalated into a global disaster within a month. On March 11, 2020, the World Health Organization (WHO) classified COVID to be an outbreak. The disease is difficult to prevent because the infected person does not display symptoms for a long time or the disease does not exist. COVID-19 is classified into three groups based on its distribution.

  • Local outbreaks: because the virus's chain of transmission among humans can be traced at this stage, the source of infection is frequently identified. This category includes cases involving relatives or friends, as well as local disclosure.
  • Public transmission: at this time, the source of the infected people's network is unknown. Clusters of infected cases spread across communities, resulting in an increase in the number of infected people.
  • Large-scale transmission: at this point, the virus has spread swiftly to other parts of the world due to largescale uncontrolled human migration.mceclip0.png

    FIG1. Data forecast Using FB-Prophet Model

    PROPHET is a process of prediction time series data developed by the Facebook team of Data Science.  Its  main  motive  is  to  be able to predict ‘standard’, which means that the  PROPHET  [27]  wants  to  be  an  automated   predictive  tool in the environment, which  provides  the  most convenient use of time  management  methods  and  empowers analysts in any domain or people with little (possible)   forecasting   knowledge   to    predict successfully.   According   to   Facebook[5],   THE PROPHET “is doing very  well  with  a  series  of  time  series  with  strong  seasonal  results   and   several   periods of historical  and  strong  data  for  marketers  and  changes  in this trend.” In this case, our data is 

    Flawless but efficient. And this  is  what  the  PROPHET  is  not  ready  for. Its default environment provides time-series data flexibility with significant changes and that is

    Why analysts have not to worry that their data is not ready to predict the PROPHET.

    A PROPHET [12] is aware of its application. As a result,  analysts  must  make  changes  to  the  database  and  make  a  data  frame  with  two  columns:  a 'ds' column for  the  date  stamp  (in  data  time  format),  and a 'y' column for the prediction rating (in numerical values). Following that, analysts  must  create  an  object  from  the  Prophet  ()  section,  into  which  a  data  frame  will  be  inserted  and  saved.  Analysts can then choose the appropriate time frame for their predictions and proceed with the forecasting process.

    This is likewise one of the classification [20] algorithms which is directed and is anything but difficult to utilize. A dimensional space is used to depict each point that represents a piece of information in this computation. The dimensional space is sometimes referred to as an n-dimensional plane, where the number "n" refers to the number of points that make up the information piece. In every time series analysis, the forecast [1] is completely dependent on the values of the series that have occurred in the past, which are referred to as lags. As shown in the equation, a straightforward overview of a model that is dependent on just one lag or one variable in the series is provided. Because of the prior prediction Yt1, the predicted value Yt2 is dependent on the error, which is computed as the difference between the expected and actual outcomes. The slope coefficient is denoted by w, and the nonzero mean is denoted by x. While the ARIMA model [2] does not directly depend on the lags or variables of the time series, it does rely on error lags, which are calculated by subtracting the actual outcome from the projected outcome in order to calculate the error lags. A linear connection between time series values is assumed by ARIMA[23] models, and they make an attempt to leverage the linear correlations between observations in order to identify local patterns from the data while simultaneously eliminating high-frequency noise from the data. A typical notation for ARIMA models is ARIMA with p, d, and q, where integer values substitute for the parameters to indicate which type of ARIMA model is employed.


    • p:It is the number of lag observations in the model
    • d:It is the number of times that the raw observations are differenced .

    • Q: It is the size of the moving average window. The forecasting equation is constructed as given. First, let y denote dth difference of Y, means:


    If d=2: yt = (Yt - Yt-1) - (Yt-1 -Yt-2) = Yt -2Yt-1 + Yt-2

    .

    In terms   of   y,   the   general   forecasting   equation   is: 

     ŷt = μ + ϕ1 yt-1 +………+ ϕp yt-p — θ1et-1 -………- θqet-q,


    here:

    μ=constant

    ϕ1 yt-1 +…+ ϕp yt-p == ar terms (lagged values of y)

    -θ1et-1 -………- θqet-q == ma terms (lagged errors)


How to Cite

Dwivedi, A. (2021). Forecasting of Covid-19 Trends Using Arima Model and Tableau. SPAST Abstracts, 1(01). Retrieved from https://spast.org/techrep/article/view/400
Abstract 196 |

Article Details

Keywords

Aarima, Mathematical modeling, COVID19

References
1. Almasarweh M, Wadi SAL. ARIMA Model in Predicting Banking Stock Market Data. Modern Applied Science. 2018 Oct; 12: p. 309.
2. Petrevska B. Predicting tourism demand by A.R.I.M.A. models. Economic Research-Ekonomska Istraživanja. 2017 Jan; 30: p. 939–950.
Section
General Session: Technologies For Smart Connected Societies