# [Forecasting: Principles and Practice (3rd ed)](https://otexts.com/fpp3/)

## 8.1 Simple exponential smoothing
The simplest of the exponentially smoothing methods is naturally called **simple exponential smoothing** (SES).[^16] This method is suitable for forecasting data with no clear trend or seasonal pattern. For example, the data in Figure 8.1 do not display any clear trending behaviour or any seasonality. (There is a decline in the last few years, which might suggest a trend. We will consider whether a trended method would be better for this series later in this chapter.) We have already considered the naïve and the average as possible methods for forecasting such data (Section 5.2).
```
algeria_economy <- global_economy |>
  filter(Country == "Algeria")
algeria_economy |>
  autoplot(Exports) +
  labs(y = "% of GDP", title = "Exports: Algeria")
```
![Exports of goods and services from Algeria.](https://otexts.com/fpp3/fpp_files/figure-html/7-oil-1.png)

Figure 8.1: Exports of goods and services from Algeria from 1960 to 2017.
Using the naïve method, all forecasts for the future are equal to the last observed value of the series,
\[
\hat{y}_{T+h|T} = y_{T},
\]
for \(h=1,2,\dots\). Hence, the naïve method assumes that the most recent observation is the only important one, and all previous observations provide no information for the future. This can be thought of as a weighted average where all of the weight is given to the last observation.
Using the average method, all future forecasts are equal to a simple average of the observed data,
\[
\hat{y}_{T+h|T} = \frac1T \sum_{t=1}^T y_t,
\]
for \(h=1,2,\dots\). Hence, the average method assumes that all observations are of equal importance, and gives them equal weights when generating forecasts.
We often want something between these two extremes. For example, it may be sensible to attach larger weights to more recent observations than to observations from the distant past. This is exactly the concept behind simple exponential smoothing. Forecasts are calculated using weighted averages, where the weights decrease exponentially as observations come from further in the past — the smallest weights are associated with the oldest observations:
\[\begin{equation}
\hat{y}_{T+1|T} = \alpha y_T + \alpha(1-\alpha) y_{T-1} + \alpha(1-\alpha)^2 y_{T-2}+ \cdots, \tag{8.1}
\end{equation}\]
where \(0 \le \alpha \le 1\) is the smoothing parameter. The one-step-ahead forecast for time \(T+1\) is a weighted average of all of the observations in the series \(y_1,\dots,y_T\). The rate at which the weights decrease is controlled by the parameter \(\alpha\).
The table below shows the weights attached to observations for four different values of \(\alpha\) when forecasting using simple exponential smoothing. Note that the sum of the weights even for a small value of \(\alpha\) will be approximately one for any reasonable sample size.

| | \(\alpha=0.2\) | \(\alpha=0.4\) | \(\alpha=0.6\) | \(\alpha=0.8\) |
|---|---|---|---|---|
| \(y_{T}\) | 0.2000 | 0.4000 | 0.6000 | 0.8000 |
| \(y_{T-1}\) | 0.1600 | 0.2400 | 0.2400 | 0.1600 |
| \(y_{T-2}\) | 0.1280 | 0.1440 | 0.0960 | 0.0320 |
| \(y_{T-3}\) | 0.1024 | 0.0864 | 0.0384 | 0.0064 |
| \(y_{T-4}\) | 0.0819 | 0.0518 | 0.0154 | 0.0013 |
| \(y_{T-5}\) | 0.0655 | 0.0311 | 0.0061 | 0.0003 |
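As a concrete check of these weights, here is a small Python sketch (illustrative only, outside the book's R workflow) that recomputes the table's columns from \(\alpha(1-\alpha)^j\) and confirms that, even for \(\alpha=0.2\), the weights over the 58 observations used later in this chapter sum to nearly one.

```python
def ses_weights(alpha, n):
    """Weights SES attaches to y_T, y_{T-1}, ..., y_{T-n+1}."""
    return [alpha * (1 - alpha) ** j for j in range(n)]

# Reproduce the four columns of the table above
for alpha in (0.2, 0.4, 0.6, 0.8):
    print(alpha, [round(w, 4) for w in ses_weights(alpha, 6)])

# Even for a small alpha, the weights over a modest sample sum to almost one
print(round(sum(ses_weights(0.2, 58)), 4))
```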
For any \(\alpha\) between 0 and 1, the weights attached to the observations decrease exponentially as we go back in time, hence the name “exponential smoothing”. If \(\alpha\) is small (i.e., close to 0), more weight is given to observations from the more distant past. If \(\alpha\) is large (i.e., close to 1), more weight is given to the more recent observations. For the extreme case where \(\alpha=1\), \(\hat{y}_{T+1|T}=y_T\), so the forecasts are equal to the naïve forecasts.
We present two equivalent forms of simple exponential smoothing, each of which leads to the forecast Equation (8.1).
### Weighted average form
The forecast at time \(T+1\) is equal to a weighted average between the most recent observation \(y_T\) and the previous forecast \(\hat{y}_{T|T-1}\):
\[
\hat{y}_{T+1|T} = \alpha y_T + (1-\alpha) \hat{y}_{T|T-1},
\]
where \(0 \le \alpha \le 1\) is the smoothing parameter.
Similarly, we can write the fitted values as
\[
\hat{y}_{t+1|t} = \alpha y_t + (1-\alpha) \hat{y}_{t|t-1},
\]
for \(t=1,\dots,T\). (Recall that fitted values are simply one-step forecasts of the training data.)
The process has to start somewhere, so we let the first fitted value at time 1 be denoted by \(\ell_0\) (which we will have to estimate). Then
\[\begin{align*}
\hat{y}_{2|1} &= \alpha y_1 + (1-\alpha) \ell_0\\
\hat{y}_{3|2} &= \alpha y_2 + (1-\alpha) \hat{y}_{2|1}\\
\hat{y}_{4|3} &= \alpha y_3 + (1-\alpha) \hat{y}_{3|2}\\
\vdots\\
\hat{y}_{T|T-1} &= \alpha y_{T-1} + (1-\alpha) \hat{y}_{T-1|T-2}\\
\hat{y}_{T+1|T} &= \alpha y_T + (1-\alpha) \hat{y}_{T|T-1}.
\end{align*}\]
Substituting each equation into the following equation, we obtain
\[\begin{align*}
\hat{y}_{3|2} & = \alpha y_2 + (1-\alpha) \left[\alpha y_1 + (1-\alpha) \ell_0\right] \\
& = \alpha y_2 + \alpha(1-\alpha) y_1 + (1-\alpha)^2 \ell_0 \\
\hat{y}_{4|3} & = \alpha y_3 + (1-\alpha) [\alpha y_2 + \alpha(1-\alpha) y_1 + (1-\alpha)^2 \ell_0]\\
& = \alpha y_3 + \alpha(1-\alpha) y_2 + \alpha(1-\alpha)^2 y_1 + (1-\alpha)^3 \ell_0 \\
& ~~\vdots \\
\hat{y}_{T+1|T} & = \sum_{j=0}^{T-1} \alpha(1-\alpha)^j y_{T-j} + (1-\alpha)^T \ell_{0}.
\end{align*}\]
The last term becomes tiny for large \(T\). So, the weighted average form leads to the same forecast Equation (8.1).
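This equivalence is easy to verify numerically. The short Python sketch below (illustrative only, with a made-up series and arbitrary \(\alpha\) and \(\ell_0\); the book itself works in R) computes the one-step forecast both by the recursion and by the expanded sum, including the \((1-\alpha)^T\ell_0\) term.

```python
alpha, l0 = 0.3, 10.0
y = [12.0, 9.5, 11.2, 13.1, 10.8]   # a small made-up series
T = len(y)

# Recursive weighted-average form: replace the old forecast step by step
forecast = l0
for obs in y:
    forecast = alpha * obs + (1 - alpha) * forecast

# Expanded form: sum_j alpha*(1-alpha)^j y_{T-j} + (1-alpha)^T l0
expanded = sum(alpha * (1 - alpha) ** j * y[T - 1 - j] for j in range(T))
expanded += (1 - alpha) ** T * l0

assert abs(forecast - expanded) < 1e-12
print(round(forecast, 4))
```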
### Component form

An alternative representation is the component form. For simple exponential smoothing, the only component included is the level, \(\ell_t\). (Other methods which are considered later in this chapter may also include a trend \(b_t\) and a seasonal component \(s_t\).) Component form representations of exponential smoothing methods comprise a forecast equation and a smoothing equation for each of the components included in the method. The component form of simple exponential smoothing is given by:
\[\begin{align*}
\text{Forecast equation} && \hat{y}_{t+h|t} & = \ell_{t}\\
\text{Smoothing equation} && \ell_{t} & = \alpha y_{t} + (1 - \alpha)\ell_{t-1},
\end{align*}\]
where \(\ell_{t}\) is the level (or the smoothed value) of the series at time \(t\). Setting \(h=1\) gives the fitted values, while setting \(t=T\) gives the true forecasts beyond the training data.
The forecast equation shows that the forecast value at time \(t+1\) is the estimated level at time \(t\). The smoothing equation for the level (usually referred to as the level equation) gives the estimated level of the series at each period \(t\).
If we replace \(\ell_t\) with \(\hat{y}_{t+1|t}\) and \(\ell_{t-1}\) with \(\hat{y}_{t|t-1}\) in the smoothing equation, we will recover the weighted average form of simple exponential smoothing.
The component form of simple exponential smoothing is not particularly useful on its own, but it will be the easiest form to use when we start adding other components.
### Flat forecasts
Simple exponential smoothing has a “flat” forecast function:
\[
\hat{y}_{T+h|T} = \hat{y}_{T+1|T}=\ell_T, \qquad h=2,3,\dots.
\]
That is, all forecasts take the same value, equal to the last level component. Remember that these forecasts will only be suitable if the time series has no trend or seasonal component.
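A minimal component-form SES can be sketched in a few lines of Python (an illustration only, not the fable-based R workflow the book uses); it makes the flat forecast function explicit, since every forecast beyond time \(T\) is just the last level \(\ell_T\).

```python
def ses_components(y, alpha, l0):
    """Return (levels l_1..l_T, fitted values y_hat_{t|t-1})."""
    levels, fitted = [], []
    level = l0
    for obs in y:
        fitted.append(level)                        # forecast eq: y_hat_{t|t-1} = l_{t-1}
        level = alpha * obs + (1 - alpha) * level   # smoothing eq: l_t
        levels.append(level)
    return levels, fitted

y = [12.0, 9.5, 11.2, 13.1, 10.8]                   # made-up series
levels, fitted = ses_components(y, alpha=0.3, l0=10.0)

# Flat forecast function: y_hat_{T+h|T} = l_T for every h
forecasts = [levels[-1] for h in range(1, 6)]
assert len(set(forecasts)) == 1
print(round(levels[-1], 4))
```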
### Optimisation
The application of every exponential smoothing method requires the smoothing parameters and the initial values to be chosen. In particular, for simple exponential smoothing, we need to select the values of \(\alpha\) and \(\ell_0\). All forecasts can be computed from the data once we know those values. For the methods that follow there is usually more than one smoothing parameter and more than one initial component to be chosen.
In some cases, the smoothing parameters may be chosen in a subjective manner — the forecaster specifies the value of the smoothing parameters based on previous experience. However, a more reliable and objective way to obtain values for the unknown parameters is to estimate them from the observed data.
In Section 7.2, we estimated the coefficients of a regression model by minimising the sum of the squared residuals (usually known as SSE or “sum of squared errors”). Similarly, the unknown parameters and the initial values for any exponential smoothing method can be estimated by minimising the SSE. The residuals are specified as \(e_t=y_t - \hat{y}_{t|t-1}\) for \(t=1,\dots,T\). Hence, we find the values of the unknown parameters and the initial values that minimise
\[\begin{equation}
\text{SSE}=\sum_{t=1}^T(y_t - \hat{y}_{t|t-1})^2=\sum_{t=1}^Te_t^2. \tag{8.2}
\end{equation}\]
Unlike the regression case (where we have formulas which return the values of the regression coefficients that minimise the SSE), this involves a non-linear minimisation problem, and we need to use an optimisation tool to solve it.
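To see what such a minimisation involves, here is a deliberately crude Python sketch that searches a grid of \((\alpha, \ell_0)\) values for the pair minimising the SSE of Equation (8.2). The series, grid, and ranges are arbitrary illustrations; software such as fable's `ETS()` uses a proper numerical optimiser rather than a grid.

```python
def sse(y, alpha, l0):
    """Sum of squared one-step errors e_t = y_t - y_hat_{t|t-1}."""
    level, total = l0, 0.0
    for obs in y:
        e = obs - level                           # one-step forecast is the level
        total += e * e
        level = alpha * obs + (1 - alpha) * level
    return total

y = [12.0, 9.5, 11.2, 13.1, 10.8, 12.6, 11.9]     # made-up series

# Grid search: alpha in [0, 1] by 0.01, l0 in [8, 16] by 0.1
best = min(
    ((sse(y, a / 100, b / 10), a / 100, b / 10)
     for a in range(0, 101) for b in range(80, 161)),
    key=lambda t: t[0],
)
print(best)   # (minimum SSE, alpha, l0)
```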
### Example: Algerian exports
In this example, simple exponential smoothing is applied to forecast exports of goods and services from Algeria.
```
# Estimate parameters
fit <- algeria_economy |>
  model(ETS(Exports ~ error("A") + trend("N") + season("N")))
fc <- fit |>
  forecast(h = 5)
```
This gives parameter estimates \(\hat\alpha=0.84\) and \(\hat\ell_0=39.5\), obtained by minimising SSE over periods \(t=1,2,\dots,58\), subject to the restriction that \(0\le\alpha\le1\).
In Table 8.1 we demonstrate the calculation using these parameters. The second last column shows the estimated level for times \(t=0\) to \(t=58\); the last few rows of the last column show the forecasts for \(h=1\) to \(5\)-steps ahead.
Table 8.1: Forecasting goods and services exports from Algeria using simple exponential smoothing.

| Year | Time \(t\) | Observation \(y_t\) | Level \(\ell_t\) | Forecast \(\hat{y}_{t\vert t-1}\) |
|---|---|---|---|---|
| 1959 | 0 | | 39.54 | |
| 1960 | 1 | 39.04 | 39.12 | 39.54 |
| 1961 | 2 | 46.24 | 45.10 | 39.12 |
| 1962 | 3 | 19.79 | 23.84 | 45.10 |
| 1963 | 4 | 24.68 | 24.55 | 23.84 |
| 1964 | 5 | 25.08 | 25.00 | 24.55 |
| 1965 | 6 | 22.60 | 22.99 | 25.00 |
| 1966 | 7 | 25.99 | 25.51 | 22.99 |
| 1967 | 8 | 23.43 | 23.77 | 25.51 |
| ⋮ | ⋮ | ⋮ | ⋮ | ⋮ |
| 2014 | 55 | 30.22 | 30.80 | 33.85 |
| 2015 | 56 | 23.17 | 24.39 | 30.80 |
| 2016 | 57 | 20.86 | 21.43 | 24.39 |
| 2017 | 58 | 22.64 | 22.44 | 21.43 |
| | \(h\) | | | \(\hat{y}_{T+h\vert T}\) |
| 2018 | 1 | | | 22.44 |
| 2019 | 2 | | | 22.44 |
| 2020 | 3 | | | 22.44 |
| 2021 | 4 | | | 22.44 |
| 2022 | 5 | | | 22.44 |
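The level recursion behind Table 8.1 is simple enough to retrace by hand. The Python sketch below (illustrative only; the book's calculations are done in R) uses the rounded estimates \(\hat\alpha=0.84\) and \(\ell_0=39.54\) from the table, so it reproduces the first rows only to about two decimal places.

```python
alpha, level = 0.84, 39.54                  # rounded estimates, l_0 from Table 8.1
observations = [39.04, 46.24, 19.79, 24.68]  # Algerian exports, 1960-1963

for y in observations:
    forecast = level                             # y_hat_{t|t-1} = l_{t-1}
    level = alpha * y + (1 - alpha) * level      # l_t = alpha*y_t + (1-alpha)*l_{t-1}
    print(round(forecast, 2), round(level, 2))
```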
The black line in Figure 8.2 shows the data, which has a changing level over time.
```
fc |>
  autoplot(algeria_economy) +
  geom_line(aes(y = .fitted), col = "#D55E00",
            data = augment(fit)) +
  labs(y = "% of GDP", title = "Exports: Algeria") +
  guides(colour = "none")
```
Figure 8.2: Simple exponential smoothing applied to exports from Algeria (1960–2017). The orange curve shows the one-step-ahead fitted values.
The forecasts for the period 2018–2022 are plotted in Figure 8.2. Also plotted are one-step-ahead fitted values alongside the data over the period 1960–2017. The large value of \(\alpha\) in this example is reflected in the large adjustment that takes place in the estimated level \(\ell_t\) at each time. A smaller value of \(\alpha\) would lead to smaller changes over time, and so the series of fitted values would be smoother.
The prediction intervals shown here are calculated using the methods described in Section 8.7. The prediction intervals show that there is considerable uncertainty in the future exports over the five-year forecast period. So interpreting the point forecasts without accounting for the large uncertainty can be very misleading.
[^16]: In some books it is called “single exponential smoothing”.
| Year | Time | Observation | Level | Forecast |
|---|---|---|---|---|
| | \\(t\\) | \\(y\_t\\) | \\(\\ell\_t\\) | \\(\\hat{y}\_{t\\vert t-1}\\) |
| 1959 | 0 | | 39\.54 | |
| 1960 | 1 | 39\.04 | 39\.12 | 39\.54 |
| 1961 | 2 | 46\.24 | 45\.10 | 39\.12 |
| 1962 | 3 | 19\.79 | 23\.84 | 45\.10 |
| 1963 | 4 | 24\.68 | 24\.55 | 23\.84 |
| 1964 | 5 | 25\.08 | 25\.00 | 24\.55 |
| 1965 | 6 | 22\.60 | 22\.99 | 25\.00 |
| 1966 | 7 | 25\.99 | 25\.51 | 22\.99 |
| 1967 | 8 | 23\.43 | 23\.77 | 25\.51 |
| | â‹® | â‹® | â‹® | â‹® |
| 2014 | 55 | 30\.22 | 30\.80 | 33\.85 |
| 2015 | 56 | 23\.17 | 24\.39 | 30\.80 |
| 2016 | 57 | 20\.86 | 21\.43 | 24\.39 |
| 2017 | 58 | 22\.64 | 22\.44 | 21\.43 |
| | \\(h\\) | | | \\(\\hat{y}\_{T+h\\vert T}\\) |
| 2018 | 1 | | | 22\.44 |
| 2019 | 2 | | | 22\.44 |
| 2020 | 3 | | | 22\.44 |
| 2021 | 4 | | | 22\.44 |
| 2022 | 5 | | | 22\.44 |
The black line in Figure [8\.2](https://otexts.com/fpp3/ses.html#fig:ses) shows the data, which has a changing level over time.
```
fc |>
autoplot(algeria_economy) +
geom_line(aes(y = .fitted), col="#D55E00",
data = augment(fit)) +
labs(y="% of GDP", title="Exports: Algeria") +
guides(colour = "none")
```

Figure 8.2: Simple exponential smoothing applied to exports from Algeria (1960–2017). The orange curve shows the one-step-ahead fitted values.
The forecasts for the period 2018–2022 are plotted in Figure [8\.2](https://otexts.com/fpp3/ses.html#fig:ses). Also plotted are one-step-ahead fitted values alongside the data over the period 1960–2017. The large value of \\(\\alpha\\) in this example is reflected in the large adjustment that takes place in the estimated level \\(\\ell\_t\\) at each time. A smaller value of \\(\\alpha\\) would lead to smaller changes over time, and so the series of fitted values would be smoother.
The prediction intervals shown here are calculated using the methods described in Section [8\.7](https://otexts.com/fpp3/ets-forecasting.html#ets-forecasting). The prediction intervals show that there is considerable uncertainty in the future exports over the five-year forecast period. So interpreting the point forecasts without accounting for the large uncertainty can be very misleading.
***
1. In some books it is called “single exponential smoothing”.[↩︎](https://otexts.com/fpp3/ses.html#fnref16) |
The simplest of the exponential smoothing methods is naturally called **simple exponential smoothing** (SES)[16](https://otexts.com/fpp3/ses.html#fn16). This method is suitable for forecasting data with no clear trend or seasonal pattern. For example, the data in Figure [8\.1](https://otexts.com/fpp3/ses.html#fig:7-oil) do not display any clear trending behaviour or any seasonality. (There is a decline in the last few years, which might suggest a trend. We will consider whether a trended method would be better for this series later in this chapter.) We have already considered the naïve and the average as possible methods for forecasting such data (Section [5\.2](https://otexts.com/fpp3/simple-methods.html#simple-methods)).
```
# global_economy is provided with the fpp3 package (assumes library(fpp3) is loaded)
algeria_economy <- global_economy |>
  filter(Country == "Algeria")
algeria_economy |>
  autoplot(Exports) +
  labs(y = "% of GDP", title = "Exports: Algeria")
```

Figure 8.1: Exports of goods and services from Algeria from 1960 to 2017.
Using the naïve method, all forecasts for the future are equal to the last observed value of the series, \\\[ \\hat{y}\_{T+h\|T} = y\_{T}, \\\] for \\(h=1,2,\\dots\\). Hence, the naïve method assumes that the most recent observation is the only important one, and all previous observations provide no information for the future. This can be thought of as a weighted average where all of the weight is given to the last observation.
Using the average method, all future forecasts are equal to a simple average of the observed data, \\\[ \\hat{y}\_{T+h\|T} = \\frac1T \\sum\_{t=1}^T y\_t, \\\] for \\(h=1,2,\\dots\\). Hence, the average method assumes that all observations are of equal importance, and gives them equal weights when generating forecasts.
We often want something between these two extremes. For example, it may be sensible to attach larger weights to more recent observations than to observations from the distant past. This is exactly the concept behind simple exponential smoothing. Forecasts are calculated using weighted averages, where the weights decrease exponentially as observations come from further in the past — the smallest weights are associated with the oldest observations: \\\[\\begin{equation} \\hat{y}\_{T+1\|T} = \\alpha y\_T + \\alpha(1-\\alpha) y\_{T-1} + \\alpha(1-\\alpha)^2 y\_{T-2}+ \\cdots, \\tag{8.1} \\end{equation}\\\] where \\(0 \\le \\alpha \\le 1\\) is the smoothing parameter. The one-step-ahead forecast for time \\(T+1\\) is a weighted average of all of the observations in the series \\(y\_1,\\dots,y\_T\\). The rate at which the weights decrease is controlled by the parameter \\(\\alpha\\).
The table below shows the weights attached to observations for four different values of \\(\\alpha\\) when forecasting using simple exponential smoothing. Note that the sum of the weights even for a small value of \\(\\alpha\\) will be approximately one for any reasonable sample size.
| | \\(\\alpha=0.2\\) | \\(\\alpha=0.4\\) | \\(\\alpha=0.6\\) | \\(\\alpha=0.8\\) |
|---|---|---|---|---|
| \\(y\_{T}\\) | 0\.2000 | 0\.4000 | 0\.6000 | 0\.8000 |
| \\(y\_{T-1}\\) | 0\.1600 | 0\.2400 | 0\.2400 | 0\.1600 |
| \\(y\_{T-2}\\) | 0\.1280 | 0\.1440 | 0\.0960 | 0\.0320 |
| \\(y\_{T-3}\\) | 0\.1024 | 0\.0864 | 0\.0384 | 0\.0064 |
| \\(y\_{T-4}\\) | 0\.0819 | 0\.0518 | 0\.0154 | 0\.0013 |
| \\(y\_{T-5}\\) | 0\.0655 | 0\.0311 | 0\.0061 | 0\.0003 |
For any \\(\\alpha\\) between 0 and 1, the weights attached to the observations decrease exponentially as we go back in time, hence the name “exponential smoothing”. If \\(\\alpha\\) is small (i.e., close to 0), more weight is given to observations from the more distant past. If \\(\\alpha\\) is large (i.e., close to 1), more weight is given to the more recent observations. For the extreme case where \\(\\alpha=1\\), \\(\\hat{y}\_{T+1\|T}=y\_T\\), so the forecasts are equal to the naïve forecasts.
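The weights in the table are simply \\(\\alpha(1-\\alpha)^j\\) for \\(j=0,1,2,\\dots\\). A short sketch (in Python rather than the chapter's R, purely for illustration) reproduces the columns of the table and confirms that the weights sum to approximately one:

```python
# Weights alpha * (1 - alpha)^j attached to y_{T-j} in simple exponential smoothing
def ses_weights(alpha, n):
    return [alpha * (1 - alpha) ** j for j in range(n)]

for a in (0.2, 0.4, 0.6, 0.8):
    print(a, [round(w, 4) for w in ses_weights(a, 6)])
# The alpha = 0.2 row: [0.2, 0.16, 0.128, 0.1024, 0.0819, 0.0655]

# The weights sum to 1 - (1 - alpha)^n, close to 1 for any reasonable sample size
print(sum(ses_weights(0.2, 58)))
```

Even for the small value \\(\\alpha=0.2\\), the 58 weights used for the Algerian exports series sum to within about \\(2\\times10^{-6}\\) of one.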
We present two equivalent forms of simple exponential smoothing, each of which leads to the forecast Equation [(8.1)](https://otexts.com/fpp3/ses.html#eq:7-ses).
### Weighted average form
The forecast at time \\(T+1\\) is equal to a weighted average between the most recent observation \\(y\_T\\) and the previous forecast \\(\\hat{y}\_{T\|T-1}\\): \\\[ \\hat{y}\_{T+1\|T} = \\alpha y\_T + (1-\\alpha) \\hat{y}\_{T\|T-1}, \\\] where \\(0 \\le \\alpha \\le 1\\) is the smoothing parameter. Similarly, we can write the fitted values as \\\[ \\hat{y}\_{t+1\|t} = \\alpha y\_t + (1-\\alpha) \\hat{y}\_{t\|t-1}, \\\] for \\(t=1,\\dots,T\\). (Recall that fitted values are simply one-step forecasts of the training data.)
The process has to start somewhere, so we let the first fitted value at time 1 be denoted by \\(\\ell\_0\\) (which we will have to estimate). Then \\\[\\begin{align\*} \\hat{y}\_{2\|1} &= \\alpha y\_1 + (1-\\alpha) \\ell\_0\\\\ \\hat{y}\_{3\|2} &= \\alpha y\_2 + (1-\\alpha) \\hat{y}\_{2\|1}\\\\ \\hat{y}\_{4\|3} &= \\alpha y\_3 + (1-\\alpha) \\hat{y}\_{3\|2}\\\\ \\vdots\\\\ \\hat{y}\_{T\|T-1} &= \\alpha y\_{T-1} + (1-\\alpha) \\hat{y}\_{T-1\|T-2}\\\\ \\hat{y}\_{T+1\|T} &= \\alpha y\_T + (1-\\alpha) \\hat{y}\_{T\|T-1}. \\end{align\*}\\\] Substituting each equation into the following equation, we obtain \\\[\\begin{align\*} \\hat{y}\_{3\|2} & = \\alpha y\_2 + (1-\\alpha) \\left\[\\alpha y\_1 + (1-\\alpha) \\ell\_0\\right\] \\\\ & = \\alpha y\_2 + \\alpha(1-\\alpha) y\_1 + (1-\\alpha)^2 \\ell\_0 \\\\ \\hat{y}\_{4\|3} & = \\alpha y\_3 + (1-\\alpha) \[\\alpha y\_2 + \\alpha(1-\\alpha) y\_1 + (1-\\alpha)^2 \\ell\_0\]\\\\ & = \\alpha y\_3 + \\alpha(1-\\alpha) y\_2 + \\alpha(1-\\alpha)^2 y\_1 + (1-\\alpha)^3 \\ell\_0 \\\\ & \~~\\vdots \\\\ \\hat{y}\_{T+1\|T} & = \\sum\_{j=0}^{T-1} \\alpha(1-\\alpha)^j y\_{T-j} + (1-\\alpha)^T \\ell\_{0}. \\end{align\*}\\\] The last term becomes tiny for large \\(T\\). So, the weighted average form leads to the same forecast Equation [(8.1)](https://otexts.com/fpp3/ses.html#eq:7-ses).
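The substitution above can also be checked numerically: iterating the weighted-average recursion over a series gives exactly the expanded sum, including the \\((1-\\alpha)^T\\ell\_0\\) remainder term. A small check (in Python, for illustration; the values of \\(\\alpha\\) and \\(\\ell\_0\\) here are made up):

```python
# Numerical check that the weighted-average recursion expands to Equation (8.1)
def ses_recursive(y, alpha, l0):
    fc = l0                                    # y-hat_{1|0} = l_0
    for obs in y:
        fc = alpha * obs + (1 - alpha) * fc    # y-hat_{t+1|t}
    return fc

def ses_expanded(y, alpha, l0):
    T = len(y)
    weighted = sum(alpha * (1 - alpha) ** j * y[T - 1 - j] for j in range(T))
    return weighted + (1 - alpha) ** T * l0    # the last term becomes tiny for large T

y = [39.04, 46.24, 19.79, 24.68, 25.08]        # a few of the Algerian export values
assert abs(ses_recursive(y, 0.3, 40.0) - ses_expanded(y, 0.3, 40.0)) < 1e-9
```

Setting `alpha = 1.0` in either function returns the last observation, recovering the naïve forecast.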
### Component form
An alternative representation is the component form. For simple exponential smoothing, the only component included is the level, \\(\\ell\_t\\). (Other methods which are considered later in this chapter may also include a trend \\(b\_t\\) and a seasonal component \\(s\_t\\).) Component form representations of exponential smoothing methods comprise a forecast equation and a smoothing equation for each of the components included in the method. The component form of simple exponential smoothing is given by: \\\[\\begin{align\*} \\text{Forecast equation} && \\hat{y}\_{t+h\|t} & = \\ell\_{t}\\\\ \\text{Smoothing equation} && \\ell\_{t} & = \\alpha y\_{t} + (1 - \\alpha)\\ell\_{t-1}, \\end{align\*}\\\] where \\(\\ell\_{t}\\) is the level (or the smoothed value) of the series at time \\(t\\). Setting \\(h=1\\) gives the fitted values, while setting \\(t=T\\) gives the true forecasts beyond the training data.
The forecast equation shows that the forecast value at time \\(t+1\\) is the estimated level at time \\(t\\). The smoothing equation for the level (usually referred to as the level equation) gives the estimated level of the series at each period \\(t\\).
If we replace \\(\\ell\_t\\) with \\(\\hat{y}\_{t+1\|t}\\) and \\(\\ell\_{t-1}\\) with \\(\\hat{y}\_{t\|t-1}\\) in the smoothing equation, we will recover the weighted average form of simple exponential smoothing.
The component form of simple exponential smoothing is not particularly useful on its own, but it will be the easiest form to use when we start adding other components.
### Flat forecasts
Simple exponential smoothing has a “flat” forecast function: \\\[ \\hat{y}\_{T+h\|T} = \\hat{y}\_{T+1\|T}=\\ell\_T, \\qquad h=2,3,\\dots. \\\] That is, all forecasts take the same value, equal to the last level component. Remember that these forecasts will only be suitable if the time series has no trend or seasonal component.
### Optimisation
The application of every exponential smoothing method requires the smoothing parameters and the initial values to be chosen. In particular, for simple exponential smoothing, we need to select the values of \\(\\alpha\\) and \\(\\ell\_0\\). All forecasts can be computed from the data once we know those values. For the methods that follow there is usually more than one smoothing parameter and more than one initial component to be chosen.
In some cases, the smoothing parameters may be chosen in a subjective manner — the forecaster specifies the value of the smoothing parameters based on previous experience. However, a more reliable and objective way to obtain values for the unknown parameters is to estimate them from the observed data.
In Section [7\.2](https://otexts.com/fpp3/least-squares.html#least-squares), we estimated the coefficients of a regression model by minimising the sum of the squared residuals (usually known as SSE or “sum of squared errors”). Similarly, the unknown parameters and the initial values for any exponential smoothing method can be estimated by minimising the SSE. The residuals are specified as \\(e\_t=y\_t - \\hat{y}\_{t\|t-1}\\) for \\(t=1,\\dots,T\\). Hence, we find the values of the unknown parameters and the initial values that minimise \\\[\\begin{equation} \\text{SSE}=\\sum\_{t=1}^T(y\_t - \\hat{y}\_{t\|t-1})^2=\\sum\_{t=1}^Te\_t^2. \\tag{8.2} \\end{equation}\\\]
Unlike the regression case (where we have formulas which return the values of the regression coefficients that minimise the SSE), this involves a non-linear minimisation problem, and we need to use an optimisation tool to solve it.
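To illustrate what such an optimiser does, SSE from Equation (8.2) can be minimised by brute force over a grid of candidate \\(\\alpha\\) and \\(\\ell\_0\\) values. This sketch is in Python and the helper names `sse` and `fit_ses` are hypothetical; fable's `ETS()` uses a proper non-linear optimiser, not a grid search:

```python
# Brute-force illustration of choosing alpha and l0 by minimising SSE (Equation (8.2));
# fable's ETS() uses a proper non-linear optimiser, so this is a sketch only.
def sse(y, alpha, l0):
    level, total = l0, 0.0
    for obs in y:
        e = obs - level                         # e_t = y_t - y-hat_{t|t-1}, and y-hat_{t|t-1} = l_{t-1}
        total += e * e
        level = alpha * obs + (1 - alpha) * level
    return total

def fit_ses(y, steps=101):
    # Search alpha on [0, 1] and l0 on [min(y), max(y)]
    lo, hi = min(y), max(y)
    grid = ((a / (steps - 1), lo + (hi - lo) * b / (steps - 1))
            for a in range(steps) for b in range(steps))
    return min(grid, key=lambda p: sse(y, *p))

y = [39.04, 46.24, 19.79, 24.68, 25.08, 22.60, 25.99, 23.43]  # first Algerian export values
alpha_hat, l0_hat = fit_ses(y)
```

With the full 58-observation series, this kind of minimisation yields the estimates reported below; the grid version is only meant to show that the objective is a function of two unknowns that must be searched numerically.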
### Example: Algerian exports
In this example, simple exponential smoothing is applied to forecast exports of goods and services from Algeria.
```
# Estimate parameters
fit <- algeria_economy |>
  model(ETS(Exports ~ error("A") + trend("N") + season("N")))
fc <- fit |>
  forecast(h = 5)
```
This gives parameter estimates \\(\\hat\\alpha=0.84\\) and \\(\\hat\\ell\_0=39.5\\), obtained by minimising SSE over periods \\(t=1,2,\\dots,58\\), subject to the restriction that \\(0\\le\\alpha\\le1\\).
In Table [8\.1](https://otexts.com/fpp3/ses.html#tab:export-ses) we demonstrate the calculation using these parameters. The second last column shows the estimated level for times \\(t=0\\) to \\(t=58\\); the last few rows of the last column show the forecasts for \\(h=1\\) to \\(5\\) steps ahead.
| Year | Time | Observation | Level | Forecast |
|---|---|---|---|---|
| | \\(t\\) | \\(y\_t\\) | \\(\\ell\_t\\) | \\(\\hat{y}\_{t\\vert t-1}\\) |
| 1959 | 0 | | 39\.54 | |
| 1960 | 1 | 39\.04 | 39\.12 | 39\.54 |
| 1961 | 2 | 46\.24 | 45\.10 | 39\.12 |
| 1962 | 3 | 19\.79 | 23\.84 | 45\.10 |
| 1963 | 4 | 24\.68 | 24\.55 | 23\.84 |
| 1964 | 5 | 25\.08 | 25\.00 | 24\.55 |
| 1965 | 6 | 22\.60 | 22\.99 | 25\.00 |
| 1966 | 7 | 25\.99 | 25\.51 | 22\.99 |
| 1967 | 8 | 23\.43 | 23\.77 | 25\.51 |
| | ⋮ | ⋮ | ⋮ | ⋮ |
| 2014 | 55 | 30\.22 | 30\.80 | 33\.85 |
| 2015 | 56 | 23\.17 | 24\.39 | 30\.80 |
| 2016 | 57 | 20\.86 | 21\.43 | 24\.39 |
| 2017 | 58 | 22\.64 | 22\.44 | 21\.43 |
| | \\(h\\) | | | \\(\\hat{y}\_{T+h\\vert T}\\) |
| 2018 | 1 | | | 22\.44 |
| 2019 | 2 | | | 22\.44 |
| 2020 | 3 | | | 22\.44 |
| 2021 | 4 | | | 22\.44 |
| 2022 | 5 | | | 22\.44 |
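The recursion behind the table can be reproduced in a few lines (shown in Python, for illustration), using \\(\\hat\\alpha=0.84\\) and the initial level \\(\\ell\_0=39.54\\) from the 1959 row:

```python
# Reproduce the first rows of Table 8.1:
#   l_t = alpha * y_t + (1 - alpha) * l_{t-1},   y-hat_{t|t-1} = l_{t-1}
alpha, level = 0.84, 39.54
rows = []
for year, y in [(1960, 39.04), (1961, 46.24), (1962, 19.79)]:
    forecast = level                       # one-step forecast uses last year's level
    level = alpha * y + (1 - alpha) * level
    rows.append((year, round(level, 2), round(forecast, 2)))
print(rows)  # → [(1960, 39.12, 39.54), (1961, 45.1, 39.12), (1962, 23.84, 45.1)]
```

The printed levels and one-step forecasts match the 1960–1962 rows of the table, and continuing the recursion to 2017 gives the final level \\(\\ell\_{58}=22.44\\), which is the flat forecast for 2018–2022.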
The black line in Figure [8\.2](https://otexts.com/fpp3/ses.html#fig:ses) shows the data, which has a changing level over time.
```
fc |>
  autoplot(algeria_economy) +
  geom_line(aes(y = .fitted), col = "#D55E00",
            data = augment(fit)) +
  labs(y = "% of GDP", title = "Exports: Algeria") +
  guides(colour = "none")
```

Figure 8.2: Simple exponential smoothing applied to exports from Algeria (1960–2017). The orange curve shows the one-step-ahead fitted values.
The forecasts for the period 2018–2022 are plotted in Figure [8\.2](https://otexts.com/fpp3/ses.html#fig:ses). Also plotted are one-step-ahead fitted values alongside the data over the period 1960–2017. The large value of \\(\\alpha\\) in this example is reflected in the large adjustment that takes place in the estimated level \\(\\ell\_t\\) at each time. A smaller value of \\(\\alpha\\) would lead to smaller changes over time, and so the series of fitted values would be smoother.
The prediction intervals shown here are calculated using the methods described in Section [8\.7](https://otexts.com/fpp3/ets-forecasting.html#ets-forecasting). The prediction intervals show that there is considerable uncertainty in the future exports over the five-year forecast period. So interpreting the point forecasts without accounting for the large uncertainty can be very misleading.
***
1. In some books it is called “single exponential smoothing”.[↩︎](https://otexts.com/fpp3/ses.html#fnref16)