Climatic variables such as rainfall and temperature have nonlinear and non-stationary characteristics such that analysing them using linear methods inconclusive results are found. Ensemble empirical mode decomposition (EEMD) is a data-adaptive method that is best suitable for data with nonlinear and non-stationary characteristics. The average monthly rainfall and temperature data for a selected region in South Africa are decomposed into intrinsic mode functions (IMFs) at different time scales using EEMD. The IMFs exhibit an inter-annual to inter-decadal variability. The influence of climatic oscillations such as El-Niño Southern Oscillation (ENSO) and quasi-biennial oscillation (QBO) is identified. The influence of temperature variability on rainfall is also shown at different time scales. Based on the results obtained, the EEMD method is found to be suitable to identify different oscillations in the rainfall and temperature data.
The annual variation of rainfall in Southern Africa, directly and indirectly, affect human livelihoods and ecosystem through droughts, temperature changes, water supply problems and reduced agricultural production. Most of Southern Africa experience Austral summer rainfall (October to March) and north-eastern to south-western regions experience austral winter rainfall (May to August) (Dieppois et al., 2016). The summer and winter rainfall variability has been shown to be influenced by El Niño-Southern Oscillation (ENSO) and becomes nonlinear, with extreme weather conditions (Phillippon et al., 2012; Fauchereau et al., 2009; Kane, 2009). The temperature in South Africa has increased by over one and half times more than the globally observed temperature increases (Kruger & Shongwe, 2004; DEA, 2011; Mackellar et al., 2014).
El Niño describes the warming of sea surface temperature that occurs periodically, typically concentrated in the central-east equatorial Pacific (Lehodey et al., 1997). La Niña is the term adopted which describes the opposite side of the fluctuations. The Niño 3.4 index is one of the key atmospheric indices used to gauge the strength of El Niño and La Niña. The other driver that has been shown to have an impact on weather patterns is the Quasi-Biennial Oscillation (QBO) (Begue et al., 2010). QBO is a regular variation of the winds that blow high above the equator. Strong winds in the stratosphere travel in a belt around the planet, and in about 14 months these winds completely change direction (Kane, 2009). In the study by Kane (2009), it was shown that the warming of climate and sea surface temperatures has an impact on the rainfall patterns. Due to the influences of these climatic drivers and others, the weather patterns become nonlinear and non-stationary such that linear models are sometimes inconclusive (Huang et al., 1999; Molla et al., 2006). Fourier based methods assume that the data is linear and the data must be strictly periodic which is not the case with climate data (Schulte, 2016).
Empirical mode decomposition (EMD), was introduced by Huang et al. (1998), which does not make assumptions about linearity and stationarity of the time series and it is best suitable to analyse climate data. The time series is decomposed into different time scales called intrinsic mode functions (IMFs), which can reveal intrinsic changes in the climate system (Huang et al., 1998; Wu et al., 2009). EMD has a challenge of mixing the signals from one IMF to another, therefore, to cater for this, a noise assisted method named Ensemble Empirical Mode Decomposition (EEMD) was introduced (Wu et al., 2009).
EEMD has been applied widely in the hydrological and atmospheric studies. Chiew et al. (2005) applied EMD to annual streamflow to find significant oscillations in the data. Twenty unimpaired catchments from different parts of the world were used and 3, 6–7, 11–15 and 20–25 year oscillations in the stream flows were identified. In another study, the climate variability was observed in the case study of East and Central Africa, which was demonstrated by mapping the coupling between precipitation variables, inter-annual vegetation changes and the ENSO and Indian Ocean Dipole (IOD). EEMD was adopted because of its ability to breakdown normalised vegetation index (NDVI) series into multiple time scales components and its generic ability to be used with other time series analysis tools (Hawinkel et al., 2015, 2016). The Pacific Decadal Oscillation (PDO) is known to have an influence over East China’s annual and summer rainfall. Using EEMD the monthly rainfall influence of PDO was identified (Yang et al., 2017). The intrinsic oscillations in the land surface temperature of Wuhan, China were revealed using EEMD by decomposing the data into annual, inter-annual, noise and trend (Liu et al., 2019). In South Korea several climate drivers have been shown to have an impact on the precipitation, however many studies did not take into consideration the inherent cycles in the long term precipitation. Using EEMD, cross-correlation and multiple linear regression on the influence of ENSO, QBO, Arctic Oscillation, Atlantic Meridional Mode and others was shown at the monthly level (Kim et al., 2018). Some notable applications of EEMD include finding the effects of temperature and precipitation trends in Plateau droughts (Sun & Ma, 2015) and identifying the variability of monthly precipitation in Iran (Alizadeh et al., 2019).
In this study EEMD is used to decompose a 38-year rainfall and temperature data for a selected region in Western Cape, South Africa to reveal underlying physical signals. The influence of ENSO, QBO and temperature on the rainfall pattern is identified. The selected region chosen has been facing a lot of water challenges recently. There has been growing general interest in the winter rainfall mainly due to a threat of “day zero” in 2018 (Kruger et al., 2017; Maxmen, 2018; Wolski, 2018). However, very few studies have used statistical analysis to investigate the rainfall variability during winter. This is the first study that uses EEMD and synchronisation to investigate the influence of ENSO and QBO on the rainfall pattern for the region. This paper is organized as follows: in section 2, the data set and methodology are described; in section 3, the analysis and discussion of the results is done and conclusion is done in section 4.
The average monthly rainfall and temperature data were obtained from the South African Weather Service (SAWS) for the period 1980 to 2018 for an area between 18.2–19.2ºE and 33.5–34.5ºS, which contains 11 weather stations as shown in Figure 1. The area selected receives winter rainfall from April to August. The data for Niño 3.4 and QBO is publicly available on Climate Explorer website (http://climexp.knmi.nl) (Oldenborgh & Burgers, 2005).
Rainfall and temperature data from weather stations has challenge of having missing data. Therefore, to cater for the missing data multivariate imputation by chained equations (MICE) method is used to impute the missing data (Van Buuren & Groothuis-Oudshoorn, 2011). The MICE method was chosen for this study because it does not assume normal distribution of the data and it also assumes missing at random (MAR). The R package ‘mice’ developed by Van Buuren & Groothuis-Oudshoorn (2011) is used for imputation.
Empirical Mode Decomposition (EMD) is an adaptive time-frequency data representation technique, which requires only that the data must consist of a simple intrinsic mode of oscillations (Huang et al., 1998). It is most suitable for nonlinear and non-stationary data. The EMD methodology is based on a shifting process, which identifies local extrema (maxima and minima) and results in the formation of intrinsic mode functions (IMFs). In order to decompose a given time series xt into IMFs the following algorithm is used:
The first IMF contain the highest fluctuations and this is subtracted from the original data and subsequent IMFs are then derived from the subtracted data. The IMFs and residual data approximate the original data when they are summed together (Huang et al., 1998). A time series is decomposed into IMFs, ht (i) (i = 1,2, …n) and residual rt so that the original data is approximated by the sum of IMFs and residual.
EMD has a challenge of mode mixing, where signals from one IMF is found in another. Therefore, a noise assisted method, Ensemble Empirical Mode Decomposition (EEMD), which consist of adding white noise before carrying out EMD algorithm was introduced (Wu & Huang, 2009). For a given time series xt an ensemble of white noise of size m, εj (j = 1,2…,m), is introduced to each data point, xi, such that the ith “artificial” value becomes
An average of the IMFs found from the data with noise becomes the final IMF, that is
where dt (j) is the IMF of the time series with added noise, yt and ct (i) is the final ith IMF for the original time series xt. The average of residuals from yt gives the final residual that is,
whereare residuals from the time series with added noise.
The IMFs must be mutually orthogonal to each other. Higher orthogonality corresponds to less amount of information leakage. The index of orthogonality (IO) is used to calculate the orthogonality which is given by
where i and j represents the ith and jth IMFs and n is the size of the IMF (Molla et al., 2006).
Synchronisation of coupled oscillating systems means appearance of certain relations between their phases and frequencies (Rosenblum et al., 2001). Here we use this concept in order to reveal the interaction between rainfall and other climatic drivers. R package ‘synchrony’ is used which measures phase synchrony between quasiperiodic times series (Cazelles & Stone 2003). Time series that are phase synchronised or locked exhibit a modal distribution with a prominent peak at a given phase difference, whereas unrelated times series are characterized by a uniform or diffuse distribution.
The average monthly rainfall for the region located between 18.2–19.2ºE and 33.5–34.5ºS was standardised for easy computation and comparison. The multivariate imputation by chained equations is used to impute the missing rainfall and temperature data. The data was decomposed until when there is at most one maximum and one minimum in the residual. From the standardised rainfall data, 7 IMFs were found and are shown in Figure 2. IMF 3 has a period of about 12 months which corresponds to an annual (seasonal) oscillation, IMF 4 has a period of about 26 months which approximated a 2 year oscillation, IMF 5 has an oscillation of about 54 months (4.5 years) and IMF 6 captures a quasi-decadal oscillation (7 year period). Previous studies have used Wavelet Analysis, identified similar oscillations in the South African rainfall that were also found in this study (Kane, 2009; Dieppois et al., 2016). In these previous studies, winter rainfall was found to be having significant 2–3 year period and 3–4 year period which contributed to the rainfall variability. As compared to Wavelet Analysis, EEMD is adaptive, intuitive and does not use basis functions. Additionally, the impact of different climate drivers at different time scales can be shown. The graph on the bottom right of Figure 2 illustrates the residual plot, which shows the general trend of the rainfall. A study by Maúre et al. (2018) used several climatic models to predict the rainfall trend over Southern Africa under global warming and the study pointed to a decreasing daily rainfall for Western Cape, which is in agreement with the obtained residual plot.
The probability density function for each IMF is approximately normally distributed. The IMFs and residual are added together to reconstruct the data. The reconstructed data approximates the original data with Root Mean Square Error of order 10–14. This clearly shows that EMD is lossless decomposition with minimal data being lost in the decomposition and managing to capture most of the oscillations in the data.
It is noted that the maximum value of orthogonality between the IMFs is found to be approximately equal to 0.001 and it is way below the acceptable value of less than 0.1. The index of orthogonality for the IMFs is 0.594 × 10–4 for rainfall, 0.154 × 10–6 for Niño 3.4 and 0.235 × 10–5 for QBO. It confirms that there is less amount of information leakage.
The cross-correlation between rainfall and QBO and Niño 3.4 shows that there is no correlation as shown in the auto-correlation function (ACF) plot in Figure 3. However, when the time series were decomposed correlation is identified for IMF 3 for both Niño 3.4 and QBO as shown in Figure 4. There is a correlation of rainfall’s IMF3 and Niño 3.4 index at lag –4, –5, –6, 2 and 3. These results confirm the influence of ENSO on the seasonal rainfall and also the quasi-biennial oscillation which is consistent with results found by Philippon et al. (2012) and Kane (2009). Additionally, the general pattern of the rainfall at different time scales is identified up to quasi-decadal oscillation.
Cross-correlation can be used only when the time series is stationary, the Augmented Dicky Fuller Test (ADF) and Phillips-Perron Unit Root Test shows that IMF 4 to 7 are not stationary therefore cross-correlation cannot be used. The synchronisation does not require that the time series to be stationary hence it was used to identify any relationship for those IMFs. These IMFs found are further synchronised with Niño 3.4 and QBO and the results are shown in Figure 5 below. The results show that there is weak coupling between the original rainfall time series and Niño 3.4 or QBO. However, there is phase-locking identified for IMF 5 for Niño 3.4, since there is a clear peak. These results for Niño 3.4 shows that there may be an influence of ENSO on the rainfall pattern. This is in agreement with the results that were found by Philippon et al. (2012), which found that there was a significant association between ENSO and winter rainfall. They showed that there is a strong correlation of the May-June-July season with ENSO than any other period of the year. In this study using EEMD, the correlation is further done at a monthly level than seasonal level. The clear peak on the histogram of rainfall’s IMF6 and QBO (Figure 5) shows that there is phase locking. This QBO signal identified is in agreement with a study by Kane (2009) which also identified the presence of QBO and it was shown that it contributed significantly to the variability of the winter rainfall for the same region.
Temperature for the selected region was also decomposed into 7 IMFs and a residual. There is weak coupling between the original rainfall time series and temperature as shown in Figure 6 below. However, phase locking is observed for IMF 1 and IMF5, since there is a clear peak for both of them. IMF1 captures the noise found in the signal and IMF5 is a 4.5-year oscillation. The phase-locking in IMF1 suggest that the temperature variability may have an impact on the rainfall patterns. The synchronisation on IMF5 shows that the temperature changes may have a long term impact on the rainfall variability. This is consistent with other studies that have used different climatic models to find the impact of increasing temperature (Maúre et al., 2018; Nangombe et al., 2018; Nikulin et al., 2018). These models predict that there will be an increase in extreme rainfall patterns. In our study, we have managed to show the direct impact of increasing temperature using historical data.
The effectiveness of EEMD to analyse nonlinear and non-stationary data was demonstrated. The rainfall and temperature data were decomposed into IMFs and residual data, which summed up to the original data. The decomposed IMFs found can be used with other methods such as regression and neural networks to predict the impact of climate drivers in the future. EEMD was effective in isolating the data into different timescales and therefore the variability of the rainfall pattern was identified, in the end, evidence of the effect of ENSO and QBO was provided. Cross-correlation and phase synchronisation was used to find the relationship of the IMFs from the different time series under study. It will be of interest for future studies to carry out a study for a longer period to find the pattern of the rainfall over decades.
The additional files for this article can be found as follows:Niño 3.4 Index
A measurement of the strength of El-Niño and La-Niña. DOI: https://doi.org/10.5334/dsj-2019-046.s1QBO Data
Measurement of the variation of the wind above the equator. DOI: https://doi.org/10.5334/dsj-2019-046.s2Rainfall Data
Original rainfall data for the study area and the decomposed rainfall into IMFs. DOI: https://doi.org/10.5334/dsj-2019-046.s3Temperature Data
Original temperature data for the study area and the decomposed temperature into IMFs. DOI: https://doi.org/10.5334/dsj-2019-046.s4
The author would like to acknowledge South African Weather Service for the data that was used in this study and National Research Fund (Grant number 112979) for the funding of the work.
Willard Zvarevashe is the main author of the work. Venkataraman Sivakumar edited and contributed on the atmospheric part of the study. Syamala Krishnannair provided final approval of the version to be published.
The authors have no competing interests to declare.
Alizadeh, F, Roushangar, K and Adamowski, J. 2019. Investigating monthly precipitation variability using a multiscale approach based on ensemble empirical mode decomposition. Paddy and Water Environment, 1–19. DOI: https://doi.org/10.1007/s10333-019-00754-x
Cazelles, B and Stone, L. 2003. Detection of imperfect population synchrony in an uncertain world. Journal of Animal Ecology, 72(6), 953–968. DOI: https://doi.org/10.1046/j.1365-2656.2003.00763.x
Dieppois, B, Pohl, B, Rouault, M, New, M, Lawler, D and Keenlyside, N. 2016. Interannual to interdecadal variability of winter and summer southern African rainfall, and their teleconnections. Journal of Geophysical Research: Atmospheres, 121(11): 6215–6239. DOI: https://doi.org/10.1002/2015JD024576
Fauchereau, N, Pohl, B, Reason, CJC, Rouault, M and Richard, Y. 2009. Recurrent daily OLR patterns in the Southern Africa/Southwest Indian Ocean region, implications for South African rainfall and teleconnections. Climate Dynamics, 32(4): 575–591. DOI: https://doi.org/10.1007/s00382-008-0426-2
Hawinkel, P, Swinnen, E, Lhermitte, S, Verbist, B, Van Orshoven, J and Muys, B. 2015. A time series processing tool to extract climate-driven interannual vegetation dynamics using ensemble empirical mode decomposition (EEMD). Remote Sensing of Environment, 169: 375–389. DOI: https://doi.org/10.1016/j.rse.2015.08.024
Hawinkel, P, Thiery, W, Lhermitte, S, Swinnen, E, Verbist, B, Van Orshoven, J and Muys, B. 2016. Vegetation response to precipitation variability in East Africa controlled by biogeographical factors. Journal of Geophysical Research: Biogeosciences, 121(9): 2422–2444. DOI: https://doi.org/10.1002/2016JG003436
Huang, NE, Shen, Z and Long, SR. 1999. A new review of non-linear water waves: the Hilbert spectrum. Annual Review of Fluid Mechanics, 31: 417–457. DOI: https://doi.org/10.1146/annurev.fluid.31.1.417
Huang, NEZ, Shen, R, Long, S, Wu, MC, Shih, HH, Zheng, Q, Yen, NC, Tung, V and Liu, HH. 1998. The empirical mode decomposition and the Hilbert spectrum for nonlinear and non-stationary time series analysis. Proceeding of the Royal Society of London, Series A. DOI: https://doi.org/10.1098/rspa.1998.0193
Kane, R. 2009. Periodicities, ENSO effects and trends of some South African rainfall series: an update. South African Journal of Science, 105(5–6): 199–207. DOI: https://doi.org/10.4102/sajs.v105i5/6.90
Kim, T, Shin, JY, Kim, S and Heo, JH. 2018. Identification of relationships between climate indices and long-term precipitation in South Korea using ensemble empirical mode decomposition. Journal of Hydrology, 557: 726–739. DOI: https://doi.org/10.1016/j.jhydrol.2017.12.069
Kruger, AC and Nxumalo, MP. 2017. Historical rainfall trends in South Africa: 1921–2015. Water SA, 43(2): 285–297. DOI: https://doi.org/10.4314/wsa.v43i2.12
Kruger, AC and Shongwe, S. 2004. Temperature trends in South Africa: 1960–2003. International Journal of Climatology: A Journal of the Royal Meteorological Society, 24(15): 1929–1945. DOI: https://doi.org/10.1002/joc.1096
Lehodey, P, Bertignac, M, Hampton, J, Lewis, A and Picaut, J. 1997. El Niño southern oscillation and tuna in the western pacific. Nature, 389(6652): 715–718. DOI: https://doi.org/10.1038/39575
Liu, H, Zhan, Q, Yang, C and Wang, J. 2019. The multi-timescale temporal patterns and dynamics of land surface temperature using Ensemble Empirical Mode Decomposition. Science of the Total Environment, 652: 243–255. DOI: https://doi.org/10.1016/j.scitotenv.2018.10.252
MacKellar, N, New, M and Jack, C. 2014. Observed and modelled trends in rainfall and temperature for South Africa: 1960–2010. South African Journal of Science, 110(7–8): 1–13. DOI: https://doi.org/10.1590/sajs.2014/20130353
Maúre, G, Pinto, I, Ndebele-Murisa, M, Muthige, M, Lennard, C, Nikulin, G, Meque, A, et al. 2018. The southern African climate under 1.5 C and 2 C of global warming as simulated by CORDEX regional climate models. Environmental Research Letters, 13(6): 065002. DOI: https://doi.org/10.1088/1748-9326/aab190
Maxmen, A. 2018. As Cape Town water crisis deepens, scientists prepare for ‘Day Zero’. Nature, 554(7690). DOI: https://doi.org/10.1038/d41586-018-01134-x
Molla, MKI, Rahman, MS, Sumi, A and Banik, P. 2006. Empirical mode decomposition analysis of climate changes with special reference to rainfall data. Discrete dynamics in Nature and Society, 2006. DOI: https://doi.org/10.1155/DDNS/2006/45348
Nangombe, S, Zhou, T, Zhang, W, Wu, B, Hu, S, Zou, L and Li, D. 2018. Record-breaking climate extremes in Africa under stabilized 1.5 C and 2 C global warming scenarios. Nature Climate Change, 8(5): 375. DOI: https://doi.org/10.1038/s41558-018-0145-6
Nikulin, G, Lennard, C, Dosio, A, Kjellström, E, Chen, Y, Hänsler, A, van Meijgaard, E, et al. 2018. The effects of 1.5 and 2 degrees of global warming on Africa in the CORDEX ensemble. Environmental Research Letters, 13(6): 065003. DOI: https://doi.org/10.1088/1748-9326/aab1b1
Philippon, N, Rouault, M, Richard, Y and Favre, A. 2012. The influence of ENSO on winter rainfall in South Africa. International Journal of Climatology, 32(15): 2333–2347. DOI: https://doi.org/10.1002/joc.3403
Rosenblum, M, Pikovsky, A, Kurths, J, Schäfer, C and Tass, PA. 2001. Phase synchronization: from theory to data analysis. In: Handbook of biological physics, 4: 279–321. North-Holland. DOI: https://doi.org/10.1016/S1383-8121(01)80012-9
Schulte, JA. 2016. Wavelet analysis for non-stationary, nonlinear time series. Nonlinear Processes in Geophysics, 23(4): 257. DOI: https://doi.org/10.5194/npg-23-257-2016
Sun, C and Ma, Y. 2015. Effects of non-linear temperature and precipitation trends on Loess Plateau droughts. Quaternary International, 372: 175–179. DOI: https://doi.org/10.1016/j.quaint.2015.01.051
Van Buuren, S and Groothuis-Oudshoorn, K. 2011. mice: Multivariate Imputation by Chained Equations in R. Journal of Statistical Software, 45(3): 1–67. DOI: https://doi.org/10.18637/jss.v045.i03
van Oldenborgh, GJ and Burgers, G. 2005. Searching for decadal variations in ENSO precipitation teleconnections. Geophys. Res. Lett., 32(15): L15701. DOI: https://doi.org/10.1029/2005GL023110
Wolski, P. 2018. How severe is Cape Town’s “Day Zero” drought? Significance, 15(2): 24–27. DOI: https://doi.org/10.1111/j.1740-9713.2018.01127.x
Wu, Z and Huang, NE. 2009. Ensemble Empirical Mode Decomposition: A noise-assisted data analysis method. Advanced Adaptive Data Analysis, 1: 1–41. DOI: https://doi.org/10.1142/S1793536909000047
Yang, Q, Ma, Z and Xu, B. 2017. Modulation of monthly precipitation patterns over East China by the Pacific Decadal Oscillation. Climatic change, 144(3): 405–417. DOI: https://doi.org/10.1007/s10584-016-1662-9