This post focuses on models having innovations which are not only independent identically distributed (IID) but Gaussian. It is clear why this makes them so easy to analyze and simulate.
Model building goals. We evaluate the model based on innovations.
- Is each series of innovations a (weak) white noise, measured by the empirical autocorrelation function (ACF) and standard Ljung-Box white noise tests? If yes, this would mean autocorrelations are zero. But this does not yet mean the series is truly IID. Stochastic volatility models show why.
- Is each series of innovations after taking their absolute values have autotocorrelations zero? We again apply the empirical ACF and the Ljung-Box white noise tests. If yes, and the first answer above is also yes, then it is reasonable to model this series as IID.
- Is each series of innovations Gaussian? We can ask this question only if we answered affirmatively on the first two. This is answered by making a quantile-quantile plot versus the normal distribution and applying Jarque-Bera normality test.
As an attentive reader can see, the techniques are essentially the same as previously. But there are a couple of important differences.
- First, we apply the Ljung-Box white noise tests based on the (weighted) sum of squares, not the customized sum of absolute values tests we considered previously. I think it is simply easier and better known to apply Ljung-Box tests. The test based on L1 norm did not really show anything special different from L2 tests.
- Second, we do not apply the Shapiro-Wilk normality test. We consider it to be a bit of an overkill. Jarque-Bera test captures skewness and fat tails commonly present in financial analysis which prevent the data from being normal. And anyway, the Jarque-Bera test is present in the standard Python OLS regression output.
Let us stress what features we are not interested in.
- Maximizing
of linear regressions
- Information criteria (Akakie/Bayesian)
- Minimizing the standard error
of regression residuals.
Data description. We have annual data 1927-2025. It is available as a spreadsheet here. Take five data series:
- Total returns invested in S&P 500 and its predecessor, S&P 90:
during year
available 1928-2025. End-of-year values were taken by Ian Anderson from Yahoo Finance, and dividend annual data is taken from Robert Shiller’s data library. See the source here.
- Total returns for international stocks (see remark below):
available 1970-2025.
- Total return index value
for the USA corporate bonds (measured by Bank of America Intercontinental Exchange total return index value, taken from Federal Reserve Economic Data (FRED) web site), available 1972-2025.
- Annual realized volatility
for year
Recall that this was computed by Angel Piotrowski for 1928-2025.
- December daily average BAA Moody’s rate:
available for 1927-2025 also from FRED web site.
A note on international stocks. The total returns of international stocks are now measured by a customized portfolio of 88% MSCI EAFE and 12% MSCI Canada. I did this adjustment because the data for MSCI EAFE (=developed markets including Europe, Australia and Far East) was available from 1970 on the web site Novel Investor, as opposed to MSCI emerging markets (available only from 1988), but this EAFE index did not include Canada! I thought this is very unfair, since Canada is a major component (~12%) of existing developed market stock ETFs. So I decided to include it manually. The data for Canada was also available as another MSCI index from 1970.
The simplest model equations. Try modeling where
are innovation series. This is in line with our long-standing idea of dividing stock returns by volatility to make them closer to IID Gaussian. It works perfectly well here.
Also, Note that this makes the rates non-stationary: More like a geometric random walk, except we have stochastic volatility here. This is one more remarkable example of how to use stock volatility for bonds, which we discussed earlier.
Next, the equation for volatility stays the same as in Angel Piotrowski’s analysis:
Finally, the equation for bond returns is as follows:
Each is IID Gaussian series with mean zero. This is confirmed by the tests and graphs above.
Properties of this model. As mentioned above, rates and therefore bond returns are non-stationary. But stock returns and volatility are stationary. Also, stock returns do not have duration in their modeling. In fact, if we include the same difference term, it would be highly significant, with extremely high T-value. But we decided to create the simplest model.
Extensions. We can increase complexity of this model as follows:
- Include
then the innovations are also IID Gaussian. This involves duration for stocks just like for bonds. The values of coefficients are significantly different from zero. Accept!
- Include a constant for increments of log rates
and
has
for Student T-test. Accept!
- On top of item 1, for stock returns (both domestic and international) we can add volatility as an additive factor, not just multiplicative:
Here
is different from zero, judging by the Student T-test. Accept!
- We could run the autoregression with stochastic volatility terms
but this violates normality of
Same would be true for the simple autoregression
Reject! Unfortunately, this means we must consider a non-stationary model.
- We could add volatility to bond returns as an additive factor, not just a multiplicative one. Thus we make regression
But this would fail the IID assumption. Reject!
- Writing
we replace de facto arithmetic returns with geometric returns, but in a modified way. The IID Gaussian assumption holds. Accept!
- Adding volatility to bond returns as an additive factor, not just a multiplicative one: Similarly to 5, but with model as in 6, then the IID fails. Reject!
The new valuation measure. Following previous blog posts, we consider comparing total annual returns with annual dividend growth and detrending it. Take cumulative quantities, which can be expressed using current dividends: and regress
versus the previous value
and the time trend
We get: [/latex] M(t+1) – M(t) = \alpha + \beta M(t) + \gamma t + W(t). [/latex] Similarly to the article we rewrite this as a simple autoregression for detrended
This autoregression will also have residuals
Such residuals are tested and they do not pass our tests: They are not IID. Reject!
The long-short term spread. We considered this spread between 10-year and 3-month average December annual rates, and its predecessors from 1927.
- Classic autoregression:
- AR with stochastic volatility:
- AR with stochastic volatility but without volatility as an additive factor:
Unfortunately, none has Gaussian residuals. Reject!
Leave a comment