Can Crude Oil Predict Equity Returns

The Nasdaq Data Link datasets that this research uses have been discontinued.

Abstract

In this tutorial we use regression to predict the stock market return and compare it to the short-term U.S. T-bill rate. It is based on the paper "Striking Oil: Another Puzzle?" by Gerben Driesprong, Ben Jacobsen and Benjamin Maat (2007). If the predicted return is larger than the risk-free rate, the portfolio is fully invested in stocks; if it is lower, the portfolio is invested in short-term U.S. T-bills. In the paper, the backtesting period starts in 1980 and is divided into an in-sample period, where the regression is estimated, and an out-of-sample period, where the regression result is embedded "statically" into the strategy.

In our implementation we adapt the method of the original paper to make it more applicable to the current market. We set the backtesting period to 2010 through 2017 and refresh the regression each month to form a rolling, dynamic projection, because empirical evidence shows the correlation between oil and stocks is not as strong as it was in the 1980s. We use the price of S&P GSCI Crude Oil Total Return Index ETNs to represent the spot oil price. We import the oil price and T-bill data from Nasdaq Data Link. We use Scheduled Events to trigger an event every month automatically and the History function to retrieve data for regression analysis.

Our analysis shows this strategy underperforms the market in recent years. In the 9-year analysis period the algorithm was mostly long the S&P 500 index, and only 9 trades were performed because the markets were strongly bullish. The few trades that did occur may simply reflect the weakening of the relationship between stocks and oil.

Background

We assume the expected stock return is a linear function of the previous month's oil return. This can be represented by the regression equation:

\[r^{stock}_t=a_0+a_1r^{oil}_{t-1}+e_t\]

with

\[e_t=r^{stock}_t-E_{t-1}[r^{stock}_t]\]

The independent variable is the lagged oil return and the dependent variable is the stock return. We use monthly returns over a regression period of 2 years, which gives 22-23 observations to regress. Every month we rerun the regression and use the estimated coefficients to compute the expected stock return from the most recent oil return.
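For reference, the least-squares estimates over one regression window take the standard closed form below, and the one-month-ahead forecast follows by plugging in the latest oil return. This is the textbook OLS result rather than a formula quoted from the paper; the sums run over the paired observations in the window, and the bars denote sample means of the lagged oil returns and of the stock returns over that window.

\[\hat{a}_1=\frac{\sum_{t}(r^{oil}_{t-1}-\bar{r}^{oil})(r^{stock}_t-\bar{r}^{stock})}{\sum_{t}(r^{oil}_{t-1}-\bar{r}^{oil})^2},\qquad \hat{a}_0=\bar{r}^{stock}-\hat{a}_1\bar{r}^{oil}\]

\[E_t[r^{stock}_{t+1}]=\hat{a}_0+\hat{a}_1r^{oil}_t\]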

Method

The algorithm implementation consists of three main parts: defining the custom imported data, initializing the strategy parameters, and monthly re-balancing of the portfolio.

Step 1: Defining Custom Imported Data

We import oil price and T-bill data from Nasdaq Data Link, a marketplace for financial, economic and alternative data. In our initialize method, we use the following commands to add the custom data to our algorithm.

self.oil = self.add_data(NasdaqDataLink, "OPEC/ORB").symbol
self.tbill = self.add_data(NasdaqDataLink, "USTREASURY/BILLRATES").symbol

Step 2: Initialization of the Strategy Parameters

In our initialize method we set the cash amount, the start and end dates, and other parameters that are specific to this strategy. We set a parameter for the regression analysis period:

self.reg_period = timedelta(days=2*365)

The variable reg_period is the length of history we take into consideration in our regression analysis. We also need to set up the Scheduled Event in initialize to trigger the re-balancing function at the start of every month. (self.spy is assumed to hold the symbol of the stock subscription added earlier in initialize; that line is not shown here.)

self.schedule.on(self.date_rules.month_start(self.spy), self.time_rules.at(0, 0), self.monthly_reg)

Step 3: Monthly Re-balancing of the Portfolio

Every month we rerun the regression to determine whether to be 100% long stocks or T-bills. We perform this re-balancing in the monthly_reg function at the start of each month. We use the History function to retrieve historical data for oil and stocks, and we divide the T-bill rate by 12 to make it comparable to the monthly expected return of stocks.

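# Daily history for oil and SPY over the regression window
hist = self.history([self.oil, self.spy], self.reg_period, Resolution.DAILY)
# Keep the last observation of each month
oilSeries = hist.loc[self.oil]['value'].resample('m').last()
spySeries = hist.loc[self.spy]['close'].resample('m').last()
# Align both series on the months they have in common
index = sorted(set(oilSeries.index).intersection(spySeries.index))
oilSeries = oilSeries[index]
spySeries = spySeries[index]
# Convert the annualized T-bill rate to a monthly rate
rf = float(self.securities[self.tbill].price)/12.0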
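The month-end sampling and alignment step can be tried outside the algorithm on synthetic data. The sketch below is only an illustration: the helper names month_end_values and monthly_rate are hypothetical, the series are made up, and it groups by calendar month instead of calling resample so that it does not depend on a particular frequency alias. It also assumes the annual rate is already expressed as a decimal, which the tutorial does not state for the T-bill feed.

```python
import pandas as pd

def month_end_values(series):
    """Return the last observation in each calendar month (hypothetical helper)."""
    return series.groupby(series.index.to_period("M")).last()

def monthly_rate(annual_rate):
    """Simple, non-compounded conversion of an annual rate to a monthly one."""
    return annual_rate / 12.0

# Synthetic daily data: "oil" covers January to March 2016, "spy" starts on 1 February.
days = pd.date_range("2016-01-01", "2016-03-31", freq="D")
oil = pd.Series(range(len(days)), index=days, dtype=float)
spy = oil.iloc[31:] * 2.0

oil_m = month_end_values(oil)
spy_m = month_end_values(spy)

# Keep only the months present in both series, as the tutorial code does.
common = oil_m.index.intersection(spy_m.index)
oil_m = oil_m.loc[common]
spy_m = spy_m.loc[common]

rf = monthly_rate(0.012)  # a 1.2% annual rate expressed as a decimal
```

January is dropped from the oil series because the synthetic stock series has no January data, which mirrors what the index intersection does in the algorithm.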

We then fit an OLS regression with NumPy to predict next month's stock return.

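# Monthly oil returns
x = np.array(oilSeries)
x = np.diff(x)/x[:-1]
# Monthly stock returns
y = np.array(spySeries)
y = np.diff(y)/y[:-1]
# Regress this month's stock return on last month's oil return
A = np.vstack([x[:-1], np.ones(len(x[:-1]))]).T
beta, alpha = np.linalg.lstsq(A, y[1:], rcond=None)[0]
# Forecast next month's stock return from the latest oil return
yPred = alpha + x[-1]*beta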
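To check that the lag is lined up as intended, the same regression can be run on synthetic prices whose stock returns are built exactly from the previous period's oil return. This is a minimal standalone sketch, not the algorithm's code: predict_next_stock_return is a hypothetical helper and the price series are invented so that the true coefficients (0.005 and 0.5) are known in advance.

```python
import numpy as np

def predict_next_stock_return(oil_prices, stock_prices):
    """Fit r_stock[t] = a0 + a1 * r_oil[t-1] and forecast the next period."""
    oil = np.asarray(oil_prices, dtype=float)
    stock = np.asarray(stock_prices, dtype=float)
    x = np.diff(oil) / oil[:-1]        # oil returns
    y = np.diff(stock) / stock[:-1]    # stock returns
    # Pair the oil return at t-1 with the stock return at t.
    A = np.column_stack([x[:-1], np.ones(len(x) - 1)])
    (a1, a0), *_ = np.linalg.lstsq(A, y[1:], rcond=None)
    # The latest oil return, x[-1], drives the forecast for the next period.
    return a0, a1, a0 + a1 * x[-1]

# Synthetic returns: each stock return depends only on the previous oil return.
oil_returns = np.array([0.02, -0.01, 0.03, 0.00, -0.02, 0.01])
stock_returns = np.concatenate([[0.0], 0.005 + 0.5 * oil_returns[:-1]])

# Turn the returns into price paths, since the function works from prices.
oil_prices = 50.0 * np.cumprod(np.concatenate([[1.0], 1 + oil_returns]))
stock_prices = 100.0 * np.cumprod(np.concatenate([[1.0], 1 + stock_returns]))

a0, a1, forecast = predict_next_stock_return(oil_prices, stock_prices)
print(a0, a1, forecast)
```

Because the synthetic data contain no noise, the fit recovers the coefficients used to build them up to floating-point error, which would not happen if the two return series were paired without the one-period shift.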

Finally, we compare the expected stock return with the risk-free rate. If the former is larger, we invest fully in stocks; otherwise we liquidate our holdings. Because we cannot purchase T-bills in the backtest, the cash position earns nothing, so the performance is likely slightly underestimated.

if yPred > rf:
    self.set_holdings(self.spy, 1)
else:
    self.liquidate(self.spy)

Summary

We backtested this strategy from 2010 through 2017. It has a Sharpe ratio of 0.726, which is similar to the benchmark's 0.735 over the same period. Although the annual return closely matches that of the paper, this is largely a coincidence of the strong bull market in recent years. In most of the monthly regressions, the p-value is not small enough to reject the null hypothesis that there is no relationship between oil and stock returns, so investment decisions based on these statistically insignificant results are almost meaningless. The strategy does not beat the benchmark, most likely because the correlation between oil and stocks has weakened.
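One way to run this kind of significance check on each monthly regression is to compute the t-statistic of the slope and compare it with a critical value. The sketch below is an illustration on synthetic data, not the check used to produce the results above: slope_t_stat is a hypothetical helper, and the threshold 2.086 is the two-sided 5% critical value of the t-distribution with 20 degrees of freedom, which assumes 22 paired observations.

```python
import numpy as np

def slope_t_stat(x, y):
    """t-statistic of the OLS slope in y = a0 + a1 * x (hypothetical helper)."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    n = len(x)
    A = np.column_stack([x, np.ones(n)])
    coef, *_ = np.linalg.lstsq(A, y, rcond=None)
    resid = y - A @ coef
    sigma2 = resid @ resid / (n - 2)  # residual variance with n - 2 degrees of freedom
    se_slope = np.sqrt(sigma2 / np.sum((x - x.mean()) ** 2))
    return coef[0] / se_slope

# Synthetic data: 22 lagged oil returns and a deterministic alternating disturbance.
x = np.linspace(-0.1, 0.1, 22)
noise = 0.02 * (-1.0) ** np.arange(22)

t_unrelated = slope_t_stat(x, noise)           # stock returns unrelated to oil
t_related = slope_t_stat(x, 0.5 * x + noise)   # stock returns built with slope 0.5

T_CRIT = 2.086  # two-sided 5% critical value, 20 degrees of freedom
print(abs(t_unrelated) > T_CRIT, abs(t_related) > T_CRIT)
```

When the absolute t-statistic stays below the critical value, the slope is not distinguishable from zero at the 5% level, which is the situation described for most of the monthly regressions.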

Further research and backtesting could be conducted on assets other than oil that have a stronger relationship with stocks.

Reference

Driesprong, G., Jacobsen, B. and Maat, B. (2007). Striking Oil: Another Puzzle? Page 1.

Runtime Error: Trying to retrieve an element from a collection using a key that does not exist in that collection throws a KeyError exception. To prevent the exception, ensure that the 'OPEC/ORB.NasdaqDataLink 2S' key exist in the collection and/or that collection is not empty.
  at wrapped_function
    raise KeyError(f"No key found for either mapped or original key. Mapped Key: {mKey}; Original Key: {oKey}")
 in PandasMapper.py: line 75
  at MonthlyReg
    oilSeries = hist.loc[self.oil]['value'].resample('m').last()
 in main.py: line 28

Chris Andreou

When I run this backtest I get an error.

Could you look into the strategy and apply a fix to prevent this error?

Looking forward to running it.

Jing Wu INVESTOR
