Gold vs Gold-Miners Statistical Arbirtage

Here is my implementation, in QuantConnect, of Ernie Chan's "Gold vs Gold-Miners Statistical Arbirtage" strategy found in his book Quantitative Trading.

A stationary time-series is mean-reverting by definition since it never drifts far from its initial/mean value. Unfortunately, price series are not stationary as we can model them as geometric brownian motion that does drift from its initial value. However, the market value of a pair (or basket) of assets, often from the same industry, are stationary when they are cointegrated: a linear combination of their prices are integrated of order zero. Why? Because the assets of a pair respond equally to the same shocks (news) and, consequently, the "walk" together.

In the proposed methodology, we do not rely on any statistical metric to select the pair nor we verify that the pair is cointegrated with a statistical test. This is something we can explore in future implementations using, for example, the Universe Selection feature. We just assume their are as Chan did. He has selected GDX (Gold Miners ETF) and GLD (Gold ETF).

The first step is to define the linear combination: we make a linear regression of the price series using MathNet.Numerics.Fit.Line(x, y), wthere x is GLD price series and y is GDX one to find beta (AKA hedge ratio). The spread is, thus, defined as

e = x - beta * y.

This is important: the spread is our stationary time-series.
Next, we calculate the simple moving average and standard deviation of the spread in order to express it in terms of z score:

zScore = (e - sma) / std.

This derived stationary time-series is centered at zero and the scaled to the spread standard deviation.

Finally, the trading logic is reduced to a comparison between the z score and a given threshold (+/- 0.5). In the attached project, we enter positions when zscore is higher than 0.5 or lower than -0.5 and close the positions once it returns to its mean (cross zero):

Long trade
Entry condition: zScore < -0.5
Buy 1000 shares of GLD
Sell 1000 * beta shares of GDX
Exit condition: zScore > 0

Short trade
Entry condition: zScore > 0.5
Sell 1000 shares of GLD
Buy 1000 * beta shares of GDX
Exit condition: zScore < 0

We can make some sensibility tests on the threshold. In theory, higher values give us more certainty, but fewer trades. Also, we can change the test window length and how ofter beta is re-evaluated. In this algorithm, we have evaluated beta only once with prices from the first 90 trading days.

The material on this website is provided for informational purposes only and does not constitute an offer to sell, a solicitation to buy, or a recommendation or endorsement for any security or strategy, nor does it constitute an offer to provide investment advisory services by QuantConnect. In addition, the material offers no opinion with respect to the suitability of any security or specific investment. QuantConnect makes no guarantees as to the accuracy or completeness of the views expressed in the website. The views are subject to change, and may have become unreliable for various reasons, including changes in market conditions or economic circumstances. All investments involve risk, including loss of principal. You should consult with an investment professional before making any investment decisions.

Good Job Mikhail Socrates!

Just in case, I implemented the methodology in this paper to find cointegrated pairs.

The method is in Python, IIRC it works fine, but I don’t remember if is tested in depth.

Hope it helps.

First, nice implementation and thanks for sharing.

It does appear that inverting the strategy (as in, changing the sign of the order quantities) makes money mid 2011-2014.

JayJayD

42.9k ,

Petter Hansson

10.5k ,

Yan Xiaowei

24k ,

Hi Mikhail,

Two thoughts on this:

1. I wonder whether there is any intercept from the linear regression or the p-value of the intercept is big enough to neglect the intercept? I saw your formula e = x - beta * y, but no intercept.

2. I think it would be better to open your position when the z-score breaks your threshold(0.5,-0.5) twice. The reason is that the first time it break the threshold, the prices are expected to continue diverging, and only when the s-score come back(break the threshold the second time) means the prices are converging.

Roger M. Benites

344 ,

Thanks for sharing! Great Write-up!

INVESTOR

Update Backtest

Notebook

person upvoted this people upvoted this

To unlock posting to the community forums please complete at least 30% of Boot Camp.
You can continue your Boot Camp training progress from the terminal. We hope to see you in the community soon!

Platform

Radically Open-Source Algorithmic Trading Engine

Join Our Discord Channel

Quarterly Open-Source Trading Competition

Draft Discussions

Bookmarked Discussions

SEARCH DISCUSSIONS

TOP 5 Research PUblications

About Quant League

competition rules

previous competitions

286,500 Quants.

VOTE FOR UPCOMING FEATURES

Gold vs Gold-Miners Statistical Arbirtage

Organization

Team

Clone Strategy

Previous Ranking

IN THIS RESEARCH

PARTICIPANTS

Discussion Awards

Actions

Join QuantConnect for Free

Platform

SIGN IN

Radically Open-Source Algorithmic Trading Engine

Join Our Discord Channel

Quarterly Open-Source Trading Competition

Draft Discussions

Bookmarked Discussions

SEARCH DISCUSSIONS

TOP 5 Research PUblications

About Quant League

competition rules

previous competitions

286,500 Quants.

VOTE FOR UPCOMING FEATURES

Gold vs Gold-Miners Statistical Arbirtage

Organization

Team

Clone Strategy

Previous Ranking

IN THIS RESEARCH

PARTICIPANTS

Discussion Awards

SHARE RESEARCH

SHARE DISCUSSION

SHARE ARTICLE

SHARE

Actions

Join QuantConnect for Free