How Well Run Is Your ETF?

Tracking difference can be up for interpretation based on variations in input data.

Director of Research
Reviewed by: Elisabeth Kashner, Edited by: Elisabeth Kashner

Most ETFs track indexes. This means ETF investors can understand exactly how much it costs, all-in, to hold an ETF. ETF investors expect to get the index returns, minus fund expenses, with no surprises.

But it turns out there’s more to this story than you might expect. To tease out how well run—or how complex—an index-tracking ETF is, we generally turn to tracking difference, which is the performance gap between the ETF’s net asset value (NAV) returns and the returns of its underlying index. Alas, this measurement, while useful, can also be absolutely maddening.
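To make the definition concrete, here is a minimal Python sketch of tracking difference over a single holding period. The function names and sample values are illustrative assumptions, not FactSet's actual methodology, and the NAV figures are assumed to already include reinvested distributions.

```python
def total_return(start_value: float, end_value: float) -> float:
    """Simple holding-period return between two values."""
    return end_value / start_value - 1.0


def tracking_difference(nav_start: float, nav_end: float,
                        index_start: float, index_end: float) -> float:
    """NAV total return minus index return over the same period."""
    return total_return(nav_start, nav_end) - total_return(index_start, index_end)


# Hypothetical values: the fund's NAV gains 9.9% while the index gains 10.0%.
td = tracking_difference(100.0, 109.90, 1000.0, 1100.0)
print(f"Tracking difference: {td:.2%}")
```

A negative result means the fund trailed its index; in this made-up case the shortfall is about 10 basis points.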

In a perfect world, tracking difference tells the investor the exact difference between expectations (index performance) and reality (NAV total returns), excluding any trading costs. All else equal, we would expect the fund to trail its index by the net expense ratio, plus portfolio management slippage, minus foreign dividend tax recapture, minus securities lending revenue. The problem is that all else is not equal. And it gets worse when NAVs and the underlying index are valued at different times of day.
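The all-else-equal decomposition can be written down directly. The sketch below is only an illustration: the class name, field names and the sample figures are assumptions, and in practice slippage in particular is generally not published.

```python
from dataclasses import dataclass


@dataclass
class HoldingCostInputs:
    """Annualized components, expressed as decimals (0.0014 = 0.14%)."""
    net_expense_ratio: float
    pm_slippage: float            # portfolio management slippage
    dividend_tax_recapture: float
    sec_lending_revenue: float


def expected_shortfall(c: HoldingCostInputs) -> float:
    """Amount by which the fund is expected to trail its index."""
    return (c.net_expense_ratio + c.pm_slippage
            - c.dividend_tax_recapture - c.sec_lending_revenue)


# Hypothetical fund: 14 bps fee, 5 bps slippage, 10 bps tax recapture,
# 3 bps securities lending revenue.
inputs = HoldingCostInputs(0.0014, 0.0005, 0.0010, 0.0003)
shortfall = expected_shortfall(inputs)
print(f"Expected shortfall vs. index: {shortfall:.2%}")
```

With these made-up inputs the expected shortfall is smaller than the expense ratio, because recapture and lending revenue offset part of the fee.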

First off, the actual ex-post expense ratio might not equal the ex-ante ratio stated in the prospectus.

Second, portfolio management slippage is real, but generally not made available to the public.

Third, foreign dividend tax recapture might not square with the underlying index’s applied withholding rate.

To cap it all off, some issuers calculate NAVs for non-U.S. equity funds using exchange rates taken at 4:00 p.m. ET, rather than the standard WM/Reuters rates taken at 4:00 p.m. London time (11:00 a.m. ET).

Put that all together, and you have a bit of an interpretation problem.

Where Tracking Difference Works

Nevertheless, tracking difference remains quite a useful tool. Here are a few ways you can look at it:

First, a simple case, where tracking difference tells you all you need to know about long-term holding costs: the three S&P 500 ETFs. Because the portfolios are limited to U.S.-listed stocks, you don’t have to worry about foreign tax withholding or fair valuation. Also, securities lending will not be a huge factor in the U.S. large/midcap space. So the differences in holding costs will come down to expenses and portfolio management.

The table below highlights the differences between the SPDR S&P 500 (SPY), the Vanguard 500 ETF (VOO) and the iShares S&P 500 ETF (IVV).

 

| Ticker | Fund | Expense Ratio | Median Tracking Difference | Maximum Upside Deviation | Maximum Downside Deviation | Tracking Range |
|---|---|---|---|---|---|---|
| SPY | SPDR S&P 500 ETF Trust | 0.09% | -0.08% | -0.03% | -0.19% | 0.16% |
| IVV | iShares Core S&P 500 ETF | 0.04% | -0.05% | -0.04% | -0.07% | 0.04% |
| VOO | Vanguard S&P 500 Index Fund | 0.05% | -0.04% | -0.02% | -0.06% | 0.04% |

 

The first thing you see is that the median tracking difference is very close to the expense ratio (within ±0.01%), which is exactly what you would expect.

You might not have expected the differences in the range, which is the tracking difference span from the maximum upside to the maximum downside. The tracking range is a great measure of variability.

SPY’s tracking range is four times the width of IVV’s or VOO’s. That’s huge! But there are probably reasons. SPY is structured as a unit investment trust, which means it can’t reinvest any cash that comes in from dividends. The fund is required by law to hang on to the cash until it pays it to investors. From a pure tracking perspective, SPY is clearly the worst choice of the three.
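The range arithmetic is easy to reproduce from the table. The short Python sketch below uses the published figures converted to whole basis points; the variable and function names are illustrative only.

```python
# Figures from the S&P 500 table above, in basis points (1 bp = 0.01%).
# Tuple layout: (expense ratio, median TD, max upside, max downside).
FUNDS = {
    "SPY": (9, -8, -3, -19),
    "IVV": (4, -5, -4, -7),
    "VOO": (5, -4, -2, -6),
}


def tracking_range_bps(max_upside: int, max_downside: int) -> int:
    """Span from the best to the worst tracking difference observed."""
    return max_upside - max_downside


ranges = {t: tracking_range_bps(up, down) for t, (_, _, up, down) in FUNDS.items()}
# How far the median tracking difference sits from minus the expense ratio.
gaps = {t: er + median for t, (er, median, _, _) in FUNDS.items()}

for ticker in FUNDS:
    print(ticker, "range:", ranges[ticker], "bps;",
          "median vs. expense ratio:", gaps[ticker], "bps")
```

Recomputing from the rounded inputs gives IVV a 3 bp range rather than the published 0.04%. That gap is presumably a rounding effect in the published deviations, though the table does not say so.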

VOO seems to edge out IVV, despite its slightly higher expense ratio. This is an accident of history: IVV cut its expense ratio from 0.07% to 0.04% last month. All else equal, we should expect IVV’s tracking difference to approach -0.04%, and likely equal VOO’s, as time goes by. These are truly minuscule differences—too tiny to make or break an investment decision.

Tracking Difference And The EM Space

Tracking difference is useful in international funds, too, but you need to be a bit savvy about index construction to use it well. The next example, tracking difference among vanilla emerging market ETFs, introduces a wrinkle: variability in the indexers’ practices around foreign dividends.

 

| Ticker | Underlying Index Provider | Expense Ratio | Median Tracking Difference | Maximum Upside Deviation | Maximum Downside Deviation | Tracking Range | Index Foreign Dividend Withholding Rules |
|---|---|---|---|---|---|---|---|
| IEMG | MSCI | 0.14% | 0.08% | 0.51% | -0.04% | 0.54% | Institutional |
| EEM | MSCI | 0.69% | -0.44% | -0.33% | -0.67% | 0.35% | Institutional |
| SCHE | FTSE | 0.13% | -0.12% | 0.27% | -0.65% | 0.92% | Institutional |
| GMM | S&P | 0.59% | -0.90% | -0.29% | -1.61% | 1.32% | Institutional |
| VWO | FTSE | 0.15% | 0.07% | 0.45% | -0.11% | 0.57% | RIC |

 

A Few Observations

The apparent outperformance of the iShares Core MSCI Emerging Markets ETF (IEMG) and the iShares MSCI Emerging Markets ETF (EEM) relative to their expense ratios is likely attributable to inadvertent sandbagging: MSCI’s rules for accounting for net withholding are simply far more conservative than the actual experience of a U.S. fund running an ETF, and so the funds “pick up” that difference as positive relative performance.

The Schwab Emerging Markets Equity ETF (SCHE) has that same sandbagging relative to VWO, even though it seems like they track the same FTSE index series. The reality is they track slightly different index variants, with Vanguard’s registered investment company version a bit more realistic for a mutual fund in terms of withholdings than Schwab’s Institutional one.

Even so, SCHE’s tracking difference is still not great, underperforming by a median of 12 bps. That’s significant. Its overall tracking range of 0.92% is surprisingly large, in context. Why? Again—impossible to know precisely, but I’d suspect differences in securities lending revenues and optimization, as SCHE holds only 850 of the 982 stocks in the FTSE emerging index.

 

Speaking of optimization, the SPDR S&P Emerging Markets ETF (GMM) has a highly optimized portfolio, and it shows. Its range is very large in comparison with the other funds in the space, and its median is depressed relative to its expense ratio. Optimization almost always introduces wider tracking differences, which is fine if they’re consistently working in your favor. Unfortunately, here it doesn’t look like they are.

If holding costs are your main concern, eliminating SCHE and GMM for poor tracking makes sense. Eliminating EEM on cost alone makes sense too, narrowing the choice of a cheap-to-hold vanilla emerging market ETF down to two funds: IEMG and Vanguard Emerging Markets ETF (VWO). Given that VWO’s positive median tracking difference is far more believable than IEMG’s, and that their overall range is quite similar, I’d say that VWO is the clear winner from a holding cost perspective.

Of course, there’s a lot more to consider than just tracking difference—the two funds have radically different investments, with VWO including Chinese A-shares but ignoring South Korea entirely, while IEMG invests 15% of its portfolio in the country. These exposure differences will likely dwarf any tracking difference comparison.

Where Tracking Difference Fails

Sometimes, tracking difference explains more about the index business than portfolio management. Take a look at the U.S. Aerospace and Defense segment, and you’ll see that one fund, the PowerShares Aerospace & Defense Portfolio (PPA), has an unexpectedly positive tracking difference.

 

| Ticker | Fund Name | Expense Ratio | Median Tracking Difference | Maximum Upside Tracking Difference | Maximum Downside Tracking Difference | Underlying Index Return Variant |
|---|---|---|---|---|---|---|
| PPA | PowerShares Aerospace & Defense Portfolio | 0.64% | 0.96% | 1.28% | 0.73% | Price Return |
| XAR | SPDR S&P Aerospace & Defense ETF | 0.35% | -0.32% | -0.25% | -0.43% | Gross |
| ITA | iShares U.S. Aerospace & Defense ETF | 0.44% | -0.46% | -0.36% | -0.52% | Gross |

 

These three funds have similar holdings, with heavy stakes in firms like Boeing and Lockheed Martin. Securities lending opportunities are most likely quite similar and limited for all three providers, but PowerShares has not been actively lending stocks in PPA’s portfolio. By rights, PPA’s 12-month tracking difference should be pretty close to its 64 bp expense ratio. But PPA’s tracking difference appears to be positive, by 0.96%.

In fact, the positive tracking difference is an illusion. The index that FactSet uses to calculate PPA’s tracking difference, the SPADE Defense Index, does not reinvest any dividend payments. We have no idea why, as virtually every index in the world is calculated as a total return index by default. In this case, however, the index provider simply doesn’t, which introduces a huge sandbagging effect. To get a clear picture, we need to back the dividend yield out of the fund’s apparent outperformance.

 

PPA’s portfolio had a trailing 12-month dividend yield of 1.61% as of Oct. 1. Therefore, PPA’s corrected tracking difference should be 0.96% - 1.61% = -0.65%. That’s precisely what we should see from a fund with a 0.64% expense ratio.
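The correction is a one-line subtraction. Here it is as a small Python sketch using the PPA figures quoted above, held in whole basis points; the function name is illustrative, and using a trailing portfolio yield as a stand-in for the index's missing dividends is the article's approximation, not an exact adjustment.

```python
def corrected_tracking_difference_bps(reported_td_bps: int,
                                      dividend_yield_bps: int) -> int:
    """Remove the dividend yield that a price-return index leaves out."""
    return reported_td_bps - dividend_yield_bps


ppa_reported_td = 96     # 0.96% median tracking difference vs. price-return index
ppa_dividend_yield = 161  # 1.61% trailing 12-month dividend yield
ppa_expense_ratio = 64   # 0.64%

corrected = corrected_tracking_difference_bps(ppa_reported_td, ppa_dividend_yield)
# Distance between the corrected figure and minus the expense ratio.
gap_vs_expense_ratio = corrected + ppa_expense_ratio

print("Corrected tracking difference:", corrected, "bps")
print("Gap vs. expense ratio:", gap_vs_expense_ratio, "bps")
```

The corrected figure lands within a single basis point of what the expense ratio alone would predict.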

What a boring outcome. In the U.S. Aerospace & Defense area, funds’ expense ratios tell you pretty much all you need to know about their long-term holding costs. The tracking difference statistics simply highlight the importance of using total return indexes.

Don’t Dismiss The Impact Of Accounting Rules

It gets worse, and even more maddening. Sometimes fund NAVs simply don’t match the index values because of differences in their accounting rules.

This is on display dramatically in the Japan Total Market segment. The real action is in the range—the variability of tracking difference.

When we looked at the S&P 500 funds, the tracking range was a make-it-or-break-it field. Investors like predictability. Tight tracking ranges suggest that portfolio management is consistent over long periods of time.

Wide ranges mean that investors can’t know what’s coming, and that makes a body nervous. But in this case, investors can calm down, because the tracking range has been artificially inflated. Blame it on the fund accountants, or on clever fund construction.

 

| Ticker | Fund Name | Expense Ratio | Median Tracking Difference | Maximum Upside Tracking Difference | Maximum Downside Tracking Difference | Range |
|---|---|---|---|---|---|---|
| DBJP | Deutsche X-trackers MSCI Japan Hedged Equity ETF | 0.45% | -0.55% | -0.38% | -1.16% | 0.77% |
| FJP | First Trust Japan AlphaDEX Fund | 0.80% | -0.63% | 2.42% | -3.78% | 6.21% |
| HEWJ | iShares Currency Hedged MSCI Japan ETF | 0.48% | -0.72% | 4.25% | -5.39% | 9.64% |
| EWJ | iShares MSCI Japan ETF | 0.48% | -0.28% | -0.13% | -0.53% | 0.40% |
| DXJ | WisdomTree Japan Hedged Equity Fund | 0.48% | -0.81% | -0.52% | -1.43% | 0.91% |

 

Let’s look at the First Trust Japan AlphaDEX Fund (FJP) first. Overall, FJP tracks its index pretty well, trailing it by less than its expense ratio would predict. But FJP’s tracking range, varying from 2.42% on the upside to -3.78% on the downside, looks downright scary. However, there’s a story here.

First Trust, like many ETF issuers that offer both exchange-traded funds and mutual funds, has to be mindful of market timing in its mutual funds. Since the days of the mutual fund timing scandal, mutual fund accountants have been obligated to value all foreign currency positions as of 4:00 p.m. ET, when the U.S. markets close. Issuers want to use uniform calculation methods for mutual funds and ETFs, so the ETF NAVs also use a 4:00 p.m. ET currency strike.

Index providers don’t have to worry about market timing, because indexes aren’t portfolios. Index calculation agents don’t have to worry about fund accounting rules or premiums and discounts. So they follow the index industry standard, and quote their currency positions at 4:00 p.m. London time (or 11 a.m. ET). On days when the Japanese yen is volatile, this five-hour difference can push NAVs off of index levels by several percentage points. FJP’s tracking range isn’t bad. It’s just drawn that way.
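A toy example shows how the currency strike time alone can open a gap. Everything in this Python sketch is hypothetical: the portfolio value, the exchange rates and the function name are made up, and the Japanese stocks are assumed to be flat in yen terms so that only the currency timing matters.

```python
def usd_value(local_value_jpy: float, jpy_per_usd: float) -> float:
    """Convert a yen-denominated value to dollars at a given rate."""
    return local_value_jpy / jpy_per_usd


portfolio_jpy = 10_000.0  # unchanged in local terms over the day

prior_rate = 120.0   # yesterday's rate, assumed the same at both strike times
london_fix = 120.0   # today's rate at the London strike used by the index
ny_close = 118.8     # today's rate at 4:00 p.m. ET, after the yen rallied 1%

index_return = (usd_value(portfolio_jpy, london_fix)
                / usd_value(portfolio_jpy, prior_rate) - 1.0)
nav_return = (usd_value(portfolio_jpy, ny_close)
              / usd_value(portfolio_jpy, prior_rate) - 1.0)
timing_gap = nav_return - index_return

print(f"Index return: {index_return:.2%}")
print(f"NAV return:   {nav_return:.2%}")
print(f"Timing gap:   {timing_gap:.2%}")
```

In this sketch the NAV beats the index by roughly one percentage point for the day even though the portfolio matched the index exactly. The gap would be expected to reverse once the index's next currency strike catches up with the move.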

 

HEWJ And EWJ

You’d think the same would hold true for the iShares Currency Hedged MSCI Japan ETF (HEWJ). After all, BlackRock manages a huge suite of mutual funds. But you’d be wrong.

BlackRock didn’t always own iShares. It bought the business from Barclays, who had been managing it for Morgan Stanley, which designed ETFs to track MSCI’s indexes. iShares started out with every incentive to treat its ETFs like indexes. That means using 4:00 p.m. London currency strike times in its NAV calculations. This practice continues, even under BlackRock. HEWJ’s NAV is not fair-valued.

Instead, HEWJ’s tracking range is a victim of BlackRock’s cleverness. Let me explain.

BlackRock’s currency-hedged suite takes advantage of the enormous liquidity of iShares’ biggest international funds to build highly liquid, easily tradable currency-hedged versions. They simply take an unhedged ETF like the iShares MSCI Japan (EWJ), and use it to provide exposure to the MSCI Japan index. They add in a currency hedge, and voilà! A two-position portfolio that offers hedged exposure to the Japanese large and midcap equity markets.

We know that EWJ tracks its index well, with a median tracking difference of -0.28% and a range of only 0.40%, which is not bad for a basket of securities that don’t even trade during U.S. market hours. But here’s the thing: EWJ trades right up to 4:00 p.m. ET. EWJ’s price will reflect up-to-the-minute information, including movements in the Japanese yen and investor sentiment about the portfolio securities. EWJ’s market price is nothing like its NAV. That’s why EWJ’s premium/discount chart looks like this:

[Chart: EWJ premium/discount to NAV]

HEWJ’s NAV is calculated using the closing price of EWJ, not its NAV. That opens the door for EWJ’s “premium” or “discount” to flow through to HEWJ’s NAV. In general, EWJ’s premiums/discounts are not real, but simply a reflection of the staleness of EWJ’s NAV. HEWJ passes along the discrepancy.
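The pass-through can be illustrated with a simplified two-position portfolio. This Python sketch is an assumption-laden toy: the share count, prices and hedge value are invented, and the function name is hypothetical rather than anything from BlackRock's fund accounting.

```python
def fund_of_fund_nav(etf_shares: float, etf_value_per_share: float,
                     hedge_value: float) -> float:
    """NAV per share of a fund holding one ETF plus a currency hedge."""
    return etf_shares * etf_value_per_share + hedge_value


ewj_nav = 12.00      # hypothetical EWJ NAV (stale by the U.S. close)
ewj_close = 12.096   # hypothetical EWJ closing price, a 0.8% "premium"
shares_held = 2.0    # EWJ shares per hedged-fund share (made up)
hedge_value = 0.50   # mark-to-market value of the currency hedge (made up)

nav_using_close = fund_of_fund_nav(shares_held, ewj_close, hedge_value)
nav_using_nav = fund_of_fund_nav(shares_held, ewj_nav, hedge_value)
pass_through = nav_using_close / nav_using_nav - 1.0

print(f"NAV valued at EWJ close: {nav_using_close:.3f}")
print(f"NAV valued at EWJ NAV:   {nav_using_nav:.3f}")
print(f"Premium passed through:  {pass_through:.2%}")
```

Nearly all of EWJ's premium shows up in the hedged fund's NAV, which is the sense in which the hedged fund passes the discrepancy along.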

In HEWJ’s case, the best way to assess tracking range is to proxy it via EWJ. For FJP, investors are simply out of luck, until the day that First Trust decides to provide non-fair-valued NAVs.

 

Conclusion

In the end, tracking difference and tracking range are only as good as their input data. For U.S. equity ETFs mimicking indexes that reinvest the dividends and targeting market segments where securities lending revenue is stable, the tracking statistics allow investors to predict their future holding costs pretty well.

But as soon as something gets complicated, whether by international exposure that has to deal with dividend tax withholding and post-mutual-fund-timing-scandal regulations that drive a wedge between NAV and index returns, or by sloppy index calculation, tracking difference needs to be unpacked before it can become useful.

Often investors need additional information about NAV valuation practices, index return types, and index dividend withholding rates to make real comparisons between funds.

The interpretation of tracking difference is not always cut and dried, but there’s usually a story there.

At the time of writing, the author held a position in IVV.

 

Elisabeth Kashner is FactSet's director of ETF research. She is responsible for the methodology powering FactSet's Analytics system, providing leadership in data quality, investment analysis and ETF classification. Kashner also serves as co-head of the San Francisco chapter of Women in ETFs.