ProShares Replication#

HBS Case#

ProShares Hedge Replication ETF#


1. READING - The ProShares ETF Product#

1. Alternative ETFs#

Describe the two types of investments referenced by this term.

2. Hedge Funds.#

  • a. Using just the information in the case, what are two measures by which hedge funds are an attractive investment?

  • b. What are the main benefits of investing in hedge funds via an ETF instead of directly?

3. The Benchmarks#

  • a. Explain as simply as possible how HFRI, MLFM, MLFM-ES, and HDG differ in their construction and purpose.

  • b. How well does the Merrill Lynch Factor Model (MLFM) track the HFRI?

  • c. In which factor does the MLFM have the largest loading? (See a slide in Exhibit 1.)

  • d. What are the main concerns you have for how the MLFM attempts to replicate the HFRI?

4. The HDG Product#

  • a. What does ProShares ETF, HDG, attempt to track? Is the tracking error small?

  • b. HDG is, by construction, delivering beta for investors. Isn’t the point of hedge funds to generate alpha? Then why would HDG be valuable?

  • c. The fees of a typical hedge-fund are 2% on total assets plus 20% of excess returns if positive. HDG’s expense ratio is roughly 1% on total assets. What would their respective net Sharpe Ratios be, assuming both have a gross excess returns of 10% and volatility of 20%?


2. Analyzing the Data#

Use the data found on Canvas, in ’proshares analysis data.xlsx’.

It has monthly data on financial indexes and ETFs from Aug 2011 through May 2025.

1.#

For the series in the “hedge fund series” tab, report the following summary statistics:

  • mean

  • volatility

  • Sharpe ratio

Annualize these statistics.

2.#

For the series in the “hedge fund series” tab, calculate the following statistics related to tail-risk.

  • Skewness

  • Excess Kurtosis (in excess of 3)

  • VaR (.05) - the fifth quantile of historic returns

  • CVaR (.05) - the mean of the returns at or below the fifth quantile

  • Maximum drawdown - include the dates of the max/min/recovery within the max drawdown period.

There is no need to annualize any of these statistics.

3.#

For the series in the “hedge fund series” tab, run a regression of each against SPY (found in the “merrill factors” tab.) Include an intercept. Report the following regression-based statistics:

  • Market Beta

  • Treynor Ratio

  • Information ratio

Annualize these three statistics as appropriate.

4.#

Discuss the previous statistics, and what they tell us about…

  • the differences between SPY and the hedge-fund series?

  • which performs better between HDG and QAI.

  • whether HDG and the ML series capture the most notable properties of HFRI.

5.#

Report the correlation matrix for these assets.

  • Show the correlations as a heat map.

  • Which series have the highest and lowest correlations?

6.#

Replicate HFRI with the six factors listed on the “merrill factors” tab. Include a constant, and run the unrestricted regression,

\[\begin{split}\begin{align} r^{\hfri}_{t} &= \alpha^{\merr} + x_{t}^{\merr}\beta^{\merr} + \epsilon_{t}^{\merr}\\[5pt] \hat{r}^{\hfri}_{t} &= \hat{\alpha}^{\merr} + x_{t}^{\merr}\hat{\beta}^{\merr} \end{align}\end{split}\]

Note that the second equation is just our notation for the fitted replication.

a. Report the intercept and betas.

b. Are the betas realistic position sizes, or do they require huge long-short positions?

c. Report the R-squared.

d. Report the volatility of \(\epsilon^{\merr}\), the tracking error.

7.#

Let’s examine the replication out-of-sample (OOS).

Starting with \(t = 61\) month of the sample, do the following:

  • Use the previous 60 months of data to estimate the regression equation. This gives time-t estimates of the regression parameters, \(\tilde{\alpha}^{\merr}_{t}\) and \(\tilde{\beta}^{\merr}_{t}\).

  • Use the estimated regression parameters, along with the time-t regressor values, \(x^{\merr}_{t}\), calculate the time-t replication value that is, with respect to the regression estimate, built “out-of-sample” (OOS).

\[\hat{r}^{\hfri}_{t} \equiv \tilde{\alpha}^{\merr} + (x_{t}^{\merr})'\tilde{\beta}^{\merr}\]
  • Step forward to \(t = 62\), and now use \(t = 2\) through \(t = 61\) for the estimation. Re-run the steps above, and continue this process throughout the data series. Thus, we are running a rolling, 60-month regression for each point-in-time.

How well does the out-of-sample replication perform with respect to the target?


3. EXTRA - Other Estimations#

1.#

In Section 2, we estimated the replications using an intercept. Try the full-sample estimation, but this time without an intercept.

\[\begin{split}\begin{align} r^{\hfri}_{t} &= \alpha^{merr} + x_{t}^{\merr}\beta^{\merr} + \epsilon_{t}^{\merr}\\[5pt] \check{r}^{\hfri}_{t} &= \check{\alpha}^{\merr} + x_{t}^{\merr}\check{\beta}^{\merr} \end{align}\end{split}\]

Report

  • the regression beta. How does it compare to the estimated beta with an intercept, \(\hat{\beta}^{\merr}\)?

  • the mean of the fitted value, \(\check{r}^{\hfri}_{t}\). How does it compare to the mean of the HFRI?

  • the correlations of the fitted values, \(\check{r}^{\hfri}_{t}\) to the HFRI. How does the correlation compare to that of the fitted values with an intercept, \(\hat{r}^{\hfri}_{t}\)

Do you think Merrill and ProShares fit their replicators with an intercept or not?

2.#

Merrill constrains the weights of each asset in its replication regression of HFRI. Try constraining your weights by re-doing 2.6.

  • Use Non-Negative Least Squares (NNLS) instead of OLS.

  • Go further by using a Generalized Linear Model to put separate interval constraints on each beta, rather than simply constraining them to be non-negative.

Hints#

  • Try using LinearRegression in scikit-learn with the parameter positive=True.

  • Try using GLM in statsmodels.

4. EXTRA - Other Decompositions#

1.#

Let’s decompose a few other targets to see if they behave as their name suggests.

  • Regress HEFA on the same style factors used to decompose HFRI. Does HEFA appear to be a currency-hedged version of EFA?

  • Decompose TRVCI with the same style factors used to decompose HFRI. The TRVCI Index tracks venture capital funds–in terms of our styles, what best describes venture capital?

  • TAIL is an ETF that tracks SPY, but that also buys put options to protect against market downturns. Calculate the statistics in questions 2.1-2.3 for TAIL. Does it seem to behave as indicated by this description? That is, does it have high correlation to SPY while delivering lower tail risk?

2.#

The ProShares case introduces Levered ETFs. ProShares made much of its name originally through levered, or “geared” ETFs.

Explain conceptually why Levered ETFs may track their index well for a given day but diverge over time. How is this exacerbated in volatile periods like 2008?

3.#

Analyze SPXU and UPRO relative to SPY.

  • SPXU is ProShares -3x SPX ETF.

  • UPRO is ProShres +3x SPX ETF.

Questions:

  • Analyze them with the statistics from 2.1-2.3.

  • Do these two ETFs seem to live up to their names?

  • Plot the cumulative returns of both these ETFs along with SPY.

  • What do you conclude about levered ETFs?