2. Analyzing the Data#
Use the data found on Canvas, in ’proshares analysis data.xlsx’.
It has monthly data on financial indexes and ETFs from Aug 2011 through May 2025.
1.#
For the series in the “hedge fund series” tab, report the following summary statistics:
mean
volatility
Sharpe ratio
Annualize these statistics.
2.#
For the series in the “hedge fund series” tab, calculate the following statistics related to tail-risk.
Skewness
Excess Kurtosis (in excess of 3)
VaR (.05) - the fifth quantile of historic returns
CVaR (.05) - the mean of the returns at or below the fifth quantile
Maximum drawdown - include the dates of the max/min/recovery within the max drawdown period.
There is no need to annualize any of these statistics.
3.#
For the series in the “hedge fund series” tab, run a regression of each against SPY (found in the “merrill factors” tab.) Include an intercept. Report the following regression-based statistics:
Market Beta
Treynor Ratio
Information ratio
Annualize these three statistics as appropriate.
4.#
Discuss the previous statistics, and what they tell us about…
the differences between SPY and the hedge-fund series?
which performs better between HDG and QAI.
whether HDG and the ML series capture the most notable properties of HFRI.
5.#
Report the correlation matrix for these assets.
Show the correlations as a heat map.
Which series have the highest and lowest correlations?
6.#
Replicate HFRI with the six factors listed on the “merrill factors” tab. Include a constant, and run the unrestricted regression,
Note that the second equation is just our notation for the fitted replication.
a. Report the intercept and betas.
b. Are the betas realistic position sizes, or do they require huge long-short positions?
c. Report the R-squared.
d. Report the volatility of \(\epsilon^{\merr}\), the tracking error.
7.#
Let’s examine the replication out-of-sample (OOS).
Starting with \(t = 61\) month of the sample, do the following:
Use the previous 60 months of data to estimate the regression equation. This gives time-t estimates of the regression parameters, \(\tilde{\alpha}^{\merr}_{t}\) and \(\tilde{\beta}^{\merr}_{t}\).
Use the estimated regression parameters, along with the time-t regressor values, \(x^{\merr}_{t}\), calculate the time-t replication value that is, with respect to the regression estimate, built “out-of-sample” (OOS).
Step forward to \(t = 62\), and now use \(t = 2\) through \(t = 61\) for the estimation. Re-run the steps above, and continue this process throughout the data series. Thus, we are running a rolling, 60-month regression for each point-in-time.
How well does the out-of-sample replication perform with respect to the target?
3. EXTRA - Other Estimations#
1.#
In Section 2, we estimated the replications using an intercept. Try the full-sample estimation, but this time without an intercept.
Report
the regression beta. How does it compare to the estimated beta with an intercept, \(\hat{\beta}^{\merr}\)?
the mean of the fitted value, \(\check{r}^{\hfri}_{t}\). How does it compare to the mean of the HFRI?
the correlations of the fitted values, \(\check{r}^{\hfri}_{t}\) to the HFRI. How does the correlation compare to that of the fitted values with an intercept, \(\hat{r}^{\hfri}_{t}\)
Do you think Merrill and ProShares fit their replicators with an intercept or not?
2.#
Merrill constrains the weights of each asset in its replication regression of HFRI. Try constraining your weights by re-doing 2.6.
Use Non-Negative Least Squares (NNLS) instead of OLS.
Go further by using a Generalized Linear Model to put separate interval constraints on each beta, rather than simply constraining them to be non-negative.
Hints#
Try using LinearRegression in scikit-learn with the parameter
positive=True.Try using GLM in statsmodels.
4. EXTRA - Other Decompositions#
1.#
Let’s decompose a few other targets to see if they behave as their name suggests.
Regress HEFA on the same style factors used to decompose HFRI. Does HEFA appear to be a currency-hedged version of EFA?
Decompose TRVCI with the same style factors used to decompose HFRI. The TRVCI Index tracks venture capital funds–in terms of our styles, what best describes venture capital?
TAIL is an ETF that tracks SPY, but that also buys put options to protect against market downturns. Calculate the statistics in questions 2.1-2.3 for TAIL. Does it seem to behave as indicated by this description? That is, does it have high correlation to SPY while delivering lower tail risk?
2.#
The ProShares case introduces Levered ETFs. ProShares made much of its name originally through levered, or “geared” ETFs.
Explain conceptually why Levered ETFs may track their index well for a given day but diverge over time. How is this exacerbated in volatile periods like 2008?
3.#
Analyze SPXU and UPRO relative to SPY.
SPXU is ProShares -3x SPX ETF.
UPRO is ProShres +3x SPX ETF.
Questions:
Analyze them with the statistics from 2.1-2.3.
Do these two ETFs seem to live up to their names?
Plot the cumulative returns of both these ETFs along with SPY.
What do you conclude about levered ETFs?