Single Factor Test
What is a Factor?
As previously discussed, a factor is a variable that helps explain the returns of an asset. While the concept is intuitive, it is difficult to define precisely. Consider the book-to-market (BM) ratio—often referred to as the value factor. We hypothesize that firms with a high BM ratio tend to outperform those with a low BM ratio. If BM is the only factor in our model, the expected return of asset $i$ can be expressed as:
\[E(r_i) = r_f + \beta_{i} \lambda_{BM}\]where:
- $E(r_i)$ is the expected return of asset $i$.
- $r_f$ is the risk-free rate.
- $\beta_{i}$ is the sensitivity of the asset’s return to the BM factor.
- $\lambda_{BM}$ is the expected return of the BM factor, which is the return of a portfolio that goes long on high BM ratio firms and short on low BM ratio firms.
However, we face a challenge: we do not know the factor return $\lambda_{BM}$ (how much the factor explains the return of an asset), nor do we know the factor loading $\beta_{i}$ (how sensitive the asset’s return is to the factor).
This creates a classic chicken-and-egg problem!
To address this, there are two main approaches:
- Estimate $\lambda_{BM}$ first, then estimate $\beta_{i}$ using regression.
- Or, estimate $\beta_{i}$ first, then estimate $\lambda_{BM}$ using regression.
Estimating Factor Return First Using the Sorting Technique
This method uses a factor mimicking portfolio to estimate the factor return. The idea is to sort all assets based on the factor, group them into $m$ portfolios, and use the return of a long-short portfolio (long the top group, short the bottom group) as the factor return.
Using the BM ratio as an example, the steps are as follows:
- Sort all stocks based on their BM ratio (ascending order).
- Group them into $m$ portfolios:
- The first portfolio contains the stocks with the lowest BM ratio, the second portfolio contains the stocks with the next lowest BM ratio, and so on.
- If the BM ratio truly explains asset returns, these portfolios should show increasing returns: the first portfolio has the lowest return, the second has the next lowest, and so forth.
- Use a long-short portfolio to estimate the factor return:
- Go long the top group (highest BM ratio) and short the bottom group (lowest BM ratio).
- The return of this long-short portfolio is the factor return $\lambda_{BM}$.
- Rebalance the portfolios periodically (e.g., monthly or quarterly) to capture changes in the factor return by repeating steps 1 to 3.
Testing the Factor Return
If the BM ratio is a good factor, it should have the following properties:
- The $m$ portfolios should exhibit increasing (or decreasing) returns. → Test using ICs.
- The long-short portfolio should have returns significantly different from zero. → Test using a t-test.
We assume the factor return is i.i.d., so each timestamp provides an independent observation. A t-test can be used to determine if the mean factor return is significantly different from zero.
For the first property, we can calculate the Information Coefficient (IC) at each timestamp. Over multiple timestamps, a t-test can be used to check if the mean IC is significantly different from zero. The ICIR (Information Coefficient Information Ratio), which is the mean IC divided by its standard deviation, can also be calculated. For more details, see the Predictablity Measure - IC, ICIR section.
[!NOTE] In practice, some investors focus only on the profitability of a factor: if the top group has a significantly higher return than the bottom group, the factor is considered valid. In academic research, factors are subject to stricter requirements, which will be discussed later.
Time Series Regression to Estimate Factor Loadings
Once we have the factor return, estimating the factor loading $\beta_{i}$ is straightforward:
- A time series of asset returns $r_i$.
- A time series of factor returns $\lambda_{BM}$.
A simple linear regression can be used to estimate the factor loading $\beta_{i}$:
\[r_i = \alpha + \beta_{i} \lambda_{BM} + \epsilon_i\]where:
- $r_i$ is the return of asset $i$.
- $\alpha$ is the intercept.
- $\beta_{i}$ is the factor loading of asset $i$.
- $\lambda_{BM}$ is the factor return.
- $\epsilon_i$ is the error term.
The estimated $\beta_{i}$ measures the sensitivity of the asset’s return to the factor return.
If there are multiple factors, we can use the sorting technique to estimate the factor returns for each, then use time series regression to estimate the factor loadings for each asset:
\[E(r_i) - r_f = \alpha_i + \beta_{i1} \lambda_{F1} + \beta_{i2} \lambda_{F2} + ... + \beta_{in} \lambda_{Fn}\]If the model is valid, we should have:
\[H_0: \alpha_1 = \alpha_2 = ... = \alpha_i = ... = \alpha_n = 0\]This test checks the validity of the model (rejecting the null hypothesis means the model is not valid).
In practice, we could have some data preprocessing before running the regression, such as removing outliers, normalizing the data, or adjusting for autocorrelation and heteroscedasticity.
[!NOTE] Black, Jensen, and Scholes used this time-series approach to estimate the unconditional CAPM model, where the market beta is the only factor. They could not reject the null hypothesis that the intercept is zero, indicating the CAPM model was valid in their sample.
Disadvantages of the Sorting Technique
The sorting technique is simple and easy to implement. It was the first method proposed by Fama and MacBeth (1973) to estimate factor returns. However, it has some disadvantages:
- It only works for factors that can be sorted, such as BM ratio, size, momentum, etc. It cannot be used for factors that cannot be sorted, such as interest rates, inflation rates, GDP growth, etc.
- It only considers one factor at a time. Ideally, our factor mimicking portfolio should be sensitive only to the factor we are estimating and independent of other factors. The univariate sorting technique cannot guarantee this. (Bivariate sorting can address this issue, but it is more complex and has its own limitations.)
- Grouping is intended to reduce noise in the data, but the choice of the number of groups and group size is subjective and can significantly affect results.
Using Firm Characteristics as Factor Loadings
The second approach is to estimate the factor loading $\beta_{i}$ first. For some style factors, such as the BM ratio, each firm has its own BM ratio—so why not use the BM ratio directly as the factor loading? We can then use a cross-sectional regression at each timestamp to estimate the factor return $\lambda_{BM}$. Suppose we have $n$ companies at timestamp $t$:
\[E(r_{i}) - r_{f} = \alpha + \beta_{i} \lambda_{BM}\]where:
- $E(r_{i})$ is the expected return of asset $i$ at timestamp $t$.
- $r_{f}$ is the risk-free rate at timestamp $t$.
- $\alpha$ is the intercept at timestamp $t$.
- $\beta_{i}$ is the factor loading, equal to the BM ratio of asset $i$ at timestamp $t$.
By running this cross-sectional regression at each rebalancing period, we obtain a time series of factor returns $\lambda_{BM}$.
Testing the Factor Return - Firm Characteristics
Similarly, we can use ICs, ICIR, and t-tests to evaluate the factor return $\lambda_{BM}$.
This time, ICs are calculated at the stock level (whereas the previous method used group portfolios). We calculate the IC at each timestamp, treat each as an independent observation, and use a t-test to determine if ICs and factor returns $\lambda_{BM}$ are significantly different from zero.
Summary
As the number of factors increases, the regression results (factor returns) will differ. In this section, the goal is not to find the “true” factor return, but to determine whether a factor (an effect) is valid.
By using the sorting technique or firm characteristics as factor loadings, we can implement statistical tests to check if a factor is valid. We can then combine multiple factors to form a factor model and perform statistical tests to assess the validity of the model.
For practical application, here is an example of how to construct a factor model:
- Propose an idea: Identify a potential factor based on economic theory or empirical evidence.
- Single Factor Test: Use the sorting technique or firm characteristics as factor loadings to test whether the factor has predictive power.
- Add it to the existing factor model: If the factor is valid, add it to the existing factor model and test the overall model.
Comments