Predicting M&A Targets Using Machine Learning Techniques

Published in

LSEG Developer Community

36 min readOct 6, 2021

The original article can be found in the Refinitiv Development portal. The Source Code can be found in the GitHub folder.

Table of content

Theoretical Background
1.1 Definitions to M&A
1.2 Background to M&A
Methodology
2.1 Methodology of the predictive model
2.2 Methodology for portfolio abnormal returns estimation
Data description
3.1 Dataset for Target companies
3.2 Dataset for non-Target companies
Empirical results
4.1 Variables selection
4.2 Results from Logistic regression models
4.2.1 Model on the entire dataset
4.2.2 Model on the clustered dataset
4.2.3 Comparison of unclustered and clustered results: The effect of clustering
Out of sample predictive power of the model
5.1 Identification of optimal cut-off probability
5.2 Classification results on the holdout sample
5.2.1 Results from the model on the entire dataset
5.2.2 Results from the models on seperate clusters
Portfolio returns
6.1 Announcement returns
6.2 Portfolio returns
6.2.1 Portfolio abnormal returns
6.2.2 Portfolio investment returns
Conclusion

Structure

The purpose of this article is to build a predictive model for Mergers and Acquisitions (M&A) target identification and discover if that will produce an abnormal return for investors by utilizing Refinitiv Data APIs. Extensive literature review is conducted to identify main Machine Learning models as well as variables used in empirical studies which can be provided by individual request. The rest of the article is structured in the following way. Section 1 briefly provides theoretical background to M&A, discusses the main motivations and drivers, and suggests main stakeholders who can benefit from target predictive modeling. Section 2 and Section 3 present the methodology of predictive modeling and describe the data respectively. Section 4 discusses the empirical results from logistic regression models and identifies significant variables. We test out-of sample predictive power of the models in Section 5. Section 6 provides portfolio return estimations based on the prediction outputs.

Below are three code cells containing packages required in this report. If these packages are not already installed on your computer, please run these cells.

We need also to import the following packages:

Section 1: Theoretical Background

1.1 Definitions to M&A

M&A are corporate actions involving restructuring and change of control within companies, which play an essential role in external corporate growth. The literature uses the terms of mergers, acquisitions, and takeovers synonymously; however, there are subtle differences in their economic implications. Piesse, Lee, Lin, and Kuo (2013) interpret acquisitions and takeovers as activities when the acquirer gets control over 50% equity of the target company and mergers when two firms join to form a new entity.

Overall, according to Piesse et al. (2013), the negotiating process is often friendly in M&A, assuming synergies for both firms and hostile in case of takeovers. In this sense, terms “merger” and “acquisition” are used synonymously to refer to friendly corporate actions and “takeovers” to hostile corporate actions. The current article concentrates on friendly M&A only, which assumes a substantial premium for the target’s stock price.

1.2 Background to M&A

M&A activity has increased throughout recent years both in terms of the number and value of the deals. The number of M&A deals reached its peak in 2017 when 50,600 M&A deals were announced, totaling USD 3.5 trillion. The activity was more than twenty times higher than the number of deals in 1985 and around ten times higher than the deal value for the same year. M&A activity for 1985–2021 is summarized in Figure below (Source):

M&A activity is evolving in cycles that coincide with the economic rise and major shocks, such as the IT bubble in the 2000s, the financial crisis in 2007–2009, and recent economic movements caused by COVID-19. These tendencies are well researched in academic literature and are referred to as merger waves (Martynova & Renneboog, 2006). Mitchell and Mulherin (1996) claim that M&A activity is driven by industry and macroeconomic shocks, which trigger the start or end of M&A activity waves triggered by regulatory, economic, or technological shocks and innovations.

1.3 Main stakeholders of M&A prediction modeling

Tunyi (2021) suggests three stakeholders interested in predicting M&A targets:

Investors — Corporate events, such as M&A or bankruptcy announcements, result in substantial stock price changes, thus allowing investors to receive abnormal returns. Previous research reports significant abnormal returns for target companies. For example, Jensen and Ruback (1983) claim 29.1% weighted average abnormal return for target companies in the US in the two months around the M&A announcement.

Company managers — Prior academic literature considers management inefficiency one of the main factors of a company’s acquisition. Thus, knowledge of anticipated takeover allows taking the necessary actions to safeguard shareholder interests by setting up takeover defense strategies (Tunyi, 2021). Those strategies may make the takeover deal unattractive or allow the managers and target company shareholders to generate excess returns (Schwert, 2000). Additionally, information on the probability of the company’s partners’ and competitors’ engagement in M&A could be helpful in the company’s strategy development (Tunyi, 2021).

Regulators An essential role of securities market regulators is to identify and exclude insider trading. Keown and Pinkerton (1981) mention that many individuals and groups, such as bankers, advisers who are engaged in M&A, hold non-public price-sensitive information, which is poorly kept secret. Several researchers (Schwert, 1996; Goergen & Renneboog, 2004) identify stock price run-ups from two to four months before announcing the M&A. While Jensen and Ruback (1983) explain that by market anticipation, Keown and Pinkerton (1981), Schwert (1996) attribute it to insider trading. A practical tool allowing regulators to investigate the extent of market anticipation of takeover based on public information could help them make informed decisions on market abuse

Section 2: Methodology

2.1 Methodology of the predictive model

Researchers used various empirical techniques for target identification, including parametric, linear discriminant analysis, conditional and logistical regression, and non-parametric techniques, such as SVM, Neural Networks, and Utilities Additive Discriminants. Despite the wide range of methodologies, logistic regression was proven to be the most prevailing due to higher classification accuracy and the explainability of the outputs. We have also tested different methodologies, including SVM, Decision tree, Random forest algorithms and identify that logistic regression generalizes better and provides higher accuracy and explainability.

Thus, in line with most previous empirical studies, the current article uses logistic regression to classify target and non-target companies.Intermediate clustering of sample data is employed to test whether prediction accuracy and abnormal returns will increase.

Logistic Regression: The study uses binary logistic regression and is conducted in Python using the sklearn package with a liblinear solver and penalty of l2 (ridge regression). The model equation is given as follows:

Pi is the probability of company i being a target, and Zi is the vector of the company i characteristics given as the following:

where β0 is the intercept and β_j(j = 1,…,k) is the coefficient of respective independent variable Xj (j = 1,…,k) for each company. The dependent dummy variable equals to 1 if the company is a target and 0 otherwise. It is worth highlighting that all the variables are sourced as of 1 month before the announcement, both for target and non-target groups. Non-targets are matched with targets as of the announcement date of the corresponding target acquisition.

To identify the optimal variables to include in the logistic regression model and avoid multicollinearity, correlation analysis, t-test for the mean difference, and Variance Inflation Factor (VIF) test are used. First, correlation analysis was done to identify the interdependence of the variables and potential multicollinearity. One of the correlated variables was eliminated based on its significance in the t-test and VIF score.

Clustering: Further, we propose a clustering technique to test if that increases the accuracy of logistic regression outputs and portfolio returns. We compared prediction outputs and portfolio returns between logistic regression models estimated on the entire and clustered datasets to test this. We started by estimating logistic regression on the entire dataset. Then, we clustered the entire dataset into two groups and evaluated logistic regression models on each cluster separately. Finally, we compared results from models based on the entire dataset and each cluster dataset. Additionally, combined clustered outputs were compared with the results from the entire dataset.

The clustering is done based on liquidity and leverage variables of target and non-target companies, which are the same as in logistic regression. The study uses the Kmeans clustering technique conducted in a Python environment using the sklearn package.

The algorithm is an iterative process aimed to partition data into prespecified groups to minimize the sum of the squared distance (Euclidean distance in our case) between the data points and the cluster centroid (Hastie, Tibshirani, & Friedman,2009).The algorithm aims to minimize the following equation:

where j is the number of clusters, which is 2 in our case, i is the number of sample companies xi^j is the company i for cluster j and cj is the centroid for cluster j.

2.2 Methodology for portfolio abnormal returns estimation

Portfolio construction Strategies

To estimate the practical usefulness of the target prediction model an optimal cut-off based on the maximization of the difference between True Positive Rate (TPR) and False Positive Rate (FPR) is used to classify target and non-target companies.

Then, portfolio abnormal returns for an observation period of 60 days before and 3 days after the announcement for all predicted target companies are calculated. Additionally, an investment strategy of buying all predicted targets during the beginning of the year and selling right after the announcement (false predicted targets are kept in the portfolio until the end of the observation period) is employed. The portfolio investment return is compared with Standard & Poor’s (S&P) 500 return for the same period to identify whether the portfolio constructed from all predicted targets can generate market excess return.

Calculation of abnormal returns

Abnormal returns are calculated by using event study methodology (MacKinlay,1997) using the following equation.

where r(i,t) is the return on security i in period t, R(m,t) is the market return in period t. α and β are model parameters. α is a constant that assumed zero; β is calculated by regressing stock returns against the market returns and shows stock returns volatility versus the market returns.

The methodology assumes an estimation period when model parameters, such as β is estimated and an observation period for which the actual returns are calculated. β is calculated for the estimation period by regressing daily stock returns against the S&P 500 index as a market return proxy. Further, to calculate Cumulative Abnormal Return (CAR), abnormal returns of each stock are summed for the whole observation period:

where, t1, t2 denote the start and the end of the observation period.

The following function calculates abnormal return of a given security during an observation period based on Event study Methodology(MacKinlay,1997).

The following gives an example of calculaing abnormal return for Slack Technologies (WORK.N^G21) during the observation period which includes the aquisition announcement date by Salesforce.

50.620286227582014

The function mentioned above has been used to calculate abnormal returns for 3 different cases:

1. Calculate Run-up return vriable for target and non-target companies — This variable is based on the findings in the previous literature (Keown & Pinkerton, 1981, Barnes, 1998), suggesting that target companies generate significant run-up returns during one to two months before the announcement of the deal. We calculate this return for both target and non-target companies.The period 250 to 60 days before the deal announcement was used as an estimation window. The observation period was two months before the announcement.

2. Calculate post announcement abnormal returns — In order to support the assumption that shareholders of target companies receive abnormal returns after company acquisition post-announcement abnormal returns for target companies are calculated.

3. Calculate portfolio abnormal returns — abnormal returns for the portfolios, constructed based on the models outputs are calculated to test whether a target prediction model can capture some of the examined announcement abnormal returns.

The estimation window for both the announcement and portfolio returns is 250 to 60 days before the deal announcement. The observation period for portfolio abnormal return calculation is 60 days before and 3 days after the announcement. As for the announcement returns, multiple observation periods, such as [-40, +40], [-20, +20], [-10, +10], [-5, +5], are considered to observe both run-up (two months preceding the announcement) and mark-up returns (two months following the deal announcement). The Figure below illustrates observation and estimation periods of both announcement and portfolio abnormal returns.

Section 3: Data description

To build a logistic regression model for target prediction a dataset of target and non-target companies is required. Two separate datasets for each group of companies is retrieved from Refinitiv and then merged into single one with appropriate labels to estimate a Logistic regression model.

3.1 Dataset for Target companies

The target sample is constructed based on M&A Advanced Search section of Refinitiv Workspace to include relevant deals for target companies in the US. The screen shot below shows the filters used to get the relevant data for the article.

The retrieved data consisted of 2549 target companies from the US. Further screening for data availability of selected variables (1547 observations eliminated), peers (254 observations eliminated), and removal of outliers (92 observations eliminated) resulted in a final dataset of 656 target companies for announcement date from April 2010 — June 2021. The final dataset for target companies is included in the github folder of the current article. As the data were retrieved in excel from the Advanced Search directly we retrieve variable values directly in excel using Refinitiv Add-in formulas.

The following code reads the target dataset into a dataframe.

3.2 Dataset for non-Target companies

The non-target sample was constructed from companies similar to target companies first, in terms of business activity, and second, size as measured by market capitalization at the time of the bid. To identify the non-target control group, Refinitiv peers SCREEN is used. The Peer group for each company and the variables to be used in the prediction model are retrieved using the function in the cell below.

The following gives an example of retrieving peers and specified variables for Slack Technologies. The next cell provides the list of target company RICs along with the date for peer identification and variable retrieval.

Running “peer” function over all RICs and dates above will create dataframes (or excel files) for each target company, which will include all peers along with specified variables. After screening the peer data based on their similarity to the target group and data availability, each target was matched by year with the closest non-target company.The final dataset for non-target companies is included in the github folder of the current article.

Hereof, data from 2010–2019 are used as a training sample for the prediction model, and data from 2020–2021 as a hold-out testing sample to measure the prediction outputs. Finally, other non-target companies from peer data (all peers were included based on data availability) were added in the hold-out sample to have a similar to natural world distribution of target and non-target companies. The all target sample consisting of 1705 observations is stored in github folder of this article.

Training dataset

Final training dataset consisting of target and non-target companies is created by merging target and non target group dataset and selecting only the observations before 2020–01–01. This is used to train the logistic regression model for target identification.

Testing dataset

Predictive power of the model is estimated on the hold-out sample consisting of target group observations after 2020–01–01 and all non-target peer companies.

Number of target companies in training dataset is: 572
Number of non-target companies in training dataset is: 572

Number of target companies in hold-out testing dataset is: 84
Number of non-target companies in hold-out testing dataset is: 1704

Section 4: Empirical results

This section discusses the variable selection process and presents empirical results from the logistic regression models.

4.1 Variables selection

Variables for the logistic regression model are selected through triangulation of results from correlation matrix, t-test analysis, and VIF estimates.

Variables for the logistic regression model are selected through triangulation of results from correlation matrix, t-test analysis, and VIF estimates. Variables are presented into 6 groups which describe different aspect of a company:

Inefficient Management: (ROE, ROC, Return on Sales, Profit Margin, Gross Profit Margin, Profit to Capital, Sales Growts(3y))

We identify that Gross Profit Margin and Sales growth are not correlated with any of the variables, thus included in the final model. The most correlated variables are Profit to Capital, ROC, and ROE, which is logical considering the components of the ratios. Considering comparably higher t-test and lower VIF scores, Profit to Capital is selected for the final model. As another highly correlated variable with the Profit to Capital ratio, Profit Margin was also eliminated from the list. Return on Sales, along with Profit to Capital, Gross Profit Margin, and Sales Growth, was the fourth variable considered for the model as a proxy for inefficient management.

Company undervaluation: (P/E, EV to Sales, EV to EBITDA, Market to Book)

Moving to variables describing company undervaluation, EV to EBITDA, and M/B ratios are not correlated with the rest of the variables and are included in the model. In contrast, EV to Sales and Price to Sales is highly correlated with each other. Considering the greater difference in t-test and lower VIF score of Price to Sales ratio, EV to Sales was eliminated from the list of final variables.

Growth-resource Imbalance: (Growth-Resource Mismatch, Free Cash Flow to Sales, Operating Cash Flow to Total Assets)

Free cash flow to Sales and Operating cash flow to Total Assets are other correlated variables. Both describe Growth-resource Imbalance, and based on the above-described considerations, only Free cash flow to sales was included in the model. Dummy of imbalance was included as a proxy for Growth-resource Mismatch.

Leverage: (Total Debt to Equity, Debt to EV, Net Debt per Share, Net Debt to Total Capital)

Debt to EV, Net Debt to Total Capital variables are highly correlated. Only Debt to EV was included in the final model considering its higher significant mean difference. More variables proxying Leverage are Net Debt-based ratios. Only Net debt per share was selected because of the high correlation between Net Debt to Total Capital and other variables (including Debt to EV, Cash to Capital) considered for the model.

Liquidity: (Current Ratio, Cash to Capital, Working Capital to Total Assets)

As for the variables describing liquidity, both the Current and Cash to Capital ratios were included in the model. Working Capital to Total Assets is eliminated because of its high correlation with the current ratio and higher VIF score.

Run-up returns: (Abnormal Return previous 60 days)

Lastly, the t-test of Abnormal return is significant at the 1% level. The mean difference is in line with our hypothesis, suggesting that target companies have higher run-up returns. The abnormal return is not correlated with any observed variables, thus included in the final model.

4.2 Results from Logistic regression models

Three different logistic regression models are estimated. First logistic regression is estimated on the entire dataset (Model 1). Then the entire dataset is divided into two clusters based on liquidity and leverage ratios of target and non-target companies, and logistic regression is run on each cluster separately (Model 2 and 3). Model outputs are discussed thereof.

First, we create separate dataframes for independent and dependent variables

X = data.drop(['Label','Total debt to Equity'],axis =1)
y = data['Label']

4.2.1 Model on the entire dataset

The code below summarizes model output for the entire dataset

Optimization terminated successfully.
         Current function value: 0.676776
         Iterations 5
                           Logit Regression Results                           
==============================================================================
Dep. Variable:                  Label   No. Observations:                 1144
Model:                          Logit   Df Residuals:                     1130
Method:                           MLE   Df Model:                           13
Date:                Tue, 14 Sep 2021   Pseudo R-squ.:                 0.02362
Time:                        22:01:50   Log-Likelihood:                -774.23
converged:                       True   LL-Null:                       -792.96
Covariance Type:            nonrobust   LLR p-value:                 0.0003512
============================================================================================
                               coef    std err          z      P>|z|      [0.025      0.975]
--------------------------------------------------------------------------------------------
Abnormal return 60 day       0.0118      0.004      3.012      0.003       0.004       0.019
Gross Profit Margin          0.0031      0.003      1.030      0.303      -0.003       0.009
Profit to Capital           -1.2249      0.526     -2.329      0.020      -2.256      -0.194
Return on Sales             -0.0007      0.004     -0.164      0.870      -0.009       0.008
EV to EBIDTA                -0.0009      0.001     -0.801      0.423      -0.003       0.001
Sales growth, 3y            -0.0012      0.002     -0.724      0.469      -0.005       0.002
Free cash Flow/Sales         0.0524      0.189      0.278      0.781      -0.317       0.422
Current Ratio                0.0186      0.038      0.490      0.624      -0.056       0.093
Price to Sales              -0.0512      0.028     -1.848      0.065      -0.106       0.003
Market to Book              -0.0003      0.008     -0.040      0.968      -0.017       0.016
Debt to EV                   0.0070      0.003      2.402      0.016       0.001       0.013
Cash to Capital             -0.0586      0.375     -0.156      0.876      -0.794       0.677
Net debt per share          -0.0053      0.003     -1.707      0.088      -0.011       0.001
Growth-Resource Mismatch    -0.1516      0.139     -1.094      0.274      -0.423       0.120
============================================================================================

The results suggest statistically significant output for Abnormal returns in 1%, Profit to capital, Debt to EV ratios in 5%, and Price to Sales, Net debt per share variables in 10% significance level. Moreover, the coefficient of Profit to Capital is negative, confirming the hypothesis of inefficient management, suggesting that underperforming firms are more likely to be acquired. The firm undervaluation hypothesis is confirmed and aligns with previous studies as companies with low Price to Sales ratios are significantly associated with target companies. The results could not claim the hypothesis of Growth-Resource imbalance and Liquidity for none of the variables in observed significance levels. In contrast, the hypothesis on Leverage is confirmed for both observed variables, in 5% and 10% significance levels, respectively.

Moreover, the coefficient of Debt to EV is positive, suggesting that target firms have higher leverage, in line with the previous studies. On the contrary, Net debt per share coefficient is negative, which contradicts the hypothesis and previous studies. A possible explanation is that the ratio excludes cash and cash equivalents from the debt, which may suggest that the observed target companies have higher liquidity in terms of cash. Finally, Abnormal return over the previous 60 days is highly significant with a positive coefficient allowing to claim the hypothesis that target companies generate positive abnormal run-up returns.

4.2.2 Model on the clustered dataset

Next we hypothesize that Clustering companies before classification improves prediction accuracy. This hypothesis is based on our intuition and is motivated by the findings in the literature that companies in financial distress (lower-liquid or high-levered) have a higher probability of being acquired. Acquisitions of financially distressed companies are mainly related to disciplinary takeovers where a target is acquired because of poor performance. The acquirer is usually a financial buyer seeking mainly financial returns by improving the company’s financial health. However, there is also synergetic acquisition where the acquirer looks for synergies with the target company to improve its business operations. In this case, the target companies are not necessarily financially distressed; moreover, they usually have high growth opportunities. Thus, the characteristics of target companies in disciplinary and synergetic acquisitions are different; therefore, different variables may explain their acquisition likelihood. To test this hypothesis, we employed a clustering algorithm based on sample companies’ liquidity and leverage ratios and run separate logistic regression models on each of the two clusters.

Number of elements in Cluster 0 is 702
Number of elements in Cluster 1 is 442

Moving to clustered model outputs, the graph above shows number of target (Label 1) and non-target (Label 2) companies in each cluster. We can see that there are overall 702 observation in Cluster 0 and 442 observations in Cluster 1. The class distribution is slightly unbalanced towards target and non-target companies for cluster 1 and cluster 0 companies respectively.

The table below summarizes cluster centroids for each cluster per variable. It is evident from the results that companies in Cluster 0 have higher liquidity and lower leverage. On the contrary, Cluster 1 companies are in a worse financial condition in terms of long-term leverage and short-term financial power.

Further we run logistic regression models on each cluster datapoints and discuss the results

Optimization terminated successfully.
         Current function value: 0.670424
         Iterations 6
                           Logit Regression Results                           
==============================================================================
Dep. Variable:                  Label   No. Observations:                  702
Model:                          Logit   Df Residuals:                      688
Method:                           MLE   Df Model:                           13
Date:                Tue, 14 Sep 2021   Pseudo R-squ.:                 0.02694
Time:                        22:02:05   Log-Likelihood:                -470.64
converged:                       True   LL-Null:                       -483.67
Covariance Type:            nonrobust   LLR p-value:                   0.01668
============================================================================================
                               coef    std err          z      P>|z|      [0.025      0.975]
--------------------------------------------------------------------------------------------
Abnormal return 60 day       0.0140      0.005      2.648      0.008       0.004       0.024
Gross Profit Margin          0.0036      0.004      0.941      0.347      -0.004       0.011
Profit to Capital           -1.4259      0.663     -2.152      0.031      -2.725      -0.127
Return on Sales              0.0059      0.007      0.862      0.388      -0.008       0.019
EV to EBIDTA                -0.0012      0.001     -0.975      0.330      -0.003       0.001
Sales growth, 3y            -0.0051      0.003     -1.680      0.093      -0.011       0.001
Free cash Flow/Sales        -0.0313      0.375     -0.083      0.934      -0.766       0.703
Current Ratio                0.0321      0.046      0.692      0.489      -0.059       0.123
Price to Sales              -0.0672      0.038     -1.779      0.075      -0.141       0.007
Market to Book              -0.0026      0.023     -0.112      0.911      -0.049       0.043
Debt to EV                  -0.0177      0.008     -2.144      0.032      -0.034      -0.002
Cash to Capital              0.6774      0.457      1.482      0.138      -0.218       1.573
Net debt per share           0.0324      0.017      1.949      0.051      -0.000       0.065
Growth-Resource Mismatch    -0.1303      0.175     -0.744      0.457      -0.473       0.213
============================================================================================

Logistic regression outputs from Cluster 0, which includes companies in better financial health, find similar results to Model based on the entire dataset. Particularly, Abnormal returns are significant in 1%, Profit to capital, Debt to EV ratios in 5%, and Price to Sales at a 10% significance level. In addition, Net debt per share is also significant at a 5% level. The directions of the impact of significant variables are mainly similar to the first model; however, the coefficient values are higher for the Cluster 1 model for all variables. The exception is leverage ratios, where the direction of coefficients is the opposite, suggesting that companies with lower Debt to EV and higher Net debt per share contribute to the acquisition.

Optimization terminated successfully.
         Current function value: 0.648109
         Iterations 6
                           Logit Regression Results                           
==============================================================================
Dep. Variable:                  Label   No. Observations:                  442
Model:                          Logit   Df Residuals:                      428
Method:                           MLE   Df Model:                           13
Date:                Tue, 14 Sep 2021   Pseudo R-squ.:                 0.05057
Time:                        22:02:10   Log-Likelihood:                -286.46
converged:                       True   LL-Null:                       -301.72
Covariance Type:            nonrobust   LLR p-value:                  0.003969
============================================================================================
                               coef    std err          z      P>|z|      [0.025      0.975]
--------------------------------------------------------------------------------------------
Abnormal return 60 day       0.0078      0.006      1.292      0.196      -0.004       0.020
Gross Profit Margin          0.0100      0.006      1.646      0.100      -0.002       0.022
Profit to Capital           -1.3091      1.296     -1.010      0.313      -3.850       1.232
Return on Sales             -0.0185      0.009     -2.015      0.044      -0.036      -0.001
EV to EBIDTA                -0.0004      0.005     -0.075      0.940      -0.010       0.009
Sales growth, 3y             0.0055      0.005      1.170      0.242      -0.004       0.015
Free cash Flow/Sales         0.3640      0.254      1.434      0.152      -0.133       0.861
Current Ratio                0.2083      0.101      2.067      0.039       0.011       0.406
Price to Sales               0.0203      0.060      0.339      0.735      -0.097       0.137
Market to Book              -0.0014      0.009     -0.152      0.879      -0.019       0.016
Debt to EV                   0.0046      0.005      0.892      0.372      -0.006       0.015
Cash to Capital             -1.3762      1.377     -0.999      0.318      -4.076       1.323
Net debt per share          -0.0111      0.005     -2.401      0.016      -0.020      -0.002
Growth-Resource Mismatch    -0.3546      0.283     -1.252      0.211      -0.910       0.201
============================================================================================

Furthermore, Model based on Cluster 1 data, which provides logistic regression outputs based on 442 low liquid and high-levered companies, suggests somewhat different results. Return on Sales proxying inefficient management is significant at 5% level and is in line with previous studies. The rest of the variables are insignificant except Net debt per share and Current ratios. It is worth mentioning that the coefficient for Net debt per share is negative, as was in Model based on Cluster 0 data. Moreover, the current ratio is significant at a 5% level, positively contributing to our earlier argument that observed target companies have higher liquidity.

4.2.3 Comparison of unclustered and clustered results: The effect of clustering

Accuracy metrics for the model based on the entire dataset
Classification accuracy: 0.575
ROC_AUC score: 0.602

Accuracy metrics for the model based on the Cluster 0 data
Classification accuracy: 0.595
ROC_AUC score: 0.618

Accuracy metrics for the model based on the Cluster 1 data
Classification accuracy on test set: 0.606
ROC_AUC score: 0.644

The comparison of the three models shows that Clustered models produce relatively better results according to the Accuracy, AUC measure, and Pseudo R squire. We believe that the AUC measure is a better estimate, considering that Clustered models are imbalanced, and equal to 0.6, 0.618, and 0.644 for Model 1, 2, and 3, respectively. Additionally, models on clustered data have better explanatory power as clustered models produced a more comprehensive view of significant variables. However, it is also worth noting that the difference in model accuracy is not radical and can be associated with the sample size, which is the smallest for Model 3. The difference becomes even smaller after the cross-validation. Results from stratified cross-validation with ten splits for accuracy and AUC scores are summarized in the table below.

Accuracy metrics for the model based on the entire dataset after 10-fold cross-validation
Accuracy: 0.565 (0.033)
ROC_AUC score: 0.578 (0.040)

Accuracy metrics for the model based on the Cluster 0 data after 10-fold cross-validation
Accuracy: 0.570 (0.051)
ROC_AUC score: 0.572 (0.048)

Accuracy metrics for the model based on the Cluster 1 data after 10-fold cross-validation
Accuracy: 0.577 (0.070)
ROC_AUC score: 0.593 (0.073)

It should be noted that the current comparison is only preliminary, and actual accuracies will depend on how well the models will generalize on the hold-out sample, which is discussed later in this chapter. Nevertheless, higher accuracy and AUC measures of clustered models after cross-validation indicate that clustering will improve accuracy. To ensure the robustness of our claim, the hypothesis is also tested on a hold-out sample.

Section 5: Out of sample predictive power of the model

As mentioned earlier, the predictive power of the model is estimated on the hold-out sample consisting of target group observations after 2020–01–01 and all non-target peer companies. The hold sample includes 84 target and 1704 non-target companies The reason we test our model on highly unbalanced dataset is to to have a similar to natural world distribution of target and non-target companies.

5.1 Identification of optimal cut-off probability

To estimate the predictive power of the models, an optimal cut-off rather than an arbitrary one (0.5) needs to be identified. Considering that we aim to compare two models (general and clustered) a universal approach is suggested. Considering unbalanced datasets of clustered models and the one from the hold-out sample, the optimal cut-off for the models is derived G-measure approach which is the geometric mean of precision and recall. The formula is given as follows:

The cut-off is further used to classify companies and include in the portfolio. For the companies where target probability is bigger than the optimal cut-off model classifies as target and includes in the portfolio.

The following function allows to identify the optimal cut-off and highlights it in a plot.

First we identify the optimal cut-off for the model based on the entire dataset using the function above.

Best Threshold: 0.4961 with G-Mean: 0.5865
FPR: 0.4056, TPR: 0.5787

Then we identify the optimal cut-off for the model based on the cluster 0 data.

Best Threshold: 0.4606 with G-Mean: 0.5826
FPR: 0.3916, TPR: 0.558

Finally, we identify the optimal cut-off for the model based on the cluster 1 data.

Best Threshold: 0.5902 with G-Mean: 0.6128
FPR: 0.291, TPR: 0.5296

By employing this methodology, cut-off points of 0.4961, 0.460 and 0.5901 are determined for unclustered and cluster 0 and cluster 1 models respectively. The cut-offs mentioned above are employed to evaluate the predictive ability of the models, including Accuracy to determine how well the models identify actual targets and non-targets, Precision, Recall (TPR), and FPR to get further deeper insight on each aspect of the prediction as well as F1 score to evaluate the overall quality of different models.

First we divide the holdout sample into dependent and independent variables

5.2 Classification results on the holdout sample

5.2.1 Results from the model on the entire dataset

First we calculate and report logistic regression outputs of the model based on the entire dataset

Observations
Total Number of companies: 1788
Number of target companies: 84
Number of non-companies: 1704

Absolute Measures
TP: 48
TN: 982
FP(Type II error): 722
FN(Type I error): 36

Relative Measures
ROC score:0.6
Accuracy: 0.58
Precision: 0.06
Recall(TPR): 0.57
FPR: 0.42
F1 score: 0.11

The model based on the entire dataset correctly identifies 58% of targets and non-targets, which equals 48 out of 84 for targets and 982 out of 1704 for non-targets in absolute measures. Additionally, the Model has a large Type II error, resulting in a very low precision of 0.06. The recall is relatively higher due to lower Type I error and higher True positives. The resulting F1 score is 0.11.

Then we divide the entire dataset into two clusters using the kmeans model we have trained earlier.

5.2.2 Results from the models on seperate clusters

Number of elements in Cluster 0 is 1143
Number of elements in Cluster 1 is 645

We calculate and report logistic regression outputs of the models based on Cluster 0 and Cluster 1 separately. Below are the results from the Cluster 0 model.

Observations
Total Number of companies: 1143
Total Number of companies: 45
Total Number of companies: 1098

Absolute Measures
TP: 27
TN: 697
FP(Type II error): 401
FN(Type I error): 18

Relative Measures
ROC score:0.65
Accuracy: 0.63
Precision: 0.06
Recall(TPR): 0.6
FPR: 0.37
F1 score: 0.11

Cluster 0 model, which encompasses companies with better financial health, has a higher predictive power of identifying non-targets than targets, 64% (697 out 1143) and 59% (27 out of 45). The large number of identified true negatives results in the lowest FPR and the highest accuracy among the models. However, a large Type I error results in one of the lowest precision scores for the model, which is 0.6 and equal to the precision of the general model. The resulting F1 score is also similar to the one from the model based on the entire dataset.

Further we report the results from the Cluster 1 model

Observations
Total Number of companies: 645
Total Number of companies: 39
Total Number of companies: 606

Absolute Measures
TP: 28
TN: 319
FP(Type II error): 287
FN(Type I error): 11

Relative Measures
ROC score:0.45
Accuracy: 0.54
Precision: 0.09
Recall(TPR): 0.72
FPR: 0.47
F1 score: 0.16

Moving to the Cluster 1 model, which involves low-liquid and high-levered companies, has the highest accurate identification of target companies. The model correctly identifies 28 targets out of 39, 72% in relative measures. In contrast, the model has the poorest ability to predict actual non-targets, particularly 52% or 319 out of 606 non-targets. Considering the highly toward non-targets unbalanced (95:5) hold-out sample, the predictions mentioned above result in the lowest accuracy of 0.54 compared to the other models. However, due to the relatively accurate prediction of targets and smaller Type I error, the model produces the highest recall and precision scores, 0.72 and 0.09, respectively. The F1 score is also the highest and is equal to 0.16.

As can be noticed Clustered models characterize with diverged results. That can be attributed to an assumption that models overall predict well for the companies which are in bad financial health. This assumption is supported on both clustered models, as Cluster 0, which involved companies in good financial health, better identifies non-targets (potentially financially distressed), and Cluster 1, which involved companies with lower liquidity and higher leverage, better identifies the targets (again potentially financially distressed).

Finally we calculate and report also combined results of clustering to be able to better measure the impact of clustering on logistic regression prediction accuracy.

Observations
Total Number of companies: 1788
Total Number of companies: 84
Total Number of companies: 1704

Absolute Measures
TP: 55
TN: 1016
FP(Type II error): 688
FN(Type I error): 29

Relative Measures
Accuracy: 0.6
Precision: 0.07
Recall(TPR): 0.65
FPR: 0.4
F1 score: 0.13

As for the combined results derived based on a weighted cut-off, all metrics expectedly lie between Cluster 0 and Cluster 1. According to the combined outputs, 65% of targets and 60% of targets are identified through clustering, which is better than the model without clustering. The latter correctly identified only 58% of the targets and non-targets. Both Type I and Type II errors are higher for the general model, which allows us to conclude that clustering helps to reduce the errors. Due to higher accuracy and low erroneous predictions, the clustering results in better predictive power, as in terms of overall accuracy, 0.6 against 0.58, as well as F1 score (0.13 against 0.11) as a combined metric of precision versus recall trade-off.

Overall, the results from the current and previous sections allow us to confirm that clustering improves the prediction accuracy of logistic regression models for target prediction. It is also worth mentioning that Clustered models can be used as combined as well as separately in the portfolio construction process. Particularly, one can buy only targets predicted by the Cluster 1 or Cluster 0 model instead of buying them all, considering the different predictive abilities of two models.

Section 6: Portfolio returns

6.1 Announcement returns

Before estimating portfolio returns based on the predictions from 3 different models, announcement returns are calculated and illustrated with a plot. This provides an insight into the potential portfolio returns that can be generated by correctly identifying target companies.

First we create a dataframe containing the RICs of hold-out sample companies and key dates of estimation and observation periods of abnormal return calculation.

After the dataframe is created, abnormal returns for each target company in the holdout sample is estimated using the function described in the begining of this article.

After retrieving abnormal returns for all of the target companies, we calculate cumulative sum of the returns and plot through a lineplot.

In line with previous empirical studies (Jensen & Ruback, 1983; Danbolt, 2004), we can observe positive abnormal announcement returns during the event windows, which exceeds 25% after the announcement day. The decline in Average CAR after the announcement is explained by the fact that the stock price of the to-be-acquired company remains stable (equal to per stock acquisition price) while the market, against which abnormal returns are calculated, moves. During our observation, S&P 500 index, used as a market proxy, increased, resulting in a decline in the target company’s stock abnormal returns.

It can also be observed that a considerable portion of abnormal returns is already generated before the announcement date, which Schwert (1996) denoted as run-up returns. Significant run-up returns also explain the statistical significance and strong positive coefficient of the Abnormal return variable in our models.

The results of this section imply that shareholders of target companies can generate positive abnormal returns, and a target prediction model identifying targets at a certain accuracy can generate portfolio abnormal returns.

6.2 Portfolio returns

After evaluating the predictive power of the models and showing the potential abnormal returns that the models can potentially generate, we now estimate the ability and level of abnormal returns that models actually capture. Two types of portfolio returns are calculated in this section. First, we estimate portfolio abnormal returns to measure the actual return size that the models could capture from the total announcement returns. Second, we test an investment strategy of buying and selling predicted target company stocks and compare generated returns with the market return to measure the market-adjusted excess return of our portfolios

6.2.1 Portfolio Abnormal Returns

Here we estimate the level of abnormal returns that the portfolios constructed based on the the prediction models can capture. To do that, abnormal returns for each target company in the holdout sample is estimated using the function described in the beginning of this article.

First we calculate and report portfolio abnormal returns from the portfolio created by the model based on the entire dataset.

Portfolio composition
Predicted targtes: 770
Among which Actual Targets: 48
Among which Actual Non-Targets: 722

Accuracy Metrics
Recall(TPR): 0.57
FPR: 0.42
F1 score: 0.11

Portfolio Abnormal Return
From actual targets: 27.11
From actual non-targets: 6.3
Weighted Total: 7.6

The portfolio constructed based on the entire dataset comprises of 770 companies. Among them, only 48 are actual targets, which converts into 0.58 and 0.42 TPR and FPR, respectively. Despite large Type II errors, the model is able to generate 7.6 % of weighted total abnormal return. Additionally, we observe that actual targets generate considerably more returns, 27.11 %, which is diluted by the returns from non-target companies. Nevertheless, predicted as target non-targets still generate a positive 7.6 % abnormal return.

Then we calculate and report portfolio abnormal returns from the portfolio created by the model trained on Cluster 0 companies.

Portfolio composition
Predicted targtes: 428
Among which Actual Targets: 27
Among which Actual Non-Targets: 401

Accuracy Metrics
Recall(TPR): 0.6
FPR: 0.37
F1 score: 0.11

Portfolio Abnormal Return
From actual targets: 29.91
From actual non-targets: 6.17
Weighted Total: 7.67

Finally, we calculate and report portfolio abnormal returns from the portfolio created by the model trained on the Cluster 1 companies.

idx_lr_1 = pred_res_lr_1.index.values.tolist()
return_list_lr_1 = [return_list[index] for index in idx_lr_1]

pred_res_lr_1.insert(loc = len(pred_res_lr_1.columns), column = "Abnormal Return", value = return_list_lr_1)
pred_res_lr_1.head()

Portfolio composition
Predicted targtes: 315
Among which Actual Targets: 28
Among which Actual Non-Targets: 287

Accuracy Metrics
Recall(TPR): 0.72
FPR: 0.47
F1 score: 0.16

Portfolio Abnormal Return
From actual targets: 26.16
From actual non-targets: 4.6
Weighted Total: 6.52

Moving to returns generated by the portfolios constructed based on clustered models, we observe that the Cluster 0 portfolio generated the highest weighted total abnormal return of 7.66 % despite having the smallest precision and recall scores. This is merely due to the lowest number of inclusions of non-target companies (FPR is 0.36). Additionally, despite low TPR compared to the Cluster 0 portfolio, the 27 targets included in the Cluster 1 portfolio could generate around 4 % more abnormal return than those from Cluster 1. This can be because of the difference in characteristics of Cluster 0 and Cluster 1 companies. Companies in Cluster 0 are comparably in better financial health; hence possible synergetic targets for acquires, usually involving higher price premiums.

Portfolio composition
Predicted targtes: 743
Among which Actual Targets: 55
Among which Actual Non-Targets: 251

Accuracy Metrics
Recall(TPR): 0.65
FPR: 0.4
F1 score: 0.13

Portfolio Abnormal Return
From actual targets: 28.0
From actual non-targets: 5.52
Weighted Total: 7.18

As for the combined outputs, it can be observed that clustering resulted in the inclusion of a higher number of actual targets in the model at the same time by keeping Type II error lower. However, as can be concluded from comparing weighted total abnormal returns across the models, a higher number of actual targets and a lower number of actual non-targets did not convert into a higher return. 55 target companies of the clustered portfolios generated around 1% higher return than the 48 companies from the general model. At the same time, returns from actual non-targets from the general model’s portfolio exceeded non-targets in the clustered model portfolio. As a result, an overall higher weighted return for general model companies is reported. Hence, it can be concluded that actual returns are dependent not only on the model’s predictive power but also on the level of returns from both target and non-target companies.

In summary, the analysis of portfolio abnormal returns showed that the clustering approach systematically predicted both target and non-target companies better and achieved a higher F1 score. However, higher predictive power did not always converge into higher returns due to the difference in the size of the returns of classified companies. Nevertheless, the predictive model based on the Cluster 0 produced the highest portfolio abnormal returns in almost all scenarios, suggesting that one may benefit when buying only stocks predicted by the Cluster 0 model.

6.2.2 Portfolio investment returns

As already described in the section above, portfolio abnormal returns are calculated for the pre-defined observation period (60 days before and three days after the announcement), while in the real situation, even if one can correctly predict a potential target company, cannot really identify the optimal time of actual purchase and the holding period. Thus, in addition to determining portfolio abnormal returns, investment strategies are tested to identify whether a particular strategy of buying and selling potential target company stocks can generate market-adjusted excess return.

The investment strategy suggests annual rebalancing, which assumes buying and selling transactions for each year separately. According to this strategy, one will hold the stocks bought each year, either until the end of that year or three days after the acquisition. Considering that 2021 is still in progress, the observation end date is considered July 2021. Finally, observed returns are adjusted by market proxy, which is S&P 500 in the scope of this study.

In order to calculate market adjusted portfolio investment returns, we first create dataframe containing all securities with respective prices, including price for starting period, end period and the one after 3 day of acquisition announcement. The following sections form the dataframe, conducts API calls and inserts retrieved data into the dataframe.

After retrieving the necessary data we calculate and report profit per stock as described in the beginning of current section.

Next we retrieve market performance data, which is in our case S&P 500 index, and calculate and report market return for 2020 and 2021.

Market Return for 2020 is: 13.99%
Market Return for 2020 is: 13.05%

As in previous sections, we first calculate and report portfolio investment returns from the portfolio created by the model based on the entire dataset

Portfolio Return for 2020 is: 23.88%
Portfolio Return for 2020 is: 34.81%

Market adjusted Portfolio return for 2020 is: 9.889999999999999%
Market adjusted Portfolio return for 2021 is: 21.76%
Average Market adjusted Portfolio return is: 15.825%

Then we calculate and report portfolio investment returns from the portfolio created by the model trained on Cluster 0 companies.

Portfolio Return for 2020 is: 39.78%
Portfolio Return for 2020 is: 25.62%

Market adjusted Portfolio return for 2020 is: 25.79%
Market adjusted Portfolio return for 2021 is: 12.57%
Average Market adjusted Portfolio return is: 19.18%

Finally, we calculate and report portfolio investment returns from the portfolio created by the model trained on the Cluster 1 companies.

Portfolio Return for 2020 is: 10.71%
Portfolio Return for 2020 is: 40.78%

Market adjusted Portfolio return for 2020 is: -3.2799999999999994%
Market adjusted Portfolio return for 2021 is: 27.73%
Average Market adjusted Portfolio return is: 12.225000000000001%

Portfolio Return for 2020 is: 27.04%
Portfolio Return for 2020 is: 31.82%

Market adjusted Portfolio return for 2020 is: 13.049999999999999%
Market adjusted Portfolio return for 2021 is: 18.77%
Average Market adjusted Portfolio return is: 15.91%

The analysis of investment returns across different models, show that overall models enable generation of positive market adjusted returns. Particularly, the highest return is recorded by Cluster 0, whereas, the Cluster 1 model generated the lowest. Moreover the latter produced negative market adjusted return for 2020. Combined clustering results are a bit higher than the returns from the general model despite higher predictive power of clustering.

Overall, the analysis of this section shows that portfolio returns are dependent not only on correct predictions but also on the actual return of correctly predicted target company, market timing, and stock holding period.

Section 7: Conclusion

This Article aimed to test the feasibility of target prediction and examine the possibility of generating portfolio returns based on the prediction output. This Chapter discusses the findings on both separately.

The results suggest that it is possible to predict target companies at a certain degree. Moreover, comparing with and without clustering model results, we confirm the hypothesis that Clustering the sample improves prediction accuracy. Clustering the sample dataset through the Kmeans algorithm resulted in grouping higher liquid and lower levered companies in one cluster and the opposite to another. Clustered models produced better accuracy than the general model. Additionally, models on clustered data had better explanatory power as clustered models produced a more comprehensive view of significant variables. The effect of clustering was further tested on the hold-out sample. The pre-clustering model correctly identified 58% of targets and non-targets, whereas the clustered model identified 65% of targets and 60% of non-targets. Both Type I and Type II errors were higher for the general model, which allowed us to conclude that clustering helps to improve the accuracy. Overall, the clustering resulted in better predictive power, as in terms of overall accuracy, 0.6 against 0.58, as well as F1 score (0.13 against 0.11) as a combined metric of precision versus recall trade-off.

The second part of the analysis was concentrated on the calculation of portfolio returns. Both abnormal and investment returns were calculated and compared across three portfolios. Overall, the portfolio of predicted targets could generate positive abnormal returns. The returns based on the Optimal cut-off portfolio varied between 6.52% and 7.67%. The comparison of without clustering and clustered results suggests that the clustering approach systematically predicted both target and non-target companies better and achieved a higher F1 score. However, higher predictive power did not always converge into higher returns due to the difference in the size of the actual returns of classified companies. Nevertheless, the predictive model based on the Cluster 0 produced the highest portfolio abnormal returns in almost all scenarios, suggesting that one may benefit when buying only stocks predicted by the Cluster 0 model.

The results from the estimation of investment returns are consistent with the abnormal returns. Companies predicted by Cluster 0 generated the highest returns; however, the combined returns of clustered models were lower than those from the general model. Nevertheless, it is worth noting that the 2-years observation period is too short of concluding investment returns. Moreover, during 2020–2021 markets were in an upward trend and many non-target companies in our sample also generated positive returns. If the same analysis was done for a different period involving market downfalls, a different picture could have been experienced.

Overall, the article’s findings indicate that it is possible to predict target companies, and clustering can improve the prediction power of the models. However, higher accuracies do not always convert into higher returns as portfolio returns are also dependent on individual stock returns, market timing, and the stock holding period. This is making the abnormal return generation by identifying target companies an extremely challenging.

References

Barnes, P. (1998). Can takeover targets be identified by statistical techniques?: Some UK evidence. Journal of the Royal Statistical Society: Series D (The Statistician), 47(4), 573–591.

Danbolt, J. (2004). Target company cross‐border effects in acquisitions into the UK. European Financial Management, 10(1), 83–108.

Goergen, M., & Renneboog, L. (2004). Shareholder wealth effects of European domestic and cross‐border takeover bids. European Financial Management, 10(1), 9–45.

Hastie, T., Tibshirani, R., & Friedman, J. (2009). The elements of statistical learning. Cited on, 33.

Institute for Mergers, Acquisitions, and Alliances. (2021). M&A Statistics — Number and Value and Largest M&A Transactions by Region, Accessible at: https://imaa-institute.org/mergers-and-acquisitions-statistics/

Jensen, M.C. & Ruback, R.S. (1983). The Market for Corporate Control — The Scientific Evidence. Journal of Financial Economics, 11(1–4), 5–50.

Keown, A. J., & Pinkerton, J. M. (1981). Merger announcements and insider trading activity: An empirical investigation. The journal of finance, 36(4), 855–869.

MacKinlay, A. C. (1997). Event studies in economics and finance. Journal of economic literature, 35(1), 13–39.

Palepu, K. G. (1986). Predicting takeover targets: A methodological and empirical analysis. Journal of accounting and economics, 8(1), 3–35.

Piesse J., Lee CF., Lin L., Kuo HC. (2013) Merger and acquisition: Definitions, motives, and market responses. In: Lee CF., Lee A.C. (eds) Encyclopedia of Finance. Springer, Boston, MA

Powell, R. (2004). Takeover prediction models and portfolio strategies: a multinomial approach. Multinational Finance Journal, 8(1/2), 35–72.

Schwert, G. W. (1996). Markup pricing in mergers and acquisitions. Journal of Financial economics, 41(2), 153–192.

Tunyi, A. (2021). Fifty years of research on takeover target prediction: a historical perspective. In Qualitative Research in Financial Markets. Emerald Group Holdings Ltd.