0% found this document useful (0 votes)
18 views15 pages

CH 3 Onlineannex

The document describes data used in an analysis of the US mortgage market. It details the sources and variables in HMDA and bank call report data, and how fintechs, banks, and other lenders were identified. Additional details on geographical mapping and differences in bank and fintech business models are also provided.

Uploaded by

oke
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
18 views15 pages

CH 3 Onlineannex

The document describes data used in an analysis of the US mortgage market. It details the sources and variables in HMDA and bank call report data, and how fintechs, banks, and other lenders were identified. Additional details on geographical mapping and differences in bank and fintech business models are also provided.

Uploaded by

oke
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 15

CONFIDENTIAL

CHAPTER

3
Online Annex 3.1. Case Study on Neobanks
Data description
The data set comprises 37 neobanks and 640 traditional banks across 18 economies, including in Europe
(UK, France, Germany, Italy, Spain, Lithuania, Poland, and Russia), Asia (Kazakhstan, Japan, Hong Kong
SAR, Indonesia, Korea, and China) and the Western Hemisphere (Canada, US, Brazil, and Mexico).
Assembling the data required the following steps:
Step 1—to identify the universe of neobanks, we employed several sources, including NeoBanks.app
(The list of neobanks and digital banks in the world in 2022), WhiteSight (2021), S&P Global Market
Intelligence (banks and savings banks tagged as “neobanks”), market reports on both unlisted and
listed neobanks and digital banks (Goldman Sachs 2022, Morgan Stanley 2021, and S&P Capital IQ
2021), policy notes (Clerc and others 2020) and extensive conversations with bank analysts, investors
and (including former) management of a number of neobanks and traditional banks.
Step 2—the initial sample (250) was narrowed down to those neobanks that make their financial
statements available and report them on a comparable format to banks. We then pulled their
consolidated financial data from S&P Global Market Intelligence or, alternatively, from their latest
publicly available filings (quarterly/semi-annual reports, Pillar-3 reports, etc.) for the two most recent full
fiscal years (generally 2019 and 2020). In the case of listed neobanks, we also relied on sell-side
research reports and models. The result of this was a sample of 37 neobanks in 18 economies.
Step 3—For each of the 18 economies identified in the previous step, we collected the consolidated
financial data for all active banks, both listed and unlisted, that are available in S&P Global Market
Intelligence. We excluded neobanks identified in the first step. In the UK, we also excluded
consolidated Group data and kept their UK business, whenever we are given this option. In the US, we
just kept the “midcap banks” (as defined by Morgan Stanley Research). All in all, we collected 640
traditional banks across 18 economies.

Methodology
We benchmarked each neobank against the universe of its respective local traditional banks, following
three steps:
Step 1—We calculated, for each institution, key performance metrics across nine dimensions: yields,
spreads, margins, efficiency, credit quality, capital, liquidity, profitability, and growth.
Step 2—We used traditional bank data to compute, for every single metric and individual economy, both
the (asset-weighted) average and the standard deviation. We then computed the distance of each
neobank to the (asset-weighted) average of its local (traditional) peer group, per metric, as number of
standard deviations. This data transformation allows us to make cross-country comparisons.
Step 3—To explore differences across geographies and types of institutions, we also split the sample per
region (Europe, Asia, Western Hemisphere), type of economy (AE vs EM), year of origin (pre and post
20101) and return profile (profitable vs loss-making), and compute median values for each of them.

1
The “neobank” concept originated in the early 2010s alongside the advent of policies supporting open banking around the World
(Microsoft, Linklaters and Accenture 2019); it is different to the “direct bank” (branchless bank) concept that had been around
since at least the early 1980s (Exton Research 2021, WhiteSight 2021).
Global Financial Stability Report Chapter 3

Online Annex 3.2. Case Study: US Mortgage Market


The annex describes the data used in the analysis, some additional stylized facts, and presents detailed
regression results and robustness checks.

Data
At the heart of the analysis is the detailed Home Mortgage Disclosure Act (HMDA) data. The dataset
contains a number of variables (characteristics) for almost all mortgage applications made in the US.2

The HMDA data is available on an annual (calendar year) basis from the US Consumer Financial
Protection Bureau. The analysis is based on the data from 2007–20, which was the latest available data
at the time of publication.3 The number of recorded data points was significantly expanded in 2018,
including the reporting of loan-to-value ratios, the age profile, interest rates and various details on loan
terms. Descriptions of the individual data points are available from the dedicated FFIEC website.4

The analysis uses only originated mortgages, identified in the data as approved and accepted mortgage
applications. The sample is further restricted to mortgages for home purchases or refinancing, first lien
mortgages,5 and loans for 1–4 family homes. This results in a total of about 110 million observations.

Identification of fintechs, non-banks, banks and credit unions

The identification of fintechs (fintech mortgage originators) follows the definition in the literature, including
some recent updates. The starting point is the seminal paper of Buchak and others (2019), who identify
fintechs as those offering an online application process without the need of human interaction. Those are
Quicken Loans (Rocket Mortgage), Amerisave, Guaranteed Rate, Cashcall, Homeward Residential/PHH
Mortgage Corporation, and Movement Mortgage. Jagtiani and others (2021) add two additional firms that
started to fully operate in 2016 or later: Better Mortgage and SoFi Mortgage. In total, these eight fintechs
are responsible for about 11 percent of mortgage origination in 2020 and yield a total of 6.7 million
origination observations (all fintechs are non-banks by definition).

Following the literature, non-banks are defined as all non-depository institutions, identified in the dataset
as all reporters with an agency code equal to 7 (regulatory agency is the US Department of Housing and
Urban Development (HUD)).

Banks are all depository institutions with a regulatory agency that is a bank supervisor (agency codes 1,
2, 3 and 9). Credit Unions are identified as all institutions regulated by the National Credit Union
Administration (NCUA).

2
Banks that have no offices in non-metropolitan areas and those below a certain asset threshold do not have to report their
mortgage applications. Non-banks (non-depository institutions) are covered as long as they originate a significant number of
mortgage loans (currently >100 closed-end mortgages or >200 open-end credit lines, but these thresholds have varied over time).
See https://files.consumerfinance.gov/f/documents/cfpb_2022-hmda-insitutional-coverage_03-2021.pdf.
3
Historical data is available at https://www.consumerfinance.gov/data-research/hmda/historic-data/, whereas data from 2017 is
available from https://ffiec.cfpb.gov/data-publication/.
4
https://ffiec.cfpb.gov/documentation/2020/.
5
Second or higher-lien mortgage applications represent of very small share of the overall sample (<3.5 percent of observations).

IMF | April 2022 2


Global Financial Stability Report Chapter 3

Bank data

For the purpose of this case study, banks are defined as all institutions filing regular US CALL reports
(FFIEC 031/041). This excludes most credit unions and thrifts, but it includes almost all other types of
depository institutions in the US.

Bank data are taken from the US CALL reports, for which data prior to 2010 is available from the Chicago
Fed, and the FFIEC from 2011 onwards.6 Balance sheet variables are measured at year-end, whereas
expense and income variables are aggregated from quarterly to annual values where necessary. The
data is merged with the HMDA data using the so-called RSSD identifier, as well as the FDIC certifier, and
OCC charter numbers.7

Geographical mapping

The HMDA provides the census tract, which is then translated into a ZIP code using both the 2010 US
Census Bureau data (from 2011 onwards) and the U.S. Department of Housing and Urban Development
ZIP code crosswalk files (for prior observations).8 The ZIP code with the highest overlap with the census
tract (based on percent of population or percent of residential addresses) is used.

The data on bank offices is from the FDIC survey of deposits (SOD), which contains the address and ZIP
code of all branches of all FDIC-insured banks in the US. To calculate the number of bank branches
within a given radius of a borrower, the sample is restricted to physical branches that offer the full range
of services (BRSERTYP=11,12).9

Distances between the ZIP code of the borrower and bank branches are calculated using the NBER ZIP
code distance database.10

Additional stylized facts—business models


Bank and fintech mortgage originators exhibit two additional significant differences.

First, while refinancing mortgages make up the largest share of fintech mortgages, banks tend to
originate a more similar proportion of mortgages for home purchases and refinancing (Figure 3.1, panel
1). The literature has not reached a definitive conclusion on the causes for this difference (Jagtiani and
others 2021), but mortgage refinancing enables fintechs to grow faster (the market for mortgage
refinancing is naturally larger than that for home purchases).

6
Chicago Fed website: https://www.chicagofed.org/banking/financial-institution-reports/commercial-bank-data-complete-2001–2010;
FFIEC website: https://cdr.ffiec.gov/public/pws/downloadbulkdata.aspx.
7
Prior to 2008, the HMDA data does not provide the necessary bank identifiers. The RSSDs, as well as the OCC Charter and FDIC
identifiers can, however, be constructed from the Respondent ID in the HMDA data. See https://s3.amazonaws.com/cfpb-hmda-
public/prod/help/2017-hmda-fig.pdf#page=14.
8
Census relationship file: https://www2.census.gov/geo/docs/maps-data/data/rel/zcta_tract_rel_10.txt; HUD USPS ZIP code
crosswalk files: https://www.huduser.gov/portal/datasets/usps_crosswalk.html#codebook.
9
The SOD data is annual as of end of June for each year, while the HMDA data includes all applications until the end of a calendar
year. Branches may have closed between end-June and end of the year or new branches may have opened between the
beginning of the year and end-June. Cases of branch openings and closures are, however, infrequent.
10
https://www.nber.org/research/data/zip-code-distance-database.

IMF | April 2022 3


Global Financial Stability Report Chapter 3

A second key difference is that banks retain a higher share of originated mortgages on their balance
sheet (panel 2). This illustrates both the much larger balance sheet and funding capacity of banks as well
as the “originate-to-distribute” business model of fintechs. Banks also originate a larger share mortgages
with high loans amounts (so-called “jumbo mortgages”) which exceed the limits set by the Federal
Housing Finance Agency (FHFA) and are therefore not eligible to be purchased, guaranteed, or
securitized by the government-backed enterprises Fannie Mae and Freddie Mac.

Figure 3.1 Type of mortgages and securitization


Fintechs specialize in refinancing… …and securitize and sell-off a larger share of originated
mortgage loans than banks.
1. Refinancing versus mortgages for home 2. Securitization by purchaser type
purchases (percent) (percent)

Source: US Home Mortgage Disclosure Act data, IMF staff calculations.

Additional stylized facts—access to credit


Fintechs, similar to other non-banks, tend to support access to mortgage credit for potentially
underserved borrowers. Non-banks, on average, provided a higher share of loans to borrowers with lower
incomes (Figure 3.2, panel 1). This is, at least in part, related to the younger age of borrowers attracted
by fintechs. Banks tend to have a particularly high share of borrowers in the top 5 percent of borrower’s
income, which naturally also tend to have higher loan amounts. A similar picture emerges when looking at
average income of borrowers relative to the wider metropolitan area they reside in (panel 2). This is
measured at the census tract level, which can be loosely thought of as the wider neighborhood.11

11
A census tract comprises between 1200 and 8000 people. See https://www.census.gov/programs-surveys/geography/about/glossary.html.

IMF | April 2022 4


Global Financial Stability Report Chapter 3

Figure 3.2 Mortgage borrowers by income and affluency of neighborhood


Non-banks, including fintechs, serve a higher share of …and borrowers in relatively less affluent
lower-income borrowers… neighborhoods (census tracts)
1. Share of mortgage origination by borrower 2. Share of mortgage origination by relative income
income level of the borrower’s census tract
(percent) (percent)

Source: US Home Mortgage Disclosure Act data, IMF staff calculations.


Note: Based on the entire data sample 2007–2020. Share of originations is based on the loan amount.

Econometric model and variable definitions


The key result for the impact of fintech competitive pressure on banks’ mortgage interest income is based
on the following model:
𝑀𝑜𝑟𝑡𝑅𝐸 , 𝛽𝐶𝑜𝑚𝑝𝑃𝑟𝑒𝑠𝑠𝑢𝑟𝑒𝐹𝑖𝑛𝑡𝑒𝑐ℎ , 𝛾𝑋, 𝜀 , 1

where 𝑀𝑜𝑟𝑡𝑅𝐸 , is the return on equity in year 𝑡 related to bank’s 𝑏 mortgage interest income (not
origination income) from loans backed by 1-4 family real estate. 𝐶𝑜𝑚𝑝𝑃𝑟𝑒𝑠𝑠𝑢𝑟𝑒𝐹𝑖𝑛𝑡𝑒𝑐ℎ , ← measures
the competitive pressure from fintechs of a given bank’s 𝑏 mortgage origination business, measured at
the ZIP code-level 𝑧, and aggregated to the bank level b. It is defined as 𝐶𝑜𝑚𝑝𝑃𝑟𝑒𝑠𝑠𝑢𝑟𝑒𝐹𝑖𝑛𝑡𝑒𝑐ℎ ,
∑ 𝑆ℎ𝑎𝑟𝑒𝑀𝑜𝑟𝑡𝑔𝑎𝑔𝑒𝑂𝑟𝑖𝑔 , , ∆𝑀𝑎𝑟𝑘𝑒𝑡𝑆ℎ𝑎𝑟𝑒𝐹𝑖𝑛𝑡𝑒𝑐ℎ , ← , the share (in percent) of mortgage
origination in a given ZIP-code area 𝑧 for a given bank 𝑏 in year 𝑡 multiplied by the change in the market
share of fintechs in a ZIP-code area 𝑧, aggregated over all ZIP code-areas in which a given bank has
mortgage originations. The higher this number, the higher the competitive pressure from fintechs for a
given bank. The number varies between -100 (fintechs disappear from all areas where a bank is active) to
+100 (fintechs take the entire market in all areas where a given bank is active). 𝑋 are controls that vary at
the bank-level and over time. A key control in 𝑋 are IT-related expenses (as a share of total expenses).

Results and robustness checks


Table 3.1 presents the key results of the regressions with 𝑀𝑜𝑟𝑡𝑅𝐸 , as the dependent variable
(winsorized at the 1 percent and 99 percent level to limit the effect of outliers). No Zip codes served and
HHI measure a given bank’s market diversification and its concentration across ZIP codes respectively.
HHI is the Herfindahl-Hirschmann index. Data processing expenditures are bank-level measures (CALL

IMF | April 2022 5


Global Financial Stability Report Chapter 3

report series RIAD C017), relative to bank equity (consistent with the dependent variable). For models (5)
and (6), the sample is restricted to those banks that report these expenditures.12

Bank-level controls include the equity ratio (total bank equity to total assets) as well as the deposit ratio
(deposits as a share of total non-equity liabilities). Both are highly significant, but do not add much to the
explanatory power of the regressions. Unobserved bank-level differences due to, for instance, a
persistent difference in size or business model are captured by bank-level fixed-effects. Analogously,
common market-related movements in mortgage interest income, due to changes in demand or risk-free
rates, are captured by time fixed-effects. The results presented in the main text are based on model (6).
The marginal effect of competitive is -0.422. Banks with additional IT expenditures of about 3.37 percent
of bank equity (=0.422/0.125) can offset the effect of a 1 percent increase in the fintech composite market
share (fintech competitive pressure).

The interaction term in model (7) shows that the marginal effect of competition does not significantly
change with data processing expenditures (DPE). Higher DPE does, however, increase mortgage-related
income across all specifications and thereby can help to offset the potential impact of competitive
pressure from fintechs. 13

Table 3.1 Baseline regression results


(1) (2) (3) (4) (5) (6) (7)
Competitive -3.566*** -2.613*** -0.422** -0.395** -0.391* -0.422** -0.417**
pressure (t-1) (1.88e-05) (0.00336) (0.0482) (0.0441) (0.0502) (0.0380) (0.0397)
No ZIP codes 0.000443 0.000806* 0.000878*
served
HHI -0.000117 -8.86e-05 -8.70e-05
Data processing 0.151* 0.125* 0.146*
expenditure (0.0662) (0.0555) (0.0837)
(DPE)
Competitive 0.0140
pressure (t-1) x (0.388)
DPE
Observations 32,411 32,411 22,752 22,289 27,656 21,945 27,656
R-squared 0.014 0.098 0.834 0.850 0.830 0.853 0.830
Year FE YES YES YES YES YES YES
Bank FE YES YES YES YES YES YES
Bank-level YES YES
controls
Note: ***, ** and * denotes significance at the 1 percent,5 percent,10 percent level respectively. P-values in parenthesis. Standard errors are double-clustered at the year and bank level.

The additional results for other dependent variables shown in the main text are provided in Table 3.2.,
models (1) and (4). Models (1)–(3) show the results for the change in the deposit share (percent of total
non-equity liabilities) as the dependent variable, whereas models (4)–(6) have the mortgage lending
share (percent of total loans) as the left-hand side variable. Otherwise, the specifications are consistent
with those in Table 3.1. Even without including proper controls, the effect of competitive pressure from
fintechs is not significant.

12
Generally, if above $100K and >7 percent of a bank’s “other non-interest expenses”.
13
All results are robust to excluding the period of the Global Financial Crisis (GFC).

IMF | April 2022 6


Global Financial Stability Report Chapter 3

Table 3.2 Additional regression results

(1) (2) (3) (4) (5) (6)


Competitive pressure -0.0300 0.0193 -0.0292 -0.721 -0.387 -0.716
(t-1) (0.485) (0.483) (0.626) (0.299) (0.318) (0.386)
No ZIP codes served -0.000376 0.0135
HHI 3.25e-05 -0.000120
Observations 30,499 30,011 21,872 31,940 31,472 22,288
R-squared 0.063 0.213 0.443 0.000 0.252 0.252
Year FE YES YES YES YES YES YES
Bank FE YES YES YES YES
Bank-level controls YES YES
Note: The dependent variable for models (1)-(3) is the change in the deposit share (percent of total non-equity liabilities) and for models (4)–(6) the change (percent) in the mortgage lending share ( percent of
total loans).***, ** and * denotes significance at the 1 percent,5 percent,10 percent level respectively. P-values in parenthesis. Standard errors are double-clustered at the year and bank level.

IMF | April 2022 7


Global Financial Stability Report Chapter 3

Online Annex 3.3. Risk Analysis on DeFi Lending


This annex describes the data, models, and estimation strategy employed in the analysis of the various
risks discussed in the DeFi section, including the details of the event study carried out to assess the risk
of cyberattacks.
Probability of Liquidation and Expected Losses
Following Merton (1974) and Black and Cox (1976), a quantitative model is proposed which exploits DeFi
user account-level data that includes all assets borrowed and posted as collateral as of end-2021 for
selected DeFi platforms: Aave v2, Compound v2, and C.R.E.A.M. Finance on both Ethereum and
Polygon network.

Modeling crypto asset portfolio. The dynamics of crypto asset price are modelled as follows.

𝑑𝑝 ,
𝜇 𝑑𝑡 𝜎 𝑑𝑊 , 𝑘 1,2, ⋯ 𝐾, 𝑡 0 2
𝑝 ,

where 𝒑 𝑝 , , ,⋯,
is the vector of price of crypto asset 𝑘 at time 𝑡, 𝑝 , 0, 𝝁 𝜇 , ,⋯, is the
drift, 𝚺 diag 𝜎 , ,⋯, is the volatility matrix, 𝑊 , 𝑘 1,2, ⋯ , 𝐾 is a multi-dimensional standard
Brownian motion with 𝑑𝑊 𝑑𝑊 𝜌 , 𝑑𝑡, and 𝝆 𝜌, , , ,⋯,
is the correlation matrix.14

A user’s borrowing value and collateral value can be represented as a portfolio of assets borrowed and
posted as collateral. Namely, 𝑉 , ∑ 𝑛 , , 𝑝 , , where 𝑉 , is the value of borrowing assets, 𝑛 , is the
,
number of borrowing of 𝑘-th assets. The dynamics of the borrowing value is 𝒘 , diag 𝒑 𝑑𝒑𝒕 ,
,

where 𝑤 , 𝑤 , , , ,⋯,
is the vector of the share of the borrowed asset 𝑘 within the total borrowing.
Similarly, 𝑉 , ∑ 𝑛 , , 𝑝 , where 𝑉 , is the value of collateral assets, 𝑛 , is the number of holding of 𝑘-
,
th assets. The dynamics of collateral is 𝒘 , diag 𝒑 𝑑𝒑𝒕 , where 𝑤 , 𝑤 , , , ,⋯,
is the vector
,

of the share of the collateral 𝑘 within the total value of collateral posted.

𝑑𝑉 ,
𝜇 𝑑𝑡 𝜎 𝑑𝑊 , 𝑖 ∈ 𝑏, 𝑐 , 𝑡 0 3
𝑉,

𝒘𝒃, 𝛒𝚺𝚺 𝒘 ,
where 𝜇 𝒘 𝝁 and 𝜎 𝒘 , 𝚺𝚺 𝒘 , for 𝑖 ∈ 𝑏, 𝑐 , and 𝜌 .

Probability of liquidation. Liquidation is triggered when a user fails to maintain the collateral value to
meet the collateral requirement. Suppose 𝜃 is the collateral factor (that is, discounting factor of the
collateral set by the DeFi platform), the maximum loanable value of the user is then defined as

14
It is possible that the drift and volatility may change over time, or there can be jumps in the price that bring discontinuity in the
price. Hence the assumption of geometric Brownian motion is strong given the uncertainties regarding crypto asset price. The
estimates should be viewed as approximation. By simplifying the model, closed-form formulae of probability of liquidation and
expected loss can be derived, which provides economic intuition, such as the relationship between volatilities of assets borrowed
and posted as collateral with the riskiness.

IMF | April 2022 8


Global Financial Stability Report Chapter 3

𝑉 , 𝒘 , 𝒑 where 𝑉 , is the value of maximum loanable assets, 𝑤 , 𝑤 , , 𝜃 , , ,⋯,


is the vector
of weight of assets within the total collateral posted by the user multiplied by the collateral factor.

Similar to the collateral value, the dynamics of loanable assets can be written by the following.

𝑑𝑉 ,
𝜇 𝑑𝑡 𝜎 𝑑𝑊 , ,𝑡 0 4
𝑉 ,

𝒘𝒃, 𝛒𝚺𝚺 𝒘 ,
where 𝜇 𝒘 , 𝝁 and 𝜎 𝒘 , 𝚺𝚺 𝒘 , and 𝜌 .

Liquidation is triggered when the health indicator, 𝐻𝑒𝑎𝑙𝑡ℎ 𝑉 , /𝑉 , falls below 1. The timing of
triggering liquidation, 𝜏 𝑡, where the health falls and passes the threshold, is stochastic. The probability
of triggering liquidation before the horizon 𝑇 (𝑃𝐿 𝑡, 𝑇 ) can be estimable by the following equation,

2 log 𝐻 𝜇̂
𝑃𝐿 𝑡, 𝑇 𝑃 𝜏 𝑇 Φ 𝑑 exp Φ 𝑑 5
𝜎

where Φ ⋅ is a cumulative distribution function of standard normal distribution,

log 𝐻𝑒𝑎𝑙𝑡ℎ 𝜇̂ 𝑇 𝑡 log 𝐻𝑒𝑎𝑙𝑡ℎ 𝜇̂ 𝑇 𝑡


𝑑 ,𝑑 6
𝜎 √𝑇 𝑡 𝜎 √𝑇 𝑡

for 𝜇̂ 𝜇 𝜇 , and 𝜎 𝜎 𝜎 2𝜎 𝜎 𝜌 .

The platform-level probability is estimated by the weighted average of user level estimates of the
probability with the weights proportional to the amount of value of borrowing.

Modeling expected loss. In case of liquidation, the platform will call for a liquidator in exchange for a
liquidation bonus, say, 𝛼 100 percent of the principal, and receive the repayment from the liquidator. If
this process completes instantly, liquidation loss will not happen. However, liquidation may take some
time. Suppose liquidation completes at time 𝑇 𝜏 𝜏′, namely 𝜏′-periods after it was triggered). The
collateral value may fall below the principal (and the liquidation bonus) due to price fluctuations. The cash
,
flow of the platform is 𝑃𝑎𝑦𝑜𝑓𝑓 min 𝛼𝑉 , 𝑉 , ,𝑉 , 𝑉 , 1 max 1 𝛼 , 0 . The loss
,

, ,
rate is then defined as 𝐿𝑜𝑠𝑠 max 1 𝛼 .
, ,

Consequently, the expected loss (𝐸𝐿 𝑡, 𝑇 ) can be obtained from the following equation:

𝐸𝐿 𝑡, 𝑇 𝑃 𝜏 𝑇 ⋅ 𝐸 𝐿𝑜𝑠𝑠 |𝜏 𝑇 𝑃 𝜏 𝑇 1 𝛼 Φ 𝑑 𝑒 Φ 𝑑 7

where

log 1 𝛼 log 1/Θ 𝜇̂ 𝜏


𝑑 , 𝑑 𝑑 𝜎 √𝜏 , 8
𝜎 √𝜏

IMF | April 2022 9


Global Financial Stability Report Chapter 3

, ∑ , ,
for 𝜇̂ 𝜇 𝜇 ,𝜎 𝜎 𝜎 2𝜎 𝜎 𝜌 , and Θ ∑ 𝜃 𝑤 is the
, ∑ , ,

weighted average collateral factor at time 𝑡. Since 𝑉 , 𝑉 , holds at time 𝜏, the sufficiency of the
, ,
collateral at time 𝜏 is given by .
, ,

Parameter estimation. The drift 𝝁, volatility 𝚺, and correlation 𝝆 are estimated using daily crypto asset
price returns. Euler approximation is applied to the price process:

𝑹 diag 𝒑 𝒑 𝒑 𝝁𝒅 Δ𝑡 𝚺𝐝 √Δ𝑡𝒆 9

where 𝑹 𝑅 , is the 𝐾 1 -dimensional time-series of crypto asset price returns at


, ,⋯ ; , ,⋯,

time 𝑡 , 𝑛 1,2, ⋯ 𝑁; 𝒆 𝑒 is the error term with i.i.d. 𝐾 1 -dimensional Gaussian


∈ , , , ,⋯,
distribution ∼ 𝑁 𝟎 , 𝑰 ; 𝝁𝒅 is the 𝐾 1 -dimensional drift parameter vector denoted by annual rate;
and 𝚺𝐝 𝚺𝐝 ′ is the 𝐾 𝐾 -dimensional covariance matrix of the annual return. Δ𝑡 𝑡 𝑡 is the time
interval.

The estimator of drift parameter 𝝁 and covariance 𝚺𝒅 𝚺𝒅 are obtained as

1 1
𝝁 𝑹 and 𝚺𝒅 𝚺𝒅 𝑹 𝝁𝒅 𝑹 𝝁𝒅 . 10
𝑁Δ𝑡 𝑁 1 Δ𝑡

𝚺𝒅 is given by Cholesky decomposition. Volatility 𝜎 , ,⋯, is available from the diagonal element, and
correlation 𝜌 , , , ,⋯,
is available by normalizing (𝑖, 𝑗) element of the covariance matrix 𝚺𝒅 𝚺𝒅 by the
product of volatility 𝜎 and 𝜎 .

The horizon 𝑇 is set to 1 (year), the time interval is set as Δ𝑡 (year) to use daily data for all available
crypto price used in the selected DeFi lending platform: Aave v2, Compound v2, and C.R.E.A.M Finance
as of 2021-12-22. The data source is CoinGecko which spans from January 1, 2019 to December 22,
2021 at daily frequency.15 User-level borrowing and collateral outstanding data is available for all crypto
assets used in the selected platforms from The Graph. The average duration of liquidation 𝜏′ are
estimated around 0.35 (year), by assuming an exponential distribution (𝜏 ∼ 𝐸𝑥𝑝 𝜆 ), based on the data of
liquidation records of Compound v2 available from The Graph and the weekly snapshots of all account
balance during 2020.16
Event Analysis on Cyberattacks
A comprehensive dataset regarding the dates and magnitudes of cyberattacks on DeFi platforms was
constructed from various sources, including the following: Chainalysis (2021), CryptoSec.info (2022),

15
The estimation was carried out by winsorizing the daily returns at 1 percent and 99 percent to filter out outliers.
16
The weekly account level balance data was available only during 2020 from its data API. Given that the number of users has
increased significantly after 2021, shorter liquidation duration is likely for more recent period. The estimate of 𝜏 0.35 should
involve considerable uncertainty.

IMF | April 2022 10


Global Financial Stability Report Chapter 3

ImmuneFi (2021), and rekt (2022).17 We identified 93 incidents as cyberattacks to DeFi during the period
of January 1, 2020 to January 15, 2022.

CAR analysis on crypto asset prices

Using this dataset, the cumulative abnormal returns (CAR) analysis is performed with respect to crypto
assets price returns and total value locked in the platforms.

This analysis exploits daily data of crypto asset price available from CoinGecko, including 11,722 assets
in total, starting from January 2, 2020 to December 22, 2021.18 To estimate the abnormal returns, the
chapter first constructs a dataset of logarithmic market returns which are computed by taking all assets
into a basket with weighted average according to the market capitalization.19 The returns of the basket
are regarded as the market portfolio for each platform.

Using the market returns, the following one-factor model is estimated for crypto assets issued by the
attacked platforms:

𝑅𝑒𝑡𝑢𝑟𝑛 , 𝑅𝑖𝑠𝑘 𝑓𝑟𝑒𝑒 𝑟𝑎𝑡𝑒 𝛼 𝛽 𝑀𝑎𝑟𝑘𝑒𝑡 𝑟𝑒𝑡𝑢𝑟𝑛 𝑅𝑖𝑠𝑘 𝑓𝑟𝑒𝑒 𝑟𝑎𝑡𝑒 𝐴𝑅 , 11

where 𝐴𝑅 is defined as the abnormal returns.20 The first two components of the RHS constitute the
systemic variation.21

𝑆𝑅 , 𝛼 𝛽 𝑀𝑎𝑟𝑘𝑒𝑡 𝑟𝑒𝑡𝑢𝑟𝑛 𝑅𝑖𝑠𝑘 𝑓𝑟𝑒𝑒 𝑟𝑎𝑡𝑒 12

Estimation is carried out within the 60 days window prior to the attack. Among 93 episodes, there are
episodes where the same platform was attacked multiple times. In such a case, only the first attack is
included in the sample, as the first attack often changes the way the market behaves, as shown in the
chapter.22

The abnormal returns 𝐴𝑅 , are estimated as:

𝐴𝑅 , 𝑅𝑒𝑡𝑢𝑟𝑛 , 𝑅𝑖𝑠𝑘 𝑓𝑟𝑒𝑒 𝑟𝑎𝑡𝑒 𝛼 𝛽 𝑀𝑎𝑟𝑘𝑒𝑡 𝑟𝑒𝑡𝑢𝑟𝑛 𝑅𝑖𝑠𝑘 𝑓𝑟𝑒𝑒 𝑟𝑎𝑡𝑒 13

where 𝛼 , 𝛽 ) are the estimated coefficients. The cumulative abnormal returns are defined as follows:

17
Multiple sources (including additional news sources, press releases, any announcements, etc.) were used to validate the dates
and numbers for final judgment.
18
Actual number of samples used in estimation varies depending on the availability of the data during the estimation window for
each cyberattack incident. On average, more than 5,800 assets are used to construct the market returns.
19
Individual returns are winsorized at 1 percent and 99 percent to filter out outliers.
20
Different from simple returns, logarithmic returns have more consistency and higher precision. Given the magnitude of the shock,
the chapter use logarithmic returns instead of simple returns. Note 𝑅 log . This suggests 𝑃𝑟𝑖𝑐𝑒
𝑃𝑟𝑖𝑐𝑒 exp 𝑅 , and 𝑃𝑟𝑖𝑐𝑒 𝑃𝑟𝑖𝑐𝑒 exp ∑ 𝑅 𝑃𝑟𝑖𝑐𝑒 exp ∑ 𝑆𝑅 exp 𝐶𝐴𝑅 , justifying the last
term exp 𝐶𝐴𝑅 to be considered as the variation associate with the cumulative abnormal returns.
21
Hu and others (2018) reports high correlation of crypto asset returns with the market portfolio constituted by crypto assets, which
justifies the one-factor model such as Liu and others (2022).
22
In addition, samples with less than 7 days of datapoints are dropped. Consequently, 28 episodes are selected.

IMF | April 2022 11


Global Financial Stability Report Chapter 3

𝐶𝐴𝑅 , 𝐴𝑅 , 14

Finally, the logarithmic CAR is converted into CAR: 𝐶𝐴𝑅 exp 𝐶𝐴𝑅 1.

Figure 3.3 Cumulative Abnormal Returns


Cumulative Abnormal Returns of Token Prices issued by DeFi Platforms after Cyberattacks

(percent) Days after the incident


1 2 3 4 5 6 7 8 9 10 11 12 13 14 15
0

‐25

‐50

‐75 Interquartile range Median


Sources: Chainalysis, CoinGecko, CryptoSec.info, Defi Llama, ImmuneFi, rekt, IMF staff calculations.
Note: In the event analysis on DeFi token prices, cumulative abnormal return (CAR) is computed by a regression of excess
returns over the market excess return, a weighted average of all crypto assets minus risk-free rate.

CAR analysis on total value locked

A similar analysis is carried out for total value locked of DeFi platform. In this case, the sample includes
all DeFi platforms for which data is available from Defi Llama, including 1,028 DeFi projects in total.23
Instead of abnormal returns against the market, excess growth of total value locked is measured relative
to the total market growth:

𝐸𝑥𝑐𝑒𝑠𝑠 𝐺𝑟𝑜𝑤𝑡ℎ , 𝐺𝑟𝑜𝑤𝑡ℎ , 𝑀𝑎𝑟𝑘𝑒𝑡 𝐺𝑟𝑜𝑤𝑡ℎ , 15

Similar to the CAR analysis on the crypto assets price returns, CAR is estimated as

𝐶𝐴𝑅 , 𝐸𝑥𝑐𝑒𝑠𝑠 𝐺𝑟𝑜𝑤𝑡ℎ , 16

and converted into simple return format by 𝐶𝐴𝑅 exp 𝐶𝐴𝑅 1.

The estimated results are presented in the chapter Figure 3.11 panel 2.

23
Actual number of samples used in estimation varies dependent on the availability of the data during the estimation window for
each cyberattack incident. On average, more than 130 DeFi projects were used to construct the market returns.

IMF | April 2022 12


Global Financial Stability Report Chapter 3

Online Annex 3.4. Efficiency Analysis on Financial


Institutions and DeFi Platforms
This section provides the estimation method of the price-cost margins of financial intermediaries and their
marginal costs as well as the decomposition of marginal costs. This analysis follows the estimation
method in Berger and others (2009), who calculate the Lerner Index as a proxy for the market power.

Data Description
Bank/nonbank-level granular data from Fitch Connect comprises 18,011 banks from 137 countries (37
AEs and 100 EMs), 1,136 nonbanks from 46 countries (20 AEs and 26 EMs), spanning from 2007 to
2020 at the annual frequency. The daily data from selected DeFi platforms (Aave and Compound) covers
the period between January 2, 2020 to January 28, 2022.24

Estimation of Margins and Marginal Costs


The common approach in the literature is the ‘intermediation approach’ where deposits are an
intermediate input in the production of loans (Freixas and Rochet 1997). We take the assumption of a
single output, in line with a seminal study by Berger and others (2009) and Weill (2013) using total assets
as the output measure, while revenue associated with the output is the interest and non-interest income.

We then estimate the price-cost margins and marginal costs of banks, nonbanks, and DeFi platforms.25
The price-cost margin 𝑃𝐶𝑀 , is the difference between price and marginal cost, which is calculated as:

𝑃𝐶𝑀 , 𝑃, 𝑀𝐶 , 17

where 𝑃 , is the price of total assets proxied by the ratio of total revenues (the sum of interest and
non-interest income) to total assets for firm26 𝑖 at time 𝑡 (𝑃𝐶𝑀 would be zero in case of perfect
competition).

As marginal costs are not directly observable for an individual firm, we use a trans-log total cost function27
to estimate the parameters of the cost function and use them to derive the marginal costs.

𝛽
ln 𝐶 , 𝛽 𝛽 ln 𝑄 , ln 𝑄 , 𝛾 ln 𝑊 ,, 𝛿 ln 𝑄 , ln 𝑊 ,,
2

1 𝜓
𝜙 ln 𝑊 ,, ln 𝑊 , , 𝜓 𝑇 𝑇 𝜃 𝑇 ln 𝑊 ,, Γ𝑋 , 𝜀, 18
2 2

where 𝐶 , is the total cost (or expenses) for firm 𝑖 at time 𝑡; 𝑄 , is total assets, a proxy for bank output;
𝑊 ,, 𝑘 1,2,3 are input prices reflecting labor costs, funding costs, and operational costs, respectively.

24
Data is downloadable from their data API.
25
Samples are winsorized at 1 percent and 99 percent to filter out outliers.
26
The term “firm” indicates banks, non-banks, and DeFi platforms.
27
The translog cost function is frequently used in the banking literature, providing an estimation for the marginal cost of production
for banks (Weill 2013).

IMF | April 2022 13


Global Financial Stability Report Chapter 3

For input prices, we proxy labor costs using the ratio of personnel expenses to total assets, funding costs
using the ratio of interest expenses to total liabilities, and operational costs using the ratio of other
operational expenses to total assets.28 𝑇 is a time trend to capture technological changes. 𝑋 , are the
firm-characteristic control variables, such as equity to asset ratio and share of non-interest operational
income. The cost function is estimated using a panel regression with fixed cross-section effects and
clustered errors for each country.

The marginal cost can be derived by differentiating the cost function as follows:

𝜕𝐶 , 𝜕 ln 𝐶 , 𝐶 , 𝐶,
𝑀𝐶 , ⋅ 𝛽 𝛽 ln 𝑄 , 𝛿 ln 𝑊 ,, ⋅ 19
𝜕𝑄 , 𝜕 ln 𝑄 , 𝑄 , 𝑄,

The margin and marginal costs are averaged over time for each firm 𝑖, weighted by revenues, and we
chose the average value for each country. As Berger, Klapper, and Turk-Ariss (2009) pointed out, these
estimates do not capture risk premia in the prices of firms’ products or services.

Cost Decomposition
The share of each input cost to total costs can be calculated as follows. We assume the total cost is
composed of three input factors, 𝐹 𝑘 1,2,3 , as follows:

C 𝑊𝐹 𝑊𝐹 𝑊 𝐹 20

then the share of input factor 𝑆 can be derived with a simple transformation:

𝐹 𝜕𝐶 ⁄𝜕𝑊 𝜕 ln 𝐶 𝐶 1 𝜕 ln 𝐶
𝑆 𝑊 𝑊 𝑊
C 𝐶 𝜕 ln 𝑊 𝑊 C 𝜕 ln 𝑊

𝛾 𝛿 ln 𝑄 𝜙 ln 𝑊 21

where 𝛾 , 𝛿 , 𝜙 are the coefficients from the estimation of the total cost function shown above.

28
Other operational expenses are defined as non-staff related operating expenses incurred through the normal course of business,
which includes, depreciation, amortization, administrative expenses, occupancy costs, software costs, operating lease rentals,
audit and professional fees, and other operating expenses of an administrative nature.

IMF | April 2022 14


Global Financial Stability Report Chapter 3

References
Berger, Allen N., Leora F. Klapper, and Rima Turk-Ariss. 2009. “Bank Competition and Financial Stability:
An Empirical Evaluation.” Journal of Financial Services Research 35: 99–118.
Black, Fisher, and John C. Cox, 1976. “Valuing Corporate Securities: Some Effects of Bond Indenture
Provisions.” Journal of Finance, Vol. 31, no. 2, (May 1976): 351-367.
Buchak, Greg, Gregor Matvos, Tomasz Piskorski, and Amit Seru. 2018. "Fintech, Regulatory Arbitrage,
and the Rise of Shadow Banks." Journal of Financial Economics 130 (3): 453-483.
Chainalysis. 2021. “The Chainalysis 2021 Crypto Crime Report” (https://go.chainalysis.com/2021-Crypto-
Crime-Report.html).
Clerc, Laurent, Arthur Moraglia and Sylvain Perron. 2020. “Neobanks: Stakeholders Seeking Profitability”.
ACPR Analyses et syntheses, No 113 – 2020. Study carried out by the Research and Risk
Analysis Directorate and the FinTech Unit of the Autorité de Contrôle Prudentiel et de Résolution
(ACPR) at the Banque de France. 2020_etude_neobanques_en.pdf (banque-france.fr).
CryptoSec.info. 2022. “DeFi Exploit Timeline” (https://cryptosec.info/defi-hacks/) accessed on January 16,
2022.
Exton Research. 2021. “Report Neobanks 2021: Shifting from Growth to Profitability”. Report-Neobanks-
2021.pdf (extonconsulting.com).
Freixas, Xavier and Jean-Charles Rochet. 1997. “The Microeconomics of Banking”, MIT Press,
Cambridge MA.
Goldman Sachs. 2022. “Nu Holdings: Leading Fintech in LatAm Well Positioned to Deliver Long-Term
Growth”. January 3, 2022.
Hu, Albert, Christine A. Parlour, and Uday Rajan. 2018. “Cryptocurrencies: Stylized Facts on a New
Investible Instrument.” Working paper, University of California, Berkeley.
Immunefi. 2021. “Immunefi Crypto Losses Report” (https://immunefi.com/immunefi-crypto-losses-report-
2021/)
Jagtiani, Julapa, Lauren Lambie-Hanson, and Timothy Lambie-Hanson. 2021. "Fintech Lending and
Mortgage Credit Access." The Journal of FinTech 1 (1): 2050004
Liu, Yukun, Aleh Tsyvinski, Xi Wu. 2022. “Common Risk Factors in Cryptocurrency,” Journal of Finance,
Accepted Author Manuscript. https://doi.org/10.1111/jofi.13119.
Merton, Robert C. 1974. "On the Pricing of Corporate Debt: The Risk Structure of Interest Rates." Journal
of Finance Vol. 29, no. 2, (May 1974): 449-470.
Microsoft, Linklaters and Accenture. 2019. "Open Banking: A Shared Opportunity." RE489V8
(microsoft.com)
Morgan Stanley. 2021. “Kakao Bank Corp: Mechanics, Motor & Mettle to Lead Digital Banking”. October
11, 2021.
NeoBanks.app (NeoBanks.app: List of 247 neobanks and digital banks in the world - January 2022)
accessed on January 23, 2022.
Rekt. 2022. “leaderboard” (https://rekt.news/leaderboard/) accessed on January 16, 2022.
S&P Capital IQ. 2021. “Strong Interest Income Tips the Scale for Digital Banks in Europe, Asia”.
November 10, 2021.
Weill, Laurent. 2013. “Bank Competition in the EU: How Has it Evolved?” Journal of International
Financial Markets, Institutions and Money, 26:100–112
WhiteSight. 2021. “Back to The Future: The Evolution of Branchless Banking”. October 6, 2021. Back To
The Future: The Evolution of Branchless Banking (whitesight.net)

IMF | April 2022 15

You might also like