Can Google Trends Improve Housing Market Forecasts
Can Google Trends Improve Housing Market Forecasts
ECONOMICS
We augment linear pricing models for the housing market commonly used in the
literature with Google trends data in order to assess whether or not crowd-
sourced search query data can improve the forecasting ability of the models. We
estimate both sets of models (excluding and including the search query data) in
order to assess statistical fit. We then compare various performance measures of
the augmented linear model’s out-of-sample, dynamic forecasts against a baseline
version. We find that augmenting the models to take advantage of the availability
of Google trend data does not significantly improve the forecasting performance
of the models.
Introduction
Prior to making a large purchase, many individuals perform online research.
This is certainly true for any individual interested in vacationing at a certain
locale, during a certain time of year. Taken to its logical next step, a purchaser
of a home would most certainly perform their due diligence in gathering as
much information as possible regarding the many different amenities of the
(potential) geographic location and the many dimensions of the home if a
candidate home (or set of homes) has been chosen. One of the least costly
methods of gathering information for anyone with access to the internet is the
search engine. In this paper, we exploit the vast availability of search query data
to find out if the online search behavior which preempts purchasing activity in
housing markets can improve the forecasting ability of econometric models.
We demonstrate that incorporating crowd-sourced online search query data
into models of the housing market does not improve their forecasting ability.
Relative forecasting ability here is the comparison of various performance
measures across models which include the search query data versus models
where the query data is absent in dynamic, out-of-sample forecasts.1
Google Trends is a freely available online tool provided by Google, which
has been in operation since 2006. It enables one to input search query terms
and it provides a series of data depicting the relative interest in that term by
other individuals making similar search queries. The reason we chose Google
as the search engine to provide this data is the dominating market share Google
1 For readers unfamiliar with dynamic forecasting, refer to the appendix for an explanation. Examples of time series forecast performance
measures include (R.M.S.D.) root mean squared deviation, (M.A.P.E.) mean absolute percentage error, etc.
Can Google Trends Improve Housing Market Forecasts?
Figure 1: Time series for the market shares of various search engines. The plot at the top corresponds to searches
conducted on a mobile device or a tablet, while the plot at the bottom corresponds to searches conducted on a desktop.
NOTE: the plot at the top (mobiles and tablets) has been logged, given the vast market share of Google. Source:
https://www.netmarketshare.com/
occupies in the search engine space. Figure 1 depicts the time series data for
Google’s search engine market share in comparison to a handful of other
popular search engines.
The plots in figure 1 demonstrate that the data we obtain from Google
Trends can be assumed approximately representative owing to the vast majority
of search queries on the internet performed using Google. Google’s dominance
in the market is most extreme for searches conducted from mobile devices and
from tablets. The difference between Google and the other search engines is so
vast, the top of figure 1 has been logged in order to allow for the ranking of
competing search engines.
For our readers who may not be familiar with what the information from
Google Trends looks like, as a simple example figure 2 shows the time series of
search intensity for the two 2016 presidential candidates during the time frame
leading up to the first debate.
July turned out to be a very good month for raising contributions for the
Trump campaign, while the same bode true for Clinton leading up to August.
A different (and perhaps more likely) explanation may be that the spikes in
online interest are tied to the uncovering of various scandals and the continual
search term (e.g. Hillary Clinton) is normalized across all of the computed data
points of this specific term (Hillary Clinton). Therefore, we see that although
all search terms are divided by the same base, the search index for every term is
not comparable across different terms.
The housing price data we will be testing our models against is the S&P/
Case-Shiller 20-City Composite Home Price Index provided by FRED. We
then take the Metropolitan Statistical Area (MSA) city-level data on housing
sales available on Zillow’s data page and weigh the data in accordance with
the weighing scheme outlined by the Case-Shiller 20 city housing index.2 We
follow the same steps in creating the series for online search queries: Google
provides the trend data at the city-level and this allows us to to create a national
data series on Google Trends which is consistent with the weighing of our sales
data and the Case-Shiller index. Since Google Trends data is available from
January 2004 through 2016, all of our data series begin on January 2004. In
all of the empirical exercises, we split the dataset into a training period and a
(out of sample) forecast period. All of the forecasts are performed out of sample
as dynamic, one-step-ahead forecasts and evaluated using a root mean-squared
deviation metric.
We estimate a baseline linear model following previous studies in the
literature. We then re-estimate the model including the Google Trend data.
We find that adding the Google Trend data as an explanatory variable in our
baseline training/testing period insignificantly improved the performance of
the forecast, decreasing the root mean squared deviation from 6.7576 to 6.7483
(decrease of 0.14%) and decreasing the mean absolute percentage error from
96.201 to 94.565. In a robustness check, repeating the exercise with a shorter
and longer training window both deteriorate the forecast performance with the
RMSD increasing by 0.314% and the MAPE increasing from 109.75 to 111.23
for the shorter training window and the RMSD increasing by 1.425% and the
MAPE increasing from 158.68 to 161.14 for the longer training window.
Related literature
This paper builds upon and contributes to a growing body of work which
seeks to exploit and/or investigate the predictive power of online search queries
in forecasting the future values of various assets.
Beracha & Wintoki (2013) examine whether abnormal housing price
changes in a city can be predicted by abnormal search intensity for real estate
terms in that specific city. By arguing that search intensity for real estate terms
for a specific city can be treated as a proxy for housing market sentiment
for that city, Beracha and Wintoki run a regression of abnormal home price
changes on the lagged abnormal search intensity for the search terms “real
estate” and “rent” for each Metropolitan Statistical Area (MSA) in their
sample. They discover that, by using search intensity for real estate terms as a
2 The data were freely available up until 2017, when Zillow changed their policies regarding the availability of their data. The data is no longer
free; we thus chose to only include the data in this study that were freely available at the time. As a result, our raw data files span 2004 - 2016.
proxy for housing sentiment, abnormal search intensity for real estate terms in
a specific city can help predict abnormal future housing price changes. They
also find that the predictions for cities with abnormally high search intensity
outperform the predictions for cities with abnormally low real estate search
volume by as much as 8.5% over a two-year period.
Askitas (2016) creates a buyer-seller index based on the ratio of Google
Trends home “buy” queries to home “sell” queries. The index correlates with
the SP/Case-Shiller Home Price Index and makes for a fairly decent method to
now-cast home prices. Our focus is a bit different as we are testing the out-of-
sample forecasting ability with and without online query data and we are also
including both past prices and actual housing sales volumes provided by Zillow
as independent variables in our models.
Vosen & Schmidt (2011) intend to check how indicators from Google
Trend search terms perform when compared to two traditional survey-based
indicators, the University of Michigan Consumer Sentiment Index (MCSI)
and the Conference Board Consumer Confidence Index (CCI), in forecasting
private consumption. Vosen & Schmidt (2011) form a baseline model with
MA(1) process and augment the model by adding the MCSI, CCI and Google
factors into the model. They then conduct both in-sample and out-of-sample
forecasting and find consistent results in supporting Google data as the
superior data source for predicting private consumption during their testing
period from January 2005 to September 2009.
Finally, additional empirical papers which demonstrate some evidence of
the fore- and now-casting power of Google data for specific macroeconomic
variables include D’Amuri & Marcucci (2012) (forecasting the unemployment
rate), Coble & Pincheira (2017) (now-casting building permits) and Nymand-
Andersen & Pantelidis (2018) (now-casting car sales).
Empirical methodology
We estimate a simple set of time series regressions (including and excluding
the Google data) and then perform an out-of-sample forecast. We then
compare the out-of-sample forecasts with the actual Case-Shiller HPI. We then
calculate and provide several measures of forecast performance statistics to
compare the models with. For all statistical exercises, the training block spans
August of 2004 through December of 2012; we will be testing the forecasting
ability of the resulting estimated models on the remaining data which spans
January of 2013 through August of 2016. For robustness, we repeat the
exercise using a smaller (August 2004 – July 2010) and larger (August 2004 –
July 2014) training period.3
The 20 U.S. cities and their corresponding index weights for the Case-Shiller 20 city house price index. This study utilizes the 2010
weighing.
Data description
Case-Shiller Home Price Index
Based on the work of Karl Case, Robert Shiller and Allan Weiss, The S&P
CoreLogic Case-Shiller Home Price Index Series are “the leading measures of
U.S. residential real estate prices, tracking changes in the value of residential
real estate both nationally as well as in 20 metropolitan regions.”4 The three
major indices included in the series are Case-Shiller 20-City Composite Home
Price Index, the Case-Shiller 10-City Composite Home Price Index, and the
Case-Shiller U.S. National Home Price Index. For the purpose of our research,
which focuses on the housing market at the national level, we select the Case-
Shiller 20-City Composite Home Price Index for its coverage of 20 significant
housing markets in the U.S. at the city level.
A detailed list of the 20 cities included in the composite is presented in table
1.
In order to create the Google search query index that we use to conduct our
study, we weigh the city-level Google search query data using the exact same
weights as the Case-Shiller. Again, our aim is to remain as consistent as possible
when marrying the housing and search query data.
4 http://us.spindices.com/index-family/real-estate/sp-corelogic-case-shiller
Table 2: Unit-root test results for training data period (2004:08 - 2012:12)
ADF and KPSS unit-root tests for all the variables considered in the estimation. All tests conducted with 12 lags as frequency of data is
monthly.
5 http://www.zillow.com/corp/About.htm
6 An HP-filtered set of time series data is not appropriate for this type of analysis; since the HP-filter is two-sided and this is an exercise in testing
forecasting ability, at the forecast point the filter would rely on data which hasn’t arrived.
7 ADF is the acronym for the Augmented Dickey-Fuller test and KPSS is the acronym for the Kwiatkowski–Phillips–Schmidt–Shin test.
Estimation
We proceed by breaking the data set into a training block and a testing
block. The training set will cover August of 2004 through December of 2012,
and the testing data will span the remainder of the set from January 2013
through August of 2016.8 We will be formulating an AR model for the Case-
Shiller HPI, which also has differenced (Case-Shiller weighted) housing sales
and differenced (Case-Shiller weighted) Google search query data as regressors.
In order to establish a benchmark, we initially estimate a model excluding the
search query data.
Without Google
Following a battery of various combinations, the model we chose as “best”9
is
8 Robustness results using smaller and larger training windows provided in the Appendix.
9 The model we chose also takes on the same structure as in Wu & Brynjolfsson (2015).
10 For example, it is well known that housing prices exhibit a fair amount of persistence, and housing sales typically contribute to housing price
inflation with a slight lag; the resultant model fits this prior.
11 While city-wide indexes (similar to ours in the current study) exhibit persistence, forecasting excess returns at the individual house level remains
difficult due to the relatively high level of noise in individual prices.
Table 3: Linear model for Case-Shiller price index excluding search query data, using observations 2005:08–2012:12 ( = 89)
Dependent variable:
Standard errors based on Hessian
Baseline model estimates excluding Google Trend explanatory variables. Note that this is for the training period. In order to test the
forecasting capabilities, we will follow this with an estimation of a training period’s worth of observations, followed by an out-of-sample
forecast.
With Google
We follow the same procedure in isolating best econometric model when
incorporating the Google Trend data as we did in choosing the model without
the search query data (The model given by equation (2)). The resultant
regression specification is
12 According to homes.com, for the average home buyer, the process from shopping to first mortgage payment can take anywhere from 2 to 6
months. This doesn’t take into account the early online research stages conducted by most potential buyers.
Table 4: Linear model for Case-Shiller price index including search query data, using observations 2005:08–2012:12 ( = 89)
Dependent variable:
Standard errors based on Hessian
Baseline model estimates including Google Trend independent variables. Note that this is for the training period. In order to test the
forecasting capabilities, we will follow this with an estimation of a training period’s worth of observations, followed by an out-of-sample
forecast.
Discussion of results
Two meaningful statistical measures presented by table 5 are the Root Mean
Squared Deviation and the Mean Absolute Percentage Error. The
root mean squared deviation , is defined as
Figure 3: Plot at the top illustrates the forecast for the Case-Shiller weighed housing price data in a model excluding the
search query data, while the plot at the bottom depicts the forecast including the search query data.
Both forecasts are plotted against the actual housing data. The data shown begin in 2009 while the forecasts commence January of 2013
(the testing period). Tail end of the Great Recession is banded in gray.
Out-sample-forecast evaluation statistics for both linear models (2) (excluding Google Trends) and (3) (including Google Trends).
Submitted: November 30, 2020 MST, Accepted: March 01, 2021 MST
This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0
International License (CCBY-SA-4.0). View this license’s legal deed at https://creativecommons.org/
licenses/by-sa/4.0 and legal code at https://creativecommons.org/licenses/by-sa/4.0/legalcode for more
information.
References
Askitas, N. (2016). Trend-spotting in the housing market. Cityscape - A Journal of Policy Development
and Research, 18(3), 185–198. https://www.huduser.gov/portal/periodicals/cityscpe/vol18num2/
ch8.pdf
Beracha, E., & Wintoki, B. (2013). Forecasting Residential Real Estate Price Changes from Online
Search Activity. Journal of Real Estate Research, 35(3), 283–312. http://EconPapers.repec.org/
RePEc:jre:issued:v:35:n:3:2013:p:283-312
Case, K. E., & Shiller, R. J. (1989). The Efficiency of the Market for Single-Family Homes. The
American Economic Review, 79(1), 125–137. http://www.jstor.org/stable/1804778
Case, K. E., & Shiller, R. J. (1990). Forecasting Prices and Excess Returns in the Housing Market.
Real Estate Economics, 18(3), 253–273. https://doi.org/10.1111/1540-6229.00521
Choi, H., & Varian, H. (2012). Predicting the Present with Google Trends. Economic Record, 88, 2–9.
https://doi.org/10.1111/j.1475-4932.2012.00809.x
Coble, D., & Pincheira, P. (2017). Nowcasting Building Permits with Google Trends (MPRA Paper
No. 76514). University Library of Munich, Germany. https://ideas.repec.org/p/pra/mprapa/
76514.html
D’Amuri, F., & Marcucci, J. (2012). The predictive power of Google searches in forecasting
unemployment (Temi Di Discussione (Economic Working Papers) No. 891). Bank of Italy,
Economic Research and International Relations Area. https://EconPapers.repec.org/
RePEc:bdi:wptemi:td_891_12
Hjalmarsson, E., & Hjalmarsson, R. (2009). Efficiency in housing markets: Which home buyers know
how to discount? Journal of Banking & Finance, 33(11), 2150–2163. https://ideas.repec.org/a/eee/
jbfina/v33y2009i11p2150-2163.html
Nymand-Andersen, P., & Pantelidis, E. (2018). Google econometrics: nowcasting Euro area car sales and
big data quality requirements (Statistics Working Paper No. 30). European Central Bank.
https://www.ecb.europa.eu/pub/pdf/scpsps/
ecb.sps30.en.pdf?21f8c889572bc4448f92acbfe4d486af
Vosen, S., & Schmidt, T. (2011). Forecasting private consumption: survey-based indicators vs. Google
trends. Journal of Forecasting, 30(6), 565–578. https://doi.org/10.1002/for.1213
Wickens, M. (2014). How Useful are DSGE Macroeconomic Models for Forecasting? Open Economies
Review, 25(1), 171–193. https://doi.org/10.1007/s11079-013-9304-6
Wu, L., & Brynjolfsson, E. (2015). The Future of Prediction: How Google Searches Foreshadow
Housing Prices and Sales. In Economic Analysis of the Digital Economy (pp. 89–118). University of
Chicago Press. https://doi.org/10.7208/chicago/9780226206981.003.0003
Appendix
Table 6: Shorter training period model excluding search query data, using observations 2005:08–2010:07 ( = 60)
Dependent variable:
Standard errors based on Hessian
Regression results for the ARIMA model, excluding the search query data, estimated on the smaller training set.
Dynamic forecasting
If we have the following sample AR(1) model
Now assume that only the data up until period is available so that we
estimate
Then we can take our estimate for from (7) and feed this in for in (6).
The result is what we refer to as a dynamic forecast.
Shorter training window
Tables 6 and 7 show the regression results for two other combinations of
training/forecasting periods.
Table 8 contains out-of-sample performance measures for two other
combinations of training/forecasting periods.
Longer training window
Table 11 contains out-of-sample performance measures for a longer
training/shorter forecasting period.
Table 7: Shorter training period model including search query data, using observations 2005:08–2010:07 ( = 60)
Dependent variable:
Standard errors based on Hessian
Regression results for the ARIMA model, including the search query data, estimated on the smaller training set.
Forecast results for both models, estimated on the smaller training set.
Table 9: Longer training period model excluding search query data, using observations 2005:08–2014:07 ( = 108)
Dependent variable:
Standard errors based on Hessian
Regression results for the ARIMA model, excluding the search query data, estimated on the larger training set.
Table 10: Longer training period model including search query data, using observations 2005:08–2014:07 ( = 108)
Dependent variable:
Standard errors based on Hessian
Regression results for the ARIMA model, including the search query data, estimated on the larger training set.
Table 11: Forecast Evaluation Statistics for Longer Training Period Models
Forecast results for both models, estimated on the larger training set.