100% of statisticians would say this is a terrible method for predicting elections. However, in the case of 2016βs presidential election, analyzing the geographic search volume of a few telling keywords βpredictedβ the outcome more accurately than Nate Silver himself.
The 2016 US Presidential Election was a nail-biter, and many of us followed along with the famed statisticianβs predictions in real time on FiveThirtyEight.com. Silverβs predictions, though more accurate than many, were still disrupted by the election results.
In an effort to better understand our country (and current political chaos), I dove into keyword research state-by-state searching for insights. Keywords can be powerful indicators of intent, thought, and behavior. What keyword searches might indicate a personal political opinion? Might there be a common denominator search among people with the same political beliefs?
Itβs generally agreed that Fox News leans to the right and CNN leans to the left. And if weβve learned anything this past year, itβs that the news you consume can have a strong impact on what you believe, in addition to the confirmation bias already present in seeking out particular sources of information.
My crazy idea: What if Republican states showed more βfox newsβ searches than βcnnβ? What if those searches revealed a bias and an intent that exit polling seemed to obscure?
The limitations to this research were pretty obvious. Watching Fox News or CNN doesnβt necessarily correlate with voter behavior, but could it be a better indicator than the polls? My research says yes. I researched other media outlets as well, but the top two ideologically opposed news sources β in any of the 50 states β were consistently Fox News and CNN.
Using Google Keyword Planner (connected to a high-paying Adwords account to view the most accurate/non-bucketed data), I evaluated each state’s search volume for βfox newsβ and βcnn.β
Eight states showed the exact same search volumes for both. Excluding those from my initial test, my results accurately predicted 42/42 of the 2016 presidential state outcomes including North Carolina and Wisconsin (which Silver mis-predicted). Interestingly, “cnn” even mirrored Hillary Clinton, similarly winning the popular vote (25,633,333 vs. 23,675,000 average monthly search volume for the United States).
In contrast, Nate Silver accurately predicted 45/50 states using a statistical methodology based on polling results.
This gets even more interesting:
The eight states showing the same average monthly search volume for both βcnnβ and βfox newsβ are Arizona, Florida, Michigan, Nevada, New Mexico, Ohio, Pennsylvania, and Texas.
However, I was able to dive deeper via GrepWords API (a keyword research tool that actually powers Keyword Explorer’s data), to discover that Arizona, Nevada, New Mexico, Pennsylvania, and Ohio each have slightly different βcnnβ vs βfox newsβ search averages over the previous 12-month period. Those new search volume averages are:
βfox newsβ avg monthly search volume |
βcnnβ avg monthly search volume |
KWR Prediction |
2016 Vote |
|
---|---|---|---|---|
Arizona |
566333 |
518583 |
Trump |
Trump |
Nevada |
213833 |
214583 |
Hillary |
Hillary |
New Mexico |
138833 |
142916 |
Hillary |
Hillary |
Ohio |
845833 |
781083 |
Trump |
Trump |
Pennsylvania |
1030500 |
1063583 |
Hillary |
Trump |
Four out of five isnβt bad! This brought my new prediction up to 46/47.
Silver and I each got Pennsylvania wrong. The GrepWords API shows the average monthly search volume for βcnnβ was ~33,083 searches higher than βfox newsβ (to put that in perspective, thatβs ~0.26% of the stateβs population). This tight-knit keyword research theory is perfectly reflected in Trumpβs 48.2% win against Clintonβs 47.5%.
Nate Silver and I have very different day jobs, and he wouldnβt make many of these hasty generalizations. Any prediction method can be right a couple times. However, it got me thinking about the power of keyword research: how it can reveal searcher intent, predict behavior, and sometimes even defy the logic of things like statistics.
Itβs also easy to predict the past. What happens when we apply this model to today’s Senate race?
Can we apply this theory to Alabamaβs special election in the US Senate?
After completing the above research on a whim, I realized that weβre on the cusp of yet another hotly contested, extremely close election: the upcoming Alabama senate race, between controversy-laden Republican Roy Moore and Democratic challenger Doug Jones, fighting for a Senate seat that hasnβt been held by a Democrat since 1992.
I researched each Alabama county β 67 in total β for good measure. There are obviously a ton of variables at play. However, 52 out of the 67 counties (77.6%) 2016 presidential county votes are correctly βpredictedβ by my theory.
Even when giving the Democratic nominee more weight to the very low search volume counties (19 counties showed a search volume difference of less than 500), my numbers lean pretty far to the right (48/67 Republican counties):
It should be noted that my theory incorrectly guessed two of the five largest Alabama counties, Montgomery and Jefferson, which both voted Democrat in 2016.
Greene and Macon Counties should both vote Democrat; their very slight βcnnβ over βfox newsβ search volume is confirmed by their previous presidential election results.
I realize state elections are not won by county, theyβre won by popular vote, and the state of Alabama searches for βfox newsβ 204,000 more times a month than βcnnβ (to put that in perspective, thatβs around ~4.27% of Alabamaβs population).
All things aside and regardless of outcome, this was an interesting exploration into how keyword research can offer us a glimpse into popular opinion, future behavior, and search intent. What do you think? Any other predictions we could make to test this theory? What other keywords or factors would you look at? Let us know in the comments.
Also, if you’ve enjoyed this post, check out Sam Wang’s Google-Wide Association Studies! –Fascinating read.