Stories
Slash Boxes
Comments

News for nerds, stuff that matters

Slashdot Log In

Log In

Create Account  |  Retrieve Password

Examining Presidential Candidates Via Google Trends

Posted by timothy on Sat Jun 07, 2008 02:15 PM
from the correlation-!-causation dept.
Michael Giuffrida writes "Google Trends is a free application produced by Google that shows how often a given keyword is searched for, over time. After seeing how candidates in the 2008 primaries have done in Google Trends in different states, it's clear that this tool can be very useful for campaigns." Read on below for some of the specifics about how these candidates have fared, Google-wise.
"For example, in New Hampshire, in the days leading up to the Jan. 8 primary, Clinton was searched for the most, followed by Obama, followed by Edwards — which was how the primary results turned out. In other words, the candidates most searched for on Google by users in New Hampshire were also the candidates with the most number of votes. This works for many other states as well.

For the first 37 Democratic primaries and caucuses, 32 states had enough data on Google to make a prediction. This method correctly predicted 27 of those 32 sates. Predictions aside, the tool is also useful simply in finding out how popular a candidate is in different states, assuming that the more popular candidates are entered more often as a search term in Google (an assumption that was verified, at least for the Democratic primaries, by the positive association found)."
+ -
story

Related Stories

This discussion has been archived. No new comments can be posted.
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
 Full
 Abbreviated
 Hidden
More
Loading... please wait.
  • by RiotingPacifist (1228016) on Saturday June 07 2008, @02:19PM (#23695101)
    Sir, every time you mention bird rights and protection your page rank seams to rocket.
  • 27 out of 32 (Score:5, Insightful)

    by TubeSteak (669689) on Saturday June 07 2008, @02:22PM (#23695115) Journal
    Interesting, but I don't think it's that hard to beat an 84% accuracy rate with traditional methods.
    • Where is Ron Paul? (Score:5, Interesting)

      by bbagnall (608125) on Saturday June 07 2008, @03:03PM (#23695487) Homepage
      Why include Giuliani and Thompson and not Ron Paul? I would rather see Ron Paul than those other two who Ron Paul consistently beat. He was kicking butt in terms of Google searches.
      • by Ucklak (755284) on Saturday June 07 2008, @03:41PM (#23695741)
        That's the same thought I had. Ron Paul was all over the map from Google Trends, Facebook, MySpace, and even Digg where he still has more support than Obama.

        If anything, those trends didn't equate to votes which also means that this article is nothing more than a fluff piece and not to be taken seriously.
      • by jasonditz (597385) on Saturday June 07 2008, @11:31PM (#23698249) Homepage
        TFA says they deliberately excluded him because he was so popular on the internet and his search results don't correspond to actual votes.
        In other words... their examination works great, except when it doesn't. And in that case, we'll just not included that data in the final results.
        Wish I could've gotten away with that in college.
      • Because it's easy to predict the past via cherry picking. It's hard, if not impossible, to predict the future.
        • Re: (Score:2, Insightful)

          "No-one credible ever considered Paul to be a contender for the GOP nomination. He was the very model of an issue candidate."

          As opposed to one the current candidates that seems to have NO solid issues to base his run for president on?

          I'll leave it to you to figure who I'm speaking about.

            • by cayenne8 (626475) on Saturday June 07 2008, @09:03PM (#23697639) Homepage Journal
              "Everyone knows the ultimate manifestation of politics is "what you base your run" for Presidency on. I can't wait til the election is over so that people like you stop feeling entitled (or interested) to talk about politics.

              Eventually it will get off the front pages and your blather will seem, as usual, insubstantial. But as soon as words like "candidate" and "president" start to come into your post, your an expert."

              What makes you think it stops at the election? It keeps going just as strong after that. Politics is what makes ours (and most other) countries tick...everyone should keep their ears and eyes open to what our elected ones are doing....and constantly try to evaluate them as to their performance, and see if they need to be replaced the next cycle.

              There are always elections going on...Representatives, Senators, Governors, local councils, judges, police...DA's...Sheriffs...etc.

              It isn't like we only have things up for a vote every 4 years...there are multiple times to vote on a number of offices and issues locally and nationally every year.

        • Re: (Score:2, Insightful)

          Feeding the troll (nom nom nom)...

          Ron Paul was not a serious candidate at any stage of the campaign
          Not even enough to the fact he raised more money debt free and free from PAC than any other GOP candidate and broke 2 fundraising day records?

          Giuliani was leading in polls before the primaries
          Giuliani had Roger Ailes support in the media, no other candidate had that level of media support. Oprah's support of Obama didn't even match what Murdoch Media did for Guiliani.
    • Re:27 out of 32 (Score:5, Interesting)

      by cayenne8 (626475) on Saturday June 07 2008, @03:47PM (#23695791) Homepage Journal
      "Interesting, but I don't think it's that hard to beat an 84% accuracy rate with traditional methods."

      I think if you want to predict the presidential winner, you should go with the tried and true method....see which candidates halloween mask sells the most!! That has been an accurate predictor for decades now....It appears at least so far, now that they are tracking the masks throughout the primary season too, that Obama has the lead in the mask poll [topix.net] .

      I dunno...at this point, I figure dressing up as Obama or McCain would be equally as scary to most of us....

      :-)

      • BuyCostumes.com, the world's largest online retailer of halloween costumes, accessories and party supplies has 100% successfully predicted the next President of the United States since 2000 based solely on the sales of candidates' masks.
        About BUYSEASONS, Inc.
        Founded in 1999...

        I think maybe the parent was going for a Funny mod
        These last two Presidential elections were anything but normal.
      • Wow, they have a 100% prediction since 2000! It's almost like there were only 2 elections that they've guessed right! We can totally trust these guys.
      • Weekly Reader Poll (Score:4, Interesting)

        by OakDragon (885217) on Saturday June 07 2008, @05:59PM (#23696717) Journal
        Every election cycle, you can find stories like this [usatoday.com] about the Weekly Reader poll:

        Pundits come and pundits go, but one group has quietly predicted the winner of the presidential election every four years since the Eisenhower administration: kids.

        Most political junkies won't give it the time of day, but the Weekly Reader presidential poll of schoolchildren has pegged every winner since 1956.

        As far as informal polls go, this one is supposed to be one of the most accurate.
  • by Anonymous Coward on Saturday June 07 2008, @02:24PM (#23695123)
    ...without actually RTFA:

    "For the Republican Primaries, last names could easily be used. Ron Paul was excluded. His last name is too common. Using his full name is not a good solution either, because he had massive popularity on the Internet, becoming a meme of sorts, which did not at all correspond with his actual successes (or lack thereof) in the primaries."
    • by amccaf1 (813772) on Saturday June 07 2008, @03:15PM (#23695553)

      "For the Republican Primaries, last names could easily be used. Ron Paul was excluded. His last name is too common. Using his full name is not a good solution either, because he had massive popularity on the Internet, becoming a meme of sorts, which did not at all correspond with his actual successes (or lack thereof) in the primaries."
      Translation: Including Ron Paul would have indicated our hypothesis was incorrect, so we excluded him.
      • They're using a candidate's popularity in Google Trends as a measurement of the success of their campaign.

        For Ron Paul, his popularity on the internet has nothing to do with his real-life political success - as grandparent poster said, he's an internet meme. You think there's any correlation between "Ron Paul" jokes on forums and genuine interest in his campaign?

        Their "hypothesis" for the other candidates Google trends measures the success of their campaign. With Ron Paul, it's measuring something else entirely.

        • Re: (Score:2, Insightful)

          So, popularity on Google trends means that a candidate has succeeded in getting his/her message across, or that people are interested in what s/he has to say, or that people like forwarding e-mail jokes about the candidate, or that the candidate has reached some critical mass of Internet meme-ness, or some other undefined level of Internet interest.

          In other words, popularity on Google indicates popularity on Google. While I can't argue with the truth of that statement, I can quibble with its usefulness...
          • I just realized that I hadn't RTFA closely enough to realize that this report was actually someone's High School project, so I have to apologize for the unnecessary harshness in my tone there.
        • Still sounds like they're excluding data to make their hypothesis true. "Using the internet to predict who will win is great, unless the candidate is popular on the internet."
        • Re: (Score:3, Insightful)

          Doesn't that prove the point "Including Ron Paul would have indicated our hypothesis was incorrect, so we excluded him." It means that this method is not a reliable predictor of political success.

          On another note, listening to some of Ron Paul's speeches, he doesn't seem to have expected a realistic chance of winning the presidency. His campaign from early on seems to have been focussed on getting his message out and attempting to reform the republican party by packing it with libertarians. I will be inte
          • There are such things as statistical outliers and externalities.

            They're not talking about "zomg google popularity means they'll win!" They're saying, "Watch for your search graphs to spike after a speech to see if it was effective."

            Contrived example: Barack Obama makes a big speech about social security. If, suddenly, the number of searches for "Barack Obama Social Security" spikes, you could conclude that maybe you reached some people, generated some interest.

            Now, boys and girls, this is where we stop and think. This would work for Barack Obama because those searches track well with the speech he made. It would not work for Ron Paul given his status as an internet meme. The correlation between Ron Paul searches and events in his campaign is going to be just as weak as correlations between Chuck Norris and (actual) events in his life, or between lolcats and pet food product safety.

            I don't know how to make it any clearer. There is no "hypothesis." They have not framed this a statistical H0: Google doesn't control the elections and calculated a p-value. They haven't ignored data that would disprove some part of string theory. They're just saying this:

            Tracking search trends can be interesting for candidates. Less so for Ron Paul.

            • Never mind - as I read the rest of the FA, they are trying to predict elections.

              ^.^

              But still, my point stands - they point out that this isn't useful for Ron Paul, because interest in Ron Paul on the internet has little to do with his actual campaign. But, nobody as the same Chuck Norris-like admiration of Hillary, so her results are related to campaigns.

        • by laddiebuck (868690) on Saturday June 07 2008, @05:40PM (#23696601)
          So how do you know that say, searches for "Barack" are any more meaningful than searches for "Ron Paul"? No, the writer had a neat theory and got so attached to it that he discarded a bad data point instead of admitting it didn't work for Ron Paul.

          By including Ron Paul but at least being complete and honest, the analysis would have been more worthwhile than what it now is -- a pile of rubbish.
  • Not the whole story (Score:3, Interesting)

    by shma (863063) on Saturday June 07 2008, @02:24PM (#23695125)
    It seems their method does rather poorly when looking at republican primaries. Overall, they comment: "In reality, only about half of the "predictions" before the 2/12 primaries were actually accurate. "

    Still, one would expect logically that interest in a candidate is related to their poll numbers. But you need a better way to distinguish between negative interest and positive interest: how many people are searching "Obama AND Wright video" vs "Obama AND race speech"? With a more detailed model they could be on to something.
  • Old farts (Score:5, Insightful)

    by Hankapobe (1290722) on Saturday June 07 2008, @02:25PM (#23695131)
    Folks, especially you statisticians out there, what's wrong with these pictures?

    The sample: folks on the internet.

    Who's missing: folks who aren't on the internet; like old farts, poor people and Amish. As far as the old farts are concerned: they are the most well organized and the most vocal political group in this country. I think these charts reflect nothing.

    • It's not just poor, old and Amish. Who gets counted in this "poll"? The tech savvy, the people who practically live on the 'net, those that feel the urge to tell the whole world their political view.

      This is neither a majority nor a representative sample. I do predict an overwhelming victory for Obama according to those polls, even though I am almost certain McCain is going to win the election.
      • But these charts certainly do not reflect *nothing* -- just be careful what conclusions you draw.

        OK, so they reflect trends on the internet and folks who are active on the internet: nothing else.

        • Re: (Score:2, Insightful)

          But these charts certainly do not reflect *nothing* -- just be careful what conclusions you draw.

          OK, so they reflect trends on the internet and folks who are active on the internet: nothing else.


          Exactly. The author states several times that this ONLY looks at a small subset of the population, that this is NOT a proper statistical analysis. Oh, wait, this is /. Nobody here RTFA.
          • The author states several times that this ONLY looks at a small subset of the population, that this is NOT a proper statistical analysis. Oh, wait, this is /. Nobody here RTF

            You know what? I read TFA a few times. There is just vague references for not drawing conclusions, in other words; just CYA. They do not say anything specific. So, don't get all sanctimonious with me about reading the fucking article.

  • by Anonymous Coward on Saturday June 07 2008, @02:31PM (#23695199)
    There are still a lot of people without computers/internet/education that won't be using Google, thus you could seriously under-represent the base of a particular candidate. Dewey defeat Truman, anyone?
  • by jeske (7383) on Saturday June 07 2008, @02:44PM (#23695319) Homepage
    The author is quick to dismiss using "Obama" and "Hilary" as the indicator terms for fear that Obama's huge online campaign popularity is not representative of the true popularity of the two candiates.

    However, he makes no mention of the fact that "Obama" and "Hillary" are the most popularly used terms to refer to the candidates. Almost all candidates are refferred to primarily by their last names. Hilary is a special case where we use her first name because her husband was so recently president.

    His use of "Barack" and "Hilary" is about as statistically accurate as using "Barack" and "Rodham". Fortunatly, this inaccuracy is obviously visible in his numbers, because using his first-name method it quickly looks like Edwards might be a write-in candidate to rival them both.

    Please slashdot, stop posting braindead stories.
  • by MSTCrow5429 (642744) on Saturday June 07 2008, @02:46PM (#23695349)
    "For the Republican Primaries, last names could easily be used. Ron Paul was excluded. His last name is too common. Using his full name is not a good solution either, because he had massive popularity on the Internet, becoming a meme of sorts, which did not at all correspond with his actual successes (or lack thereof) in the primaries."

    So, in other words, any data that contradicts the hypothesis will be thrown out.

    • Re: (Score:2, Interesting)

      Good thing someone did compare Ron Paul to everyone else in Google Trends: http://www.youtube.com/watch?v=7iW5kOB1pmg [youtube.com]
    • That assessment isn't quite fair. One could make a reasonable argument that the Ron Paul campaign was one that would have biased any polling of this sort, appealing to the type of demographic that would google candidate's names and spend time look up this information. The Paul crowd was different from that which would search McCain or Huckabee, so avoid this distortion, the author left off Paul.

      That is to say, Ron Paul was a consuming cult that was able to organize a massive, yet still a niche, support grou
  • by mgiuffrida (1303757) on Saturday June 07 2008, @02:53PM (#23695401)
    Hey, guys, You're all right. There are serious statistical flaws here. I chose the keywords "hillary" and "barack" and "edwards" because that was the model that showed the best success. If one were actually predicting this live, he or she would soon see that "hillary" and "barack" do produce better results and would switch to those terms, simply because it fits the model better. Every election will have its outliers (Obama, Ron Paul). But hopefully this brief overview will provoke some campaign analysts to actually try to interpret Google Trends results for themselves, using more advanced models.
  • The choice of presidential candidates tells me is that it does not matter who gets in the WhiteHouse because there is something more powerful behind the scenes who is really pulling the strings, International bankers? (Builderburgers) multi-national corporations? BIG money.

    Most of the time conspiracy theorists sound whacko to me but sometimes they sure sound like they have a little insight most people are not aware of.

  • Does this mean that the next President of the United States will be Tila Tequila?

  • by yulek (202118) on Saturday June 07 2008, @03:44PM (#23695769) Homepage Journal
    i've been using google trends for several years to see what information people really need as they go about their daily lives.

    Global Warming vs. Paris Hilton [google.com]

    Global Warming vs. Iraq [google.com]

    (ignore the bottom chart, it is irrelevant to my study)
  • Basically I'm wondering "WTF is this supposed to be measuring?"

    • Only so called "Front runners" get mentioned as usual, the rest get practically censored.

      It may not be censorship. For some states, even front runners Clinton or Obama had very low volume. In Arkansas "there isn't any search volume until the day of the primary, so no predictions can be made" (go figure). Search volume for Gravel and Kucinich may not have been statistically significant.

      Also note that they excluded Paul from the Republican analysis because his popularity on the internet didn't translate to

      • The correct thing to do in that case would be to include the results and explain that certain factors (such as higher popularity on the internet) can lead to the results being incorrect. Not to pretend that a candidate who fared better than two of the five republicans they DID show never existed.
      • Re: (Score:2, Interesting)

        I did, in fact, do some research [tnr.com]. :)

        Had it been one article, I could buy that, but we're talking _years_ worth of racist venom in his newsletter. There wasn't a flood, but there was a steady enough stream that even the most laissez faire (rim shot) editor should have noticed it.

        And something from Anti War Radio and Prison Planet? Come on, there's got to be better material in his defense out there. At the risk of engaging in the ad hominem fallacy, that station is largely populated by complete lunatic