Big Data and Data Inequality: Research Is Just The Beginning

There was a recent hoo-ha at a scientific conference in France, when Bernardo Huberman was furious when researchers from Google and a contributing university presenting results of social data analysis declined to share the data.

John Markoff, Big Data Troves Stay Forbidden to Social Scientists via  NYTimes.com

The issue came to a boil last month at a scientific conference in Lyon, France, when three scientists from Google and the University of Cambridge declined to release data they had compiled for a paper on the popularity of YouTube videos in different countries.

The chairman of the conference panel — Bernardo A. Huberman, a physicist who directs the social computing group at HP Labs here — responded angrily. In the future, he said, the conference should not accept papers from authors who did not make their data public. He was greeted by applause from the audience.

In February, Dr. Huberman had published a letter in the journal Nature warning that privately held data was threatening the very basis of scientific research. “If another set of data does not validate results obtained with private data,” he asked, “how do we know if it is because they are not universal or the authors made a mistake?”

He added that corporate control of data could give preferential access to an elite group of scientists at the largest corporations. “If this trend continues,” he wrote, “we’ll see a small group of scientists with access to private data repositories enjoy an unfair amount of attention in the community at the expense of equally talented researchers whose only flaw is the lack of right ‘connections’ to private data.”

Facebook and Microsoft declined to comment on the issue. Hal Varian, Google’s chief economist, said he sympathized with the idea of open data but added that the privacy issues were significant.

“This is one of the reasons the general pattern at Google is to try to release data to everyone or no one,” he said. “I have been working to get companies to release more data about their industries. The idea is that you can provide proprietary data aggregated in a way that poses no threats to privacy.”

The debate will only intensify as large companies with deep pockets do more research about their users. “In the Internet era,” said Andreas Weigend, a physicist and former chief scientist at Amazon, “research has moved out of the universities to the Googles, Amazons and Facebooks of the world.”

And of course, big data is worth big money — leaving aside the privacy concerns — and controlling access to that data is central to the aspirations of companies like Google, Facebook, and others.

Research is just the first place where the latent data inequality of the post normal world will come to light. We will each of us — as individuals — be divided from the inherent value of information about our activities and the inferences that can be made about them. As a society, we will find corporations that do not have our interests at heart working to exploit the potential value of our aggregated data exhaust. We are an exploitable resource — like the oceans of fish or the oil beneath the ground — and these companies plan to harvest all the value without our involvement.

We will find that we don’t own the information about ourselves anymore than we own our DNA. (Yes, others can patent your genes: see The Tissue-Industrial Complex.)

The New York Times

Notes

  1. starwatchingsoulgrowing reblogged this from wildcat2030
  2. girlengineer reblogged this from stoweboyd
  3. jordankeats reblogged this from emergentfutures
  4. mygenerate reblogged this from emergentfutures
  5. thenagainmaybeitsme reblogged this from emergentfutures
  6. emergentfutures reblogged this from futuramb
  7. mnzaman reblogged this from futuramb
  8. jingbeipei reblogged this from stoweboyd
  9. robanhk reblogged this from stoweboyd
  10. ahandsomestark reblogged this from stoweboyd
  11. rasiej reblogged this from stoweboyd
  12. wtfai reblogged this from stoweboyd
  13. This was featured in #Tech
  14. kuon09reblog reblogged this from gtokio
  15. singular1 reblogged this from stoweboyd
  16. utonesm reblogged this from futuramb
  17. thehelplessmind reblogged this from stoweboyd
  18. futuramb reblogged this from stoweboyd and added:
    This is really an extremely important issue for the future!!
  19. alnator reblogged this from stoweboyd
  20. do-nothing reblogged this from stoweboyd
  21. gtokio reblogged this from stoweboyd
  22. stackfm reblogged this from stoweboyd and added:
    Data. Who creates it, who owns it, and who abuses it. Personally, i hate giving my data to corporations that mine it and...
  23. wildcat2030 reblogged this from stoweboyd
  24. This was featured in #Science
  25. crazygm reblogged this from stoweboyd
  26. stoweboyd posted this

← Previous Post Next Post →

About

Web anthropologist, futurist, author. My focus is the future, and the tectonic forces pushing business, media, and society into an unclear and accelerating future. more.

Working on longer format projects, Sign up for the newsletter.

GigaOM Research analyst and curator.

Also writing beaconstreets.com.

Contact me. or ask me a question.



My Vizify profile.

Socialogy

  • John Hagel | John offers up some great insights, like the fact that passion is lower the larger that businesses get.

  • Euan Semple | A chat with my old pal, and the author of Organizations Don't Tweet, People Do

  • Will McInnes | The author of Culture Shock and managing director of Nixon/McInnes

  • Jennifer Magnolfi | An interview with the woman who said, 'Work is not a place you go, it's a thing you do'.

  • Hot Now

  • What Drives Us? | A draft chapter of my book, discussing motivations, Maslow's hierarchy, and fluidarity.

  • Socialogy: Interview With John Hagel | I Speak with Joh Hagel about the innovation at the edge.

  • Complex organisation arises from webs of interaction among causal factors | So, it turns out that DNA is, in fact, a great metaphor for business culture, but only after you realize that DNA is not a few hundred off-on switches, but instead a universe of unknowable complexities, that we can interact with, and understand at some abstract cartoonish level, but not control, and never fully comprehend.

  • Bitcoin May Be the Global Economy’s Last Safe Haven | Paul Ford

  • Innovators Get Better With Age | Companies make a mistake by relying too much on the innoations of the young, because Nobel laureats don't come into their prime until their 50s.

  • Oldie

  • Infodemics | 2009 | Passing incomplete or inaccurate information about some risk event can make people take actions that increase the damage of the event itself.