Picture of author.
16 Works 1,175 Members 33 Reviews 1 Favorited

About the Author

Works by Viktor Mayer-Schonberger

Tagged

Common Knowledge

Members

Reviews

38 reviews
[B]ig data is about three major shifts of mindset that are interlinked and hence reinforce one another. The first is the ability to analyze vast amounts of data about a topic rather than be forced to settle for smaller sets. The second is a willingness to embrace data’s real-world messiness rather than privilege exactitude. The third is a growing respect for correlations rather than a continuing quest for elusive causality.

That’s “big data” the concept, to which my reactions were, show more respectively, bogglement, disagreement, and suspicion. And then there’s Big Data the book, wherein the authors unpacked their ideas and transformed mine.

First, about the mind-numbing amount of data, coming from everywhere -- Google and Facebook and public surveillance cameras for sure, but suffice it to say that everything electronic is gathering data, and everything that connects to the Internet is uploading the data to someone. And about the format of data, which has morphed from 75% analog in 2000 to 93% digital in 2007 (estimated to be >98% in 2013). Second, that the tidy, structured data of relational databases is now miniscule (estimated at 5%) compared with the as-yet untapped, error-ridden stuff of real life, like blogs and video. And third, that conceiving hypotheses, gathering perfect, representative data, and reaching causal conclusions is nowhere near as valuable or timely as finding correlations (the “what, not why”) in a gigantic mess of data. The authors characterize big data as, “the equivalent of impressionist painting, wherein each stroke is messy when examined up close, but by stepping back one can see a majestic picture.” Fascinating!

Then they address the problems of big data and, unlike most “alarmist” book I’ve read, they propose solutions. They advise that the ship has sailed on individuals being in control of their private information and online footprints (e.g. via opting out or being anonymous), especially with the secondary and tertiary (and quaternary, and...) markets that re-analyze data long after it’s been collected. So they suggest that the data users be held accountable through law/regulation similar to what’s in place for other industries that hold potential for public harm. They suggest a new professional -- a “data scientist” or “algorithmist” -- who isn’t the do-er who queries big data but rather the outside-the-lines thinker with a big-data mindset who “peers into databases to make a discovery” that creates new value. And they caution against “what’s-past-is-prologue” thinking -- where personal history and the statistics of correlation drive everything from basing your credit score upon the credit scores of your Facebook friends, to Minority Report-like “predictive policing” -- arguing instead for safeguards that recognize free will and actual behaviors.

Here is a book with the awe I’ve been seeking! I turned every page with excitement about what would be on the next page. There’s some repetition, but it’s usually with a twist that enhances internalization and recollection, and there are dozens of fascinating business examples along the way. It’s optimistic not alarmist; rather than running to find a doomsday hidey-hole, I came away transformed. It’s the best book I’ve read so far this year.

(Review based on an advance reading copy provided by the publisher.)
show less
½
Viktor Mayer-Schonberger published his book, "Delete," in 2009 when digital surveillance was nowhere near the current level. Those were heady days when most people jumped on the digital bandwagon without bothering about privacy, data protection, or the dangers of the cyber world.
This book's lessons are valid today, even though fifteen years have passed since its first publication. I grew up as a proud member of, possibly, the last analog generation, and forgetting was part of our culture. show more
Viktor Mayer-Schonberger starts the book by stating that forgetting is part of human nature. Remembering events requires effort, hence the widespread use of phrases like, 'dredging up old memories.' Words like 'dredging' connote effort and imply energy expenditure. We remember when we needed to. In his book, the author quoted a few cases of people with perfect memories, preventing such people from living in the present.
If humans are a bundle of memories, no one should wish to have perfect memory. However, machines do not forget in the digital age, and with their massive computing and file retrieval capabilities, the past can bite hard.
As Viktor Mayer-Schonberger argues, why shouldn't we advocate and demand that the digital world forget our past? He analyzed six approaches in his book and acknowledged that no single approach is perfect as a standalone solution. He also recognized that the regulatory world must move fast, or else it will be impossible to keep pace with the rapid changes in the digital sphere.
Finally, he recognized that the user's responsibility for managing their privacy, removing traces, and being circumspect while living in the digital universe.
With virtual and augmented reality becoming more commonplace, and AI technology flooding our lives, a second edition will be required. The challenges we will encounter due to these technologies is on top of the issues we faced a decade ago.
show less
Big data, one of recent years' new buzzwords, has now gotten itself a book with said title. Mayer-Schönberger and Cukier's "Big data: a revolution that will transform how we live, work and think" focuses mostly on what businesses can do with big data, and you ain't gonna find no much material as a technological-oriented data scientist. The book is from 2013 and already seems dated in the light of the Snowdon revelations. The authors critique of personal big data collection does not mentions show more the dragnet operations of signal intelligence agencies besides an 8-line William Binney-paragraph.

The authors claim three features of big data ("three major shifts of mindset"): "More", messy and correlation rather than causality. I am not entirely convinced that these features distinguish big data. Interventional A/B-testing seems at least to some degree to probe causality rather than just correlation. Such tests are continuously done by major Internet companies on unsuspecting users on large scale. Thus I would say big data processing is indeed probing causality. I neither agree that the big data is more messy than old-time small data. Anyone working seriously with small data may easily find the handling of such data can be a considerable headache and require some processing and 'understanding'. Indeed big data technologies have brought us means for handling messy data in a more structured way (JSON, NoSQL, Semantic Web, Wikidata). The reason why small data may feel less messy could be because the clean-up of small data can be done manually in a spreadsheet by a non-programmer, while for big data you need automatic tools and probably a programmer.

The authors also claim that we will see a rise in the profession called 'the algorithmist' whose job it will be to review algorithms. I do not think this is likely. The closest will probably get is the Google advisor board on the 'right to be forgotten'.

The authors also fails to give us a proper critique of big data hype: Their initial example on Google Flu Trends is dated: A publication from March 2014 shows a wrong flu prevalence estimation from Google Flu Trends (see 'The Parable of Google Flu: Traps in Big Data Analysis'). The Zeo EEG big data ZEO mentioned in the book hailed back in 2013 as one of the "8 Best Sleep Tracking Apps and Devices" has run out of money, is 'out of business' and you won't find a response from www.myzeo.com.

While the authors tell us that companies collect vast amount of data and that "Companies may be powerful" they ensure us on page 156 that the companies "don't have the state's powers to coerce". Well, yes. But the states have the ability to coerce the company to hand over any personal data. Indeed U.S. companies are coerced to hand over oversea data. Loretta A. Preska of the United States District Court told that to Microsoft. And within the U.S. PRISM program the handover is determined in secret FISA courts.

But in general there is a good allround discussion of the issues of Big Data, e.g., the notion of "collect everything" for not necessarily a predefined purpose. It will be interesting to have the opinions of the authors in light of the Snowdon issues.
show less
The book gave me a new appreciation as to how we can go about making decisions in the future. It also helped me understand a bit more about the debate whirling around the NSA and its desire to get all the information that it can. On the other hand, it reminded me that making decisions based solely on data can be dangerous, as in the case of Robert McNamara and the Vietnam War.

Lists

Awards

You May Also Like

Associated Authors

Statistics

Works
16
Members
1,175
Popularity
#21,895
Rating
½ 3.7
Reviews
33
ISBNs
67
Languages
11
Favorited
1

Charts & Graphs