Data Management

11:08 AM
Julio Gómez, Attivio

The Big Joke About Big Data

If you want your big data strategy to succeed, you must realize that it involves much more than just weblog data.

It never ceases to amaze me how frantically people rush to shoehorn their products, companies or internal IT strategies into the latest buzzword. Every large banking, investment and insurance company claims to have a big data strategy. The big joke is that these strategies are often very small.

Let me get right to the point: big data is way, way more than storing, organizing and crunching all the log files you used to throw away. Defining big data as what can be done with Hadoop or Splunk amounts to a hijacking of the term. I can't blame vendors for wanting to be synonymous with the next big thing. But it's a joke.

Sure, log files are part of big data, which is generally agreed to be all data too large or complex to be handled by traditional databases, but which emerging software tools can tame and extract value from (paraphrasing Wikipedia's Big Data page here). But log files are only a subset of big data, and they're not even the most valuable part!

Ask any marketer whether clickstream data is important to understanding their customers better and they will say "yes." Ask them whether the ability to integrate, correlate and analyze all customer communications, including social media, email, online customer chat, call center notes and open-ended survey questions, is important and they will say "heck, yes!" You see, log files help answer only the "what" questions (what are customers doing on our web site?), but don't touch the "why," which lives in the unstructured content.

Darin Stewart, a Research Director at Gartner, has just published a report titled "Big Content: The Unstructured Side of Big Data." In it, he notes that "Unstructured content represents as much as 80% of an organization's total information assets." The report goes on to state that "The true potential of big data is only realized when the source information pool is a hybrid of structured and unstructured information."

Unstructured content has long been an unsolved problem, granted. But technologies are emerging that make that goldmine of insight accessible to analytical frameworks. Big data is essentially ALL data. Pinning your career on a big data strategy that deals only with log files is risky and virtually guarantees that what you deliver will be legacy on arrival.

And that's No Joke.

About The Author: Julio Gómez is General Manager, Financial Services, for Attivio, a Boston-area software company whose core product, Active Intelligence Engine™ (AIE), is a unified information access (UIA) platform. AIE integrates all types of data and content in a universal index to deliver complete information enriched with sophisticated analytics.
