Enterprises Don’t Have Big Data - They Have Bad Data

PayPal co-founder and venture capitalist Peter Thiel commonly harps on the tech community for overusing buzzwords like “cloud” and “big data.” 

Companies often tout all their terabytes and petabytes of data, and their massive teams of data scientists running huge Hadoop clusters with Apache Kafka streams that are such a competitive advantage.

The truth is, most of them suffer from one of the old adages in computing: garbage in, garbage out. Not only do most of them actually not have Big Data in terms of data complexity or volume, but most of them actually have Useless Data, and it’s probably hurting their business. According to Experian Data Quality, inaccurate data affects the bottom line of 88 percent of organizations and impacts up to 12 percent of revenues.

Some companies actually have good data and know how to use it. From mature, web-native companies like Google to engineering-based companies like Boeing, the companies listed below have successfully managed enormous amounts of data and used it to make true data-driven decisions.

Examples of Sensible Data Use

Accounting for a third of peak-time Internet traffic in the US, Netflix collects massive amounts of data about its users’ viewing habits, and can break it down by region, time of day, watching hours and a plethora of other data. This has put them in a unique position of being able to accurately predict what viewers want.

IBM has teamed up with the Weather Company to combine two very large sets of data and accurately analyze how the weather impacts business. Spanning everything from retail to insurance, they’ll be able to accurately provide real-time insights into how temperature changes impact sales or how insurance companies can save dollars by advising their clients to move their cars.
The Icahn School Of Medicine At Mount Sinai New York City-based school has tasked Jeff Hammerbacher, famously known as Facebook’s first data scientist, to lead the development of a computer that analyzes the medical information they’ve collected from the half a million patients they treat per year.

Working with the head of Mount Sinai’s Institute for Genomics and Multiscale Biology, they’re working to make predictions that could cut the cost of healthcare — from assessing a patient’s medical history and risk factors to determine how often they’ll need healthcare to allowing doctors to prescribe treatments based on risk models gathered from genomics and lab data.

Amazon has access to unprecedented insights about its users, from what books they’re reading to how often they’re restocking cotton balls. While other companies have backburner customer support, Amazon has made it a key to its business by emphasizing the importance of communication and direct relationships with their consumers. Amazon uses its wealth of data about their users to immediately provide representatives with relevant information about a customer the moment they need support, streamlining the process and solidifying their loyalties.

Whereas past work experience has often been the model for hiring new employees, Xerox found that hiring for its call centers had an entirely different basis for success. Using big data, the organization found that a potential employee’s personality was the real predictor of whether they would stay — creative people tended to stick it out, inquisitive people did not. Armed with this information, and a hire survey rather than a hiring manager, they were able to cut their employee turnover rate at all their call centers by 20 percent in six months.

Most companies don’t use data well

Enterprises have historically spent far too little time thinking about what data they should be collecting and how they should be collecting it. Instead of spear fishing, they’ve taken to trawling the data ocean, collecting untold amounts of junk without any forethought or structure. Deferring these hard decisions has resulted in data science teams in large enterprises spending the majority of their time cleaning, processing and structuring data with manual and semi-automated methods.

Building smart, usable data is what every company should strive to create

DJ Patil, the recently appointed Chief Data Scientist of the White House, summarizes the data problem well, noting that “you have to start with a very basic idea: Data is super messy, and data cleanup will always be literally 80 percent of the work. In other words, data is the problem.”

But it’s not all bad news. According to the industry research firm Wikibon, 52 percent of data tool investments are being spent on technologies for ingesting and organising data so that it can be more readily accessible and prepared for analysis. However, the key to tackling this properly isn’t just spending on more or better tools.

Applying Big Data To Your Business

To truly turn an enterprise into a data company, here are some guidelines and methods that have been performed by some of the best data companies in the world.

Start by understanding the type of data you need to analyze first — is it event data, financial data, graph data or something else? This is the most important factor in determining whether you a need to capture data at the most atomic level or in some other format.
Don’t Over-Delegate. Many businesses hand off setting up analysis to developers or IT without involving the actual business users — it’s critical that those who are actually going to be using the data are involved with understanding exactly how it is being collected and aggregated to avoid critical problems down the road.

As a corollary to don’t over-delegate, don’t let business users either give generic use cases (e.g. “we want to track lead sources”) or spec out irrelevant use cases. Every piece of data needs to fit into an analytical framework and be part of solving a problem. Appoint either a highly technical business user or business-savvy tech lead to own the final signoff here.
Make sure you understand the source and types of data. Where does your data originate? Is it accurate? If you don’t know the answers to these questions, start looking into it now.

There are many great analytical tools out there. Undertake a formal “bake-off” process once you’ve defined your key use cases for your business and end users, and evaluate against your needs versus potential cool features you may never end up using.

Building an enterprise with smart, usable data is what every company should strive to create.

TechCrunch: http://tcrn.ch/1KuFsav

« Common Cyber Threats You Need to Be Aware Of
Cyber Threats Create Business Opportunities »

CyberSecurity Jobsite
Perimeter 81

Directory of Suppliers

NordLayer

NordLayer

NordLayer is an adaptive network access security solution for modern businesses — from the world’s most trusted cybersecurity brand, Nord Security. 

CYRIN

CYRIN

CYRIN® Cyber Range. Real Tools, Real Attacks, Real Scenarios. See why leading educational institutions and companies in the U.S. have begun to adopt the CYRIN® system.

Perimeter 81 / How to Select the Right ZTNA Solution

Perimeter 81 / How to Select the Right ZTNA Solution

Gartner insights into How to Select the Right ZTNA offering. Download this FREE report for a limited time only.

Jooble

Jooble

Jooble is a job search aggregator operating in 71 countries worldwide. We simplify the job search process by displaying active job ads from major job boards and career sites across the internet.

Clayden Law

Clayden Law

Clayden Law advise global businesses that buy and sell technology products and services. We are experts in information technology, data privacy and cybersecurity law.

HackerOne

HackerOne

HackerOne was started by hackers and security leaders who are driven by a passion to make the internet safer.

ClearedJobs.Net

ClearedJobs.Net

ClearedJobs.Net is a career site and job fair company for professionals seeking careers in the defense, intelligence and cyber security communities.

Messageware

Messageware

Messageware is the market leader in securing, enhancing, and customizing Microsoft Exchange and Outlook Web App.

SISA

SISA

SISA is a payment security specialist providing payment security assurance services, training and products to over 1,000 customers across the globe.

7Safe

7Safe

7Safe has been delivering hands-on digital security training courses since 2001 and offer e a portfolio of university and industry-accredited courses.

Echoworx

Echoworx

Echoworx primary and exclusive focus is providing organizations with secure email services.

Digiserve

Digiserve

Digiserve by Telkom Indonesia is an end-to-end managed solutions provider committed to empowering enterprises in Indonesia.

Accredia

Accredia

Accredia is the national accreditation body for Italy. The directory of members provides details of organisations offering certification services for ISO 27001.

Option3

Option3

Option3 (formerly Option3Ventures - O3V) primarily seek control investments in the growing cybersecurity mid-market, seeking to build champions with the scale to bring cutting-edge products to market.

Portshift

Portshift

Portshift leverages the power of Kubernetes and Service-Mesh to deliver a single source of truth for containers and cloud-native applications security.

Veratad Technologies

Veratad Technologies

Veratad Technologies, LLC is a world class provider of online/real-time Identity Verification, Age Verification, Fraud Prevention and Compliance Solutions.

Kalima Systems

Kalima Systems

Kalima’s mission is to securely collect, transport, store and share Industrial IoT (IIoT) trusted data in real time with devices, services and mobile workers.

Cenobe Cyber Security

Cenobe Cyber Security

Cenobe provides customized solutions to keep you ahead of potential threats and ensure the security of your organization's systems and data.

HIFENCE

HIFENCE

HIFENCE delivers cybersecurity and networking services that make your company safer and more secure. That’s all we do, so you can concentrate on all the things that you do best.

SecZone

SecZone

SecZone is a Chinese enterprise with a mission to "Make It Secure." We are dedicated to driving software security innovation globally.

Bearer

Bearer

Bearer helps modern teams ship trustworthy products with the help of our code security solution built for security, privacy and engineering teams.