What Makes A Data Scientist?

Advances in technology have disrupted nearly every industry and created career opportunities that were once implausible. So it should come as no surprise that nearly half of the 25 "Best Jobs in America" according to Glassdoor are tech-related.

What may be surprising, however, is that in 2016, “data scientist” came in at the top of the list.

Simply put, data scientists are big data wranglers. They explore and analyze datasets to understand and organize the data, identify underlying patterns and trends, and develop methods for extracting and summarizing the information that can inform better decision making.

A McKinsey study predicts that by 2018 the number of data science jobs in the United States alone will exceed 490,000. However, despite demand, there will be fewer than 200,000 available data scientists to fill these positions. Globally, this demand is projected to exceed supply by more than 50 percent in the next two years.

It All Starts with Math

A career in data science begins not only with a love of mathematics, but also with a knack for applying mathematical concepts to other areas, both in academic work and in everyday life.

Traditionally, school curriculums have not emphasized many of the quantitative toolsets required for analyzing and manipulating large volumes of data, such as statistics, matrix algebra, and hands-on exercises geared toward translating these methods into numerical algorithms. While this is starting to change as more emphasis is placed on STEM education (science, technology, engineering and math), middle and high school mathematics curriculums still tend to focus primarily on preparing students for calculus.

However, other analytical toolsets such as statistics and discrete math offer critical and different ways of thinking that are key to data science.

To bring it to a consumer level, fitness trackers are a perfect example of disorganized data. When you enter information into a fitness tracker, you tend to take shortcuts.

For example, after you ride a bike or go for a run, you may input the distance you traveled; however, there is much more information that could also have been added.

  • How many minutes did you exercise?
  • Did you ride a road bike, a mountain bike or a beach cruiser?
  • Did you run on a treadmill or a trail?
  • At what resistance or pace did you ride?
  • What about your age, weight and activity level?

All of these factors help improve the data quality and inform a more complete story about your fitness and health.

When it comes to enterprise-level initiatives, data science teams tackle the challenge of identifying and developing ways to produce measurable outputs of value from data of variable quality originating from disparate sources. Decision makers want to see summary numbers presented in an informative and consumable way. In their desire for simple whole numbers, users do not always appreciate the importance of also looking at the statistical certainty around data measurements.

Every organization’s data might start “messy,” but it all holds valuable insights that can affect the bottom line. Data scientists can help organizations transform the data being collected in ways that will ultimately help achieve business objectives.
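
To make the point about statistical certainty concrete, here is a minimal Python sketch that reports a summary number together with an interval around it. The function name `mean_with_interval` and the sample distances are hypothetical and used only for illustration. The interval relies on a normal approximation, which is only rough for a sample this small.

```python
import math
import statistics


def mean_with_interval(values, confidence=0.95):
    """Return (mean, lower, upper) using a normal approximation."""
    if len(values) < 2:
        raise ValueError("need at least two measurements")
    mean = statistics.fmean(values)
    # Standard error of the mean: sample standard deviation / sqrt(n).
    std_err = statistics.stdev(values) / math.sqrt(len(values))
    # z-score for a two-sided interval at the requested confidence level.
    z = statistics.NormalDist().inv_cdf(0.5 + confidence / 2)
    return mean, mean - z * std_err, mean + z * std_err


# Hypothetical run distances in kilometres (illustrative only).
distances_km = [4.8, 5.2, 5.0, 6.1, 4.5, 5.4, 5.0]
mean, low, high = mean_with_interval(distances_km)
print(f"average: {mean:.1f} km (95% interval: {low:.1f} to {high:.1f} km)")
```

Reporting the interval next to the average shows a decision maker how far the summary number can be trusted, especially when it rests on only a handful of measurements.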

Opening the Door for Data Scientists

In a turbulent energy market, identifying efficiencies and realizing cost savings from data is critical for many energy businesses to stay afloat. But this is just one sector. Many other organizations have identified the need for a data science team, though few have so far been able to fill these roles.

To build an effective talent pipeline for data scientists, mathematics education needs to focus more on teaching quantitative skills beyond calculus prep. There must also be increased awareness at the high school and college levels of which skill sets are in demand, so that programs can be tailored accordingly.

Every year the number of opportunities for this skill set grows, and the need for data scientists at a range of companies has never been greater.

Beyond math skills, prospective data scientists need to know how to think creatively and develop context and a story for the data they are analyzing. Data scientists need to be talented with numbers, but they must also excel at solving problems using various types of data.

The art of taking a qualitative phenomenon and quantifying it in a meaningful way is a difficult challenge, largely because it is an open-ended task rather than a straightforward number-crunching process. However, everything can be modeled into a mathematical story, and the ability to look at data sets and develop strategic insights from a business mindset is what makes data scientists so valuable.
