Data science is all the rage. Almost every CMO I know wants a data scientist for their very own – they are the status symbol du jour for senior executives everywhere. But, building the right data science team for your organization is not as easy as picking the right data scientist. Data science starts by asking the right questions, and the first question to ask is: What is data science?
Some people believe that data science is just a sexy name that mathletes made up to get better-paying jobs. For the sake of this writing, let’s define data science as “the analysis of data using the scientific method with the primary goal of turning information into action.”
How do they do it? Data scientists use a variety of mathematical tools to help answer questions and uncover patterns that contribute to the results, but it’s not just math. It's much, much more.
In order to turn information into action, you need a team that is proficient in the three foundational skills:
- Domain Expertise – to define the problem space
- Mathematics – for theoretical structure and problem
- Mathematics – for theoretical structure and problem
- Computer Science – to provide the environment where data is manipulated
Data science exists at the intersection of these three foundational skills; discounting or overweighting any of them will yield suboptimal results.
You know your business. In order to put data science to work, you are going to use 100 percent of your business knowledge, institutional memory and intuition to ask the right questions. Everyone wants to know how to increase sales – that’s question one. But domain experts can ask more specific questions that will yield measurable, actionable improvements, such as the following:
- Can we improve productivity in XYZ Department by increasing the usability of ABC data sets?
- Can increased access to scanner data, share of basket data, heuristic weather pattern data and parking lot density data increase our return on assets?
- Can we use our product attributes data sets to improve competitiveness?
The more specific your questions are, the more likely you are to get actionable results.
There is a lot of math in data science. The mathematicians on your data science team will be world-class problem solvers. They will be experts in statistical modeling, signal processing, probability models, pattern recognition, predictive analytics and a bunch of subspecialties that you learned in college mathematics class but have long since forgotten.
Data science becomes magical when brilliant mathematical constructs are applied to big data sets (vast amounts of data too big for humans to deal with), yielding unexpected actionable insights. The best teams develop AI, pattern-matching and machine-learning tools that generate the building blocks for predictive models. Great mathematicians are a key component to any data science department, but they cannot and do not work alone.
Data science happens inside computer systems. It cannot exist anywhere else. Having the right architecture for your data science function is as important as having the right architecture for your physical work environment.
Is your current CTO/CIO knowledgeable about the technical requirements for your data science team? Big data requires special storage, special handling and special network capabilities. The tools are different, computer “horsepower” requirements are different – in fact, almost everything you need for your data science team will need to be purpose built, rented, borrowed or partnered with.
Data Science Readiness Assessments
How should you think about getting ready for data science? There is a short list of steps you should consider:
Audit Data Assets: Assign a team or hire a consulting firm to audit your existing data sets and data-gathering systems. This will help with the creation of appropriate RFPs for potential partners, suppliers and potential acquisition targets.
Craft a Roadmap: Build a roadmap to get from where you are to having a working data science department by quantifying the best methods for identifying, obtaining and transforming data sets to make them suitable for the production of statistical evidence.
The Time Is Now!
Best-in-class companies realize the importance of analytics. The goal is a data-driven business strategy with an operating model that enables cross-functional collaboration, governance, metrics and change management. You’ll have to create methodologies to empower ongoing data scientific research. You will need to build or buy appropriate infrastructure, including analytics platforms, visualization tools and big data environments. You will find ways to manage data from 3rd-party partnerships, enforce data governance and develop best practices data munging and wrangling.
You will have the right resources:
- Business Analysts for problem definition, solution design and analytics roadmaps
- Research and Big Data Engineering for data science, experiment design and training
- Model Development for data preparation, profiling and model building and validation
- Operations for visualizations, QA, data management, maintenance and implementation
And then, you will be ready for data science.
We have a team ready to help you with your data science readiness assessment. Just shoot me an email, and I’ll be happy to work with you to help you achieve your business goals.