While the results of data science are often productized, fed into other analytic systems, and incorporated into mission-critical enterprise systems and business processes, the core value of data science is innovation. The goal is to examine data and come up with new insights that can help run your business better.
The work of data scientists is also highly dependent on others, especially IT. Yet many of the ways that IT operates don’t naturally support the work of data scientists. IT craves predictability and is focused on running mission-critical systems in a stable and reliable way while enabling users to solve their own problems to the greatest extent possible. But the type of IT environment that’s appropriate for a business analyst is different from the space a data scientist needs.
This article examines how IT can create the perfect laboratory for data scientists, one that supports experimentation and creativity, so they don’t end up doing work they shouldn’t be doing. In such an environment, data scientists can go as fast and as far as they want to go without barriers, while IT provides on- and off-ramps for the data science process.
Why the IT-data science dynamic is tricky
Think of a business as a factory, where there’s a relationship between the people who create the products and those who run the factory. In many ways, IT is like those in charge of running a stable factory. Data scientists, by contrast, come up with both new ways to make the factory run better and new products for the factory to make and push out to the market. IT should provide data scientists with the raw materials and capabilities to do their jobs, and should also test their prototypes. Eventually, once the products are proven and reliable, data scientists want their work to become part of the factory, hand off oversight to IT, and not have to babysit these projects anymore.
This allows data scientists to continue to do what they do best: experiment and innovate.
IT generally follows standardized processes, methodologies, systems, and tools. IT also relies on automation, as IT can’t function using manual processes in unique environments. By contrast, for data science, each problem and each scientist is singular, and every problem requires its own set of data and tools.
Effective data science often operates comfortably in the unknown. The problems data scientists are trying to solve are generally open-ended and require adaptability: data scientists need access to all the available data so they can experiment, with no fixed time horizon, to find the best solution. This goes against the fixed time frames and predictability IT wants to operate within. IT generally manages change in an orderly, predictable way, whereas data science is agile and spontaneous, seeking to go where the data leads. In data science, new tools, techniques, algorithms, and research are constantly being incorporated into the work. For IT to properly support data science departments, it must keep up with that.
In essence, it is IT’s job to make sure the factory floor runs efficiently, whereas the data scientists’ job is to push the boundaries. There is an inherent tension between these two roles. IT and data scientists are at loggerheads when data scientists want to take risks IT doesn’t want to accept, or when IT hasn’t established a solid enough foundation for data scientists to do their work independently. In the ideal relationship, data science gets a baseline of capabilities from IT, and IT also sets limits to prevent unnecessary risks.
On-ramps and off-ramps: The optimal IT-data science relationship
So what does the optimal IT-data science relationship look like?
For starters, IT creates on-ramps that allow data scientists to do their work. This involves preparing supportive environments for data scientists. IT creates usable, accurate data products that incorporate all data, and unifies all sources of data into one or more product, people, or customer objects. Ideally, data scientists are free to create new purpose-built data sets to drive innovation. IT must provide an environment where the data science team can operate at all levels of an organization’s data stack and bring in new data when necessary.
Once data scientists get something right with their innovations, IT must provide an off-ramp from the lab so that the models, the analytics, and the data supply chains that feed them can be passed off to the IT team to run. In advanced enterprises, IT can provide the data science team with tools that produce work that is easily transferred into production. IT and data science teams must work together to establish plans for moving work into a production environment. Data scientists must be aware that they’re not experimenting in a vacuum and have to keep in mind the practical implications of how their creations can be mainstreamed and brought to market. To make this a reality, IT should help data scientists avoid common pain points in their work, such as unnecessarily onerous data prep. That means making it simple to find and prepare data (often through data catalogs) and finding ways to test and support data ops.
IT can also support data science by working collaboratively in an R&D-like fashion where the production process never stops. In such a setup, when data scientists come up with new tools, IT can start validating the tool even before the product is ready for production.
It’s not enough for IT to understand how to support the data scientist’s innovation; IT also has to have a data science production factory that can accept new algorithms, productize them, and bring them to operational maturity, with all the required resiliency, compliance, and other factors. This speeds up the iterative process by allowing data scientists to focus on creating new algorithms instead of building the infrastructure to put those algorithms to use. Additionally, IT can aid data scientists by ensuring they have the computing power necessary to build models. If data scientists can’t build effective models because they don’t have the GPUs or data they need, data science will fail to function properly.
When IT provides the on- and off-ramps that empower data scientists to do their work, a symbiotic and harmonious relationship can develop in which IT and data scientists create a thriving production cycle for the business.
To further explore data science best practices and how to adopt a framework that maximizes business productivity and accelerates time to value, check out the new IDC white paper: Industrializing Data Science with Data Analytics Factory Framework (DAF).
Interested in discussing ways to improve collaboration between your IT teams and data scientists? Contact Matt Maccaux at: email@example.com.
____________________________________
About Matt Maccaux

As Global Field CTO for HPE Ezmeral software, Matt brings deep subject-matter expertise in big data analytics and data science, machine learning, application development & modernization, and IoT, as well as cloud, virtualization, and containerization technologies.