Using analytics and machine learning (ML) to better understand your customers has become an everyday event in any data-driven enterprise. The good news is that organizations have large data volumes at their disposal to build and train ML models. The not so good news is that data science and engineering teams are faced with a series of blockers\/challenges that prevent them from being as productive as possible. These challenges include complex data access approaches, the need to migrate and\/or reformat data before analysis can begin, and the adoption of different operating models. But the challenges don\u2019t stop there.\nOne of the biggest challenges is that data engineers, analytic users and data scientist come to their jobs with completely different points of view when it comes to data, which means they have different goals and use different tools (as shown in Figure 1). Data engineers are taking data and building pipelines to create the connection points for the other two personas. They prefer tools based on open-source technology to innovate faster while reducing lock-in to proprietary technology stacks. Analytic users live in a SQL-based world, so they prefer tools like Presto-SQL, and Apache Spark on Kubernetes for an agnostic platform that deploys any application or framework into any environment or infrastructure. Data scientists build data pipelines, but they approach it in two different ways: senior scientists prefer Jupyter notebooks, PyTorch, and Apache Spark. And citizen scientists prefer to use pre-integrated solution stacks.\n\n\nFigure 1. Data science personas, challenges, and blockers\n\n\nAnother big challenge is where the data resides. There are data centers, cloud (existing or future), and edge \u2013 all which have a set of infrastructure and services with specific access paths that can disrupt both productivity and established application and persona access patterns. \u00a0\nHow do you begin to solve for these challenges? Without getting into specific implementations, let\u2019s agree on a few key principles.\n\n\nA solution should provide a unified platform that increases productivity through a simple and secure data experience. This doesn\u2019t mean that your data has to move to a single location. But it does mean that personas can utilize a self-service app store to download the libraries, pre-configured templates, or certified ISV solutions they want to use with single-click download and deployment. \u00a0\n\n\nAutomate everything end-to-end including provisioning of tools\/libraries\/frameworks so teams can get to work quickly.\n\n\nSimplify data access through a converged file and object system, known as a data fabric, that abstracts underlying infrastructure to reduce complexity. The data fabric should support files, objects, streams, and databases; ingest and transform the data into a single, persistent data store.\n\n\nHave an open-source foundation that allows data science teams to pick up and drop their work onto any infrastructure: on premises, cloud, or edge.\n\n\nHPE Ezmeral delivers a secure, unified analytics platform that is optimized for on-premises, edge, and cloud deployments to deliver frictionless access to data. The integrated app store (Figure 2) enables one-click download and deployment of opinionated stacks and certified ISV solutions, or allows you to build or bring you own open-source tools\/stacks, all supported by HPE 24x7. \u00a0\nHPE\u2019s integrated data fabric enables direct access across hybrid\/multicloud environments through both open-source and standard interfaces. Accessing data using the native S3 API, NFS, HDFS, POSIX, or CSI reduces the need to change existing access methods for applications or users. HPE\u2019s data fabric abstracts the underlying infrastructure \u2013 this means you can access data on bare metal, cloud, on-premises, or edge to reduce complexity and create a bridge that allows traditional and modern applications and processes to securely access the same datasets on the same system.\n\n\nFigure 2. Example of the app store within HPE Ezmeral\n\n\nThe app store experience boosts the data science and engineering team\u2019s productivity by deploying native best-of-breed open-source tools, libraries, and frameworks out-of-the box, such as Apache Spark 3.x Operator on Kubernetes, Delta Lake, Hive, or Thrift. If users are using older versions of Apache Spark, HPE Ezmeral can accommodate multiple versions running concurrently. If you prefer a different toolset, utilize the built-in app workbench to build or bring your own open-source stacks.\nHPE Ezmeral Unified Analytics addresses the key pain points of data analytics. It delivers high performance, cost efficiency, and a secure and unified data experience to connect to data wherever it exists. The open-source foundation means you can move to a modern analytic platform without refactoring or moving data without accumulating any additional technical debt.\nRead the new solution brief, Modernize Data Analytics, to learn more. Or visit HPE Ezmeral software.\n____________________________________\nAbout Joann Starke\n\nJoann\u2019s domain knowledge and technical expertise have contributed to the development and marketing of cloud, analytics, and automation solutions. She holds a B.S. in marketing and computer science. Currently she is the senior marketing engineer for HPE Ezmeral Data Fabric.