Hortonworks is working with Hewlett Packard Labs to enhance the efficiency and scale of memory for the enterprise and to dramatically improve memory utilization. Credit: Thinkstock Hadoop distribution specialist Hortonworks has joined forces with HP Labs, HP Enterprise’s central research organization, in an effort to dramatically improve the performance of Spark workloads. [ Related: Hortonworks Boosts Support for Enterprise Needs in HDP 2.2 ] At a press conference in San Francisco Tuesday, the two companies announced a collaboration that has already born some fruit: Enhanced shuffle engine technologies. Faster sorting and in-memory computations, which has the potential to dramatically improve Spark performance. Better memory utilization. Improved performance and usage for broader scalability, which will help enable new large-scale use cases. “We’re hoping to enable the Spark community to derive insight more rapidly from much larger data sets without having to change a single line of code,” says Martin Fink, executive vice president and CTO of HP Enterprise and a member of Hortonworks’ board. “We’re very pleased to be able to work with Hortonworks to broaden the range of challenges that Spark can address.” Fink explains that HP Labs had been conducting research on the efficiency and scale of memory for the enterprise, as well as ways to enhance memory utilization for the enterprise. “Part of that research activity is we rewrote the shuffle engine from Java to C++,” he says. “We saw that we had rewritten a bunch of algorithms to make much more efficient use of memory and enabled ways that you could scale memory even more.” [ Related: Review: Spark Lights a Fire Under Big Data Processing ] In fact, certain customers that leveraged HP Labs’ work found that it increased the performance of certain workloads from 5X to 15X. “I’ve been around for a long time,” Fink says. “It’s not often that you come out with 15X performance increases on certain workloads. We knew we needed this to be part of a greater whole.” Fink notes that HP Enterprise chose to open source its research with the help of Hortonworks due to Hortonworks’ dedication to openness and collaboration. “This collaboration indicates our mutual support of and commitment to the growing Spark community and its solutions,” adds Scott Gnau, CTO of Hortonworks. “We will continue to focus on the integration of Spark into broad data architectures supported by Apache YARN as well as enhancements for performance and functionality and better access points for applications like Apache Zeppelin.” Zeppelin is an incubating Apache project that provides a Web-based notebook that enables interactive data analytics. Related content brandpost Sponsored by Freshworks When your AI chatbots mess up AI ‘hallucinations’ present significant business risks, but new types of guardrails can keep them from doing serious damage By Paul Gillin Dec 08, 2023 4 mins Generative AI brandpost Sponsored by Dell New research: How IT leaders drive business benefits by accelerating device refresh strategies Security leaders have particular concerns that older devices are more vulnerable to increasingly sophisticated cyber attacks. By Laura McEwan Dec 08, 2023 3 mins Infrastructure Management case study Toyota transforms IT service desk with gen AI To help promote insourcing and quality control, Toyota Motor North America is leveraging generative AI for HR and IT service desk requests. By Thor Olavsrud Dec 08, 2023 7 mins Employee Experience Generative AI ICT Partners feature CSM certification: Costs, requirements, and all you need to know The Certified ScrumMaster (CSM) certification sets the standard for establishing Scrum theory, developing practical applications and rules, and leading teams and stakeholders through the development process. By Moira Alexander Dec 08, 2023 8 mins Certifications Certifications Certifications Podcasts Videos Resources Events SUBSCRIBE TO OUR NEWSLETTER From our editors straight to your inbox Get started by entering your email address below. Please enter a valid email address Subscribe