Hadoop

Hadoop news, analysis, how-to, opinion and video. | CIO

Why some Data Lakes are built to last

Hadoop-based data lakes can be game-changers, but too many are underperforming. Here's a checklist to make your data lake a wild success.

How an online real estate company optimized its Hadoop clusters

Every night, Trulia crunches more than a terabyte of new data and cross-references it with about 2 petabytes of existing data to deliver the most up-to-date real estate information to its users. Here's how it ensures consistent...

Cool new products from big data’s Hadoop World show

There’s a big world of big data tools and services, and many of the leading ones are on display this week at Strata + Hadoop World in San Jose. From the latest distributions of open source database technology to handy tools for...

AtScale simplifies connecting BI tools to Hadoop

The startup, which specializes in creating OLAP-like virtual cubes on Hadoop, has added a Hybrid Query Service that allows its platform to natively support MDX and SQL queries.

Review: Databricks makes big data dreams come true

Cloud-based Spark machine learning and analytics platform is an excellent, full-featured product for data scientists

Big data gets runtime specification

The ODPi Runtime Specification and test suite are designed to help organizations write Hadoop-based applications once, with the confidence that they will run on a variety of Hadoop distributions.

SAP's HANA Vora bridges divide between enterprise and Hadoop data

The SAP HANA Vora software is designed to allow companies to analyze data stored in Hadoop, enterprise systems and other distributed data sources.

MapR adds in-Hadoop document database

MapR delivers support for containers, security

With the general availability of its Converged Data Platform, MapR Technologies brings Hadoop together with Spark, Web-scale storage, NoSQL and streaming capabilities in a unified cluster designed to support next-generation big data...

Hortonworks and HP Labs join forces to boost Spark

Hortonworks is working with Hewlett Packard Labs to enhance the efficiency and scale of memory for the enterprise and to dramatically improve memory utilization.

Hortonworks release cadence balances innovation with reliable Hadoop core

The Hadoop distribution vendor will update core Apache Hadoop components once a year, while continually updating services that run on top of Hadoop.

How different SQL-on-Hadoop engines satisfy BI workloads

A new benchmark of SQL-on-Hadoop engines Impala, Spark and Hive finds they each have their own strengths and weaknesses when it comes to Business Intelligence (BI) workloads.

Walgreens CIO starts with the customer and works backward

Having already produced a successful mobile application, Walgreens CIO Abhi Dhar is implementing Hadoop to lower the cost of storing and processing the large amounts of data that app is generating.

Q&A

Apache Hadoop turns 10

On the 10-year anniversary of the birth of the Apache Hadoop project, co-creator Doug Cutting reflects on Hadoop's beginnings and its future.

The top 5 Hadoop distributions, according to Forrester

In a new report, Forrester Research’s big data analysts say that adopting Hadoop is “mandatory” for any organization that wishes to do advanced analytics and get actionable insights from its data. But which vendor is best to use?

Tableau partners for BI on Hadoop

Startup AtScale will help Tableau accelerate BI on Hadoop with improvements to performance and enterprise-grade security without the need to move, transform or sample data.

Q&A: Why Syncsort introduced the mainframe to Hadoop

In an interview with IDGE, Josh Rogers and Lonne Jaffe of Syncsort explain how they plan to transform big iron and traditional data warehouse/analytics

MapR brings down data silos with converged data platform

The Hadoop distribution specialist has announced MapR Streams, which will combine with its Hadoop distribution and NoSQL database to integrate file, database, stream processing and analytics to support new data-driven applications.

New Splice Machine RDBMS unites OLTP and OLAP

The 2.0 version of Splice Machine's relational database brings together the scalability of Hadoop and the in-memory performance of Spark.
